Replicate OAI transcribe interface #41

FL33TW00D · 2023-11-02T03:14:30Z

To be a drop-in replacement for OAI, we need to replicate the transcribe interface:

def transcribe(
    model: "Whisper",
    audio: Union[str, np.ndarray, torch.Tensor],
    *,
    verbose: Optional[bool] = None,
    temperature: Union[float, Tuple[float, ...]] = (0.0, 0.2, 0.4, 0.6, 0.8, 1.0),
    compression_ratio_threshold: Optional[float] = 2.4,
    logprob_threshold: Optional[float] = -1.0,
    no_speech_threshold: Optional[float] = 0.6,
    condition_on_previous_text: bool = True,
    initial_prompt: Optional[str] = None,
    word_timestamps: bool = False,
    prepend_punctuations: str = "\"'“¿([{-",
    append_punctuations: str = "\"'.。,，!！?？:：”)]}、",
    **decode_options,
):

Beam sampling
Word-level timestamps
Initial prompting
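
One way to start on this might be to mirror the keyword arguments before wiring up any decoding. The sketch below is only an illustration, not the project's implementation: `TranscribeOptions`, `normalize_options`, and `requested_features` are hypothetical names, and it assumes beam decoding is requested through a `beam_size` entry in `**decode_options`, as in the OAI decoding options. It copies the defaults from the signature above, turns a scalar `temperature` into a tuple, and reports which of the three features listed above a given call would need.

```python
from dataclasses import dataclass, field, fields
from typing import Any, Dict, List, Optional, Tuple


@dataclass(frozen=True)
class TranscribeOptions:
    """Hypothetical container; names and defaults copied from the signature above."""

    verbose: Optional[bool] = None
    temperature: Tuple[float, ...] = (0.0, 0.2, 0.4, 0.6, 0.8, 1.0)
    compression_ratio_threshold: Optional[float] = 2.4
    logprob_threshold: Optional[float] = -1.0
    no_speech_threshold: Optional[float] = 0.6
    condition_on_previous_text: bool = True
    initial_prompt: Optional[str] = None
    word_timestamps: bool = False
    prepend_punctuations: str = "\"'“¿([{-"
    append_punctuations: str = "\"'.。,，!！?？:：”)]}、"
    # Anything not named above is kept here, like **decode_options.
    decode_options: Dict[str, Any] = field(default_factory=dict)


def normalize_options(**kwargs: Any) -> TranscribeOptions:
    """Split keyword arguments into named options and leftover decode options."""
    known = {f.name for f in fields(TranscribeOptions)} - {"decode_options"}
    named = {k: v for k, v in kwargs.items() if k in known}
    extra = {k: v for k, v in kwargs.items() if k not in known}

    # The interface accepts a single float or a tuple of fallback temperatures.
    if "temperature" in named:
        temperature = named["temperature"]
        if isinstance(temperature, (int, float)):
            named["temperature"] = (float(temperature),)
        else:
            named["temperature"] = tuple(float(t) for t in temperature)

    return TranscribeOptions(**named, decode_options=extra)


def requested_features(opts: TranscribeOptions) -> List[str]:
    """List which of the three features from this issue a call relies on."""
    features = []
    # Assumption: beam decoding is selected via a `beam_size` decode option.
    if opts.decode_options.get("beam_size") is not None:
        features.append("beam sampling")
    if opts.word_timestamps:
        features.append("word-level timestamps")
    if opts.initial_prompt is not None:
        features.append("initial prompting")
    return features


opts = normalize_options(temperature=0.0, word_timestamps=True, beam_size=5)
print(opts.temperature)          # (0.0,)
print(requested_features(opts))  # ['beam sampling', 'word-level timestamps']
```

A check like `requested_features` could let the replacement raise a clear "not supported yet" error for calls that depend on a missing feature, rather than silently ignoring the argument.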

FL33TW00D added the "enhancement" (New feature or request) label on Nov 2, 2023
