Replicate OAI transcribe interface #41

Open
1 of 3 tasks
FL33TW00D opened this issue Nov 2, 2023 · 0 comments
Labels
enhancement New feature or request

Comments

FL33TW00D (Owner) commented Nov 2, 2023

To be a drop-in replacement for OAI Whisper, we need to replicate its transcribe interface:

def transcribe(
    model: "Whisper",
    audio: Union[str, np.ndarray, torch.Tensor],
    *,
    verbose: Optional[bool] = None,
    temperature: Union[float, Tuple[float, ...]] = (0.0, 0.2, 0.4, 0.6, 0.8, 1.0),
    compression_ratio_threshold: Optional[float] = 2.4,
    logprob_threshold: Optional[float] = -1.0,
    no_speech_threshold: Optional[float] = 0.6,
    condition_on_previous_text: bool = True,
    initial_prompt: Optional[str] = None,
    word_timestamps: bool = False,
    prepend_punctuations: str = "\"'“¿([{-",
    append_punctuations: str = "\"'.。,,!!??::”)]}、",
    **decode_options,
):
  • Beam sampling
  • Word level timestamps
  • Initial prompting
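
For anyone picking this up, here is a rough sketch of what the tuple-valued temperature default implies. As far as I understand the OAI behaviour, the temperatures are tried in order, and a decode is retried at the next temperature when the text is too repetitive (compression ratio above compression_ratio_threshold) or too unlikely (average log-probability below logprob_threshold). The names DecodeResult, decode_fn, and decode_with_fallback are hypothetical and exist only for this sketch; they are not part of this project or of OAI's API.

```python
import zlib
from dataclasses import dataclass
from typing import Callable, Optional, Tuple, Union


@dataclass
class DecodeResult:
    # Hypothetical container; the field names are assumptions for this sketch.
    text: str
    avg_logprob: float
    temperature: float


def compression_ratio(text: str) -> float:
    """Ratio of raw to zlib-compressed size; highly repetitive text scores high."""
    data = text.encode("utf-8")
    return len(data) / len(zlib.compress(data))


def decode_with_fallback(
    decode_fn: Callable[[float], DecodeResult],
    temperature: Union[float, Tuple[float, ...]] = (0.0, 0.2, 0.4, 0.6, 0.8, 1.0),
    compression_ratio_threshold: Optional[float] = 2.4,
    logprob_threshold: Optional[float] = -1.0,
) -> DecodeResult:
    """Try each temperature in order and return the first acceptable result.

    decode_fn is a stand-in for one decoding pass over a segment at the
    given temperature. If every attempt fails a check, the last attempt
    is returned.
    """
    # Accept a single float as well as a tuple, matching the signature above.
    if isinstance(temperature, (int, float)):
        temperatures: Tuple[float, ...] = (float(temperature),)
    else:
        temperatures = tuple(temperature)
    if not temperatures:
        raise ValueError("at least one temperature is required")

    result: Optional[DecodeResult] = None
    for t in temperatures:
        result = decode_fn(t)
        too_repetitive = (
            compression_ratio_threshold is not None
            and compression_ratio(result.text) > compression_ratio_threshold
        )
        too_unlikely = (
            logprob_threshold is not None
            and result.avg_logprob < logprob_threshold
        )
        if not (too_repetitive or too_unlikely):
            break  # this attempt passed both checks
    assert result is not None
    return result
```

Passing None for either threshold disables that check, which is why both are typed Optional in the signature. The sketch leaves out no_speech_threshold, condition_on_previous_text, and the three unfinished tasks listed above.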
FL33TW00D added the enhancement label Nov 2, 2023
Projects
Status: Todo