Port of OpenAI's Whisper model in C/C++
-
Updated
May 7, 2024 - C
Port of OpenAI's Whisper model in C/C++
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
🧠 Leon is your open-source personal assistant.
kaldi-asr/kaldi is the official location of the Kaldi project.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Faster Whisper transcription with CTranslate2
Speech recognition module for Python, supporting several engines and APIs, online and offline.
A PyTorch-based Speech Toolkit
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
💬 Speech recognition for your site
Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,并添加配音
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
Lingvo
Open source, local, and self-hosted Amazon Echo/Google Home competitive Voice Assistant alternative
🎙Speech recognition using the tensorflow deep learning framework, sequence-to-sequence neural networks
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
Gathers machine learning and Tensorflow deep learning models for NLP problems, 1.13 < Tensorflow < 2.0
Add a description, image, and links to the speech-to-text topic page so that developers can more easily learn about it.
To associate your repository with the speech-to-text topic, visit your repo's landing page and select "manage topics."