⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
-
Updated
May 19, 2024 - Python
⚡ TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subwords
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
🧠 Leon is your open-source personal assistant.
Implementation of keyword spotting or wake up word
REBORN: Reinforcement-Learned Boundary Segmentation with Iterative Training for Unsupervised ASR
Multimodal Computer Vision application leveraging object detections, gesture recognition and speech to text, in order to help user ask questions about their environment.
MuskanAi is a personal Digital Assistant which is capable of performing all Automation task whether it is Controlling your Devices, Browsing the Internet and Emotional Understanding..
Achieve your goals and keep your data private with Lotti. This life tracking app is designed to help you stay motivated and on track, all while keeping your personal information safe and secure. Now with on-device speech recognition.
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
Speech-to-text server framework with next-gen Kaldi
The ChatGPT Voice Assistant uses a Raspberry Pi (or desktop) to enable spoken conversation with OpenAI large language models. This implementation listens to speech, processes the conversation through the OpenAI service, and responds back. Like Apple Siri, Amazon Alex, Google Nest Home, Mi XiaoAi etc.
Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper ... fast!
Port of OpenAI's Whisper model in C/C++
Whisper command line client compatible with original OpenAI client based on CTranslate2.
ChatGPT at home! Basically a better Google Nest Hub or Amazon Alexa home assistant. Built on the Raspberry Pi using the OpenAI API.
Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.
A voice recorder and recording transmitter for Commbase
A voice recorder and recording transmitter for Commbase
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
A reactive and remote-ready version of STT engine for Commbase
Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.
To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."