speech-recognition

Multimodal Computer Vision application leveraging object detections, gesture recognition and speech to text, in order to help user ask questions about their environment.

computer-vision speech-recognition object-detection gesture-recognition multimodal multimodal-deep-learning

Updated May 19, 2024
Python

4darsh-Dev / MuskanAi

Sponsor

Star

MuskanAi is a personal Digital Assistant which is capable of performing all Automation task whether it is Controlling your Devices, Browsing the Internet and Emotional Understanding..

python machine-learning natural-language-processing django web-development deep-learning django-rest-framework speech-recognition trending-repositories html-css-javascript natural-language-understanding digital-assistant emotional-analysis pytorch-nlp aritificalintelligence

Updated May 19, 2024
Jupyter Notebook

Achieve your goals and keep your data private with Lotti. This life tracking app is designed to help you stay motivated and on track, all while keeping your personal information safe and secure. Now with on-device speech recognition.

windows macos ios journal health speech-recognition time-tracker speech-to-text android-app flutter linux-app fitness-app

Updated May 19, 2024
Dart

Chenyme / Chenyme-AAVT

Star

这是一个全自动（音频）视频翻译项目。利用Whisper识别声音，AI大模型翻译字幕，最后合并字幕视频，生成翻译后的视频。

speech-recognition whisper video-translation gpt-4 faster-whisper gpt-4o

Updated May 19, 2024
Python

k2-fsa / sherpa

Star

Speech-to-text server framework with next-gen Kaldi

python cpp websocket pytorch speech-recognition transducer asr ctc end-to-end-asr

Updated May 19, 2024
C++

jackwuwei / gptspeaker

Star

The ChatGPT Voice Assistant uses a Raspberry Pi (or desktop) to enable spoken conversation with OpenAI large language models. This implementation listens to speech, processes the conversation through the OpenAI service, and responds back. Like Apple Siri, Amazon Alex, Google Nest Home, Mi XiaoAi etc.

raspberry-pi ai smarthome chatbot tts speech-recognition speech-to-text voice-assistant chatgpt

Updated May 19, 2024
Python

th-schmidt / whisply

Star

Transcribe, translate, diarize, annotate and subtitle video (and audio) with Whisper ... fast!

subtitles speech-recognition automatic-speech-recognition speech-to-text whisper-ai

Updated May 19, 2024
Python

ggerganov / whisper.cpp

Sponsor

Star

Port of OpenAI's Whisper model in C/C++

inference transformer speech-recognition openai speech-to-text whisper

Updated May 19, 2024
C

Softcatala / whisper-ctranslate2

Star

Whisper command line client compatible with original OpenAI client based on CTranslate2.

speech-recognition speech-to-text whisper openai- openai-whisper

Updated May 19, 2024
Python

judahpaul16 / gpt-home

Star

ChatGPT at home! Basically a better Google Nest Hub or Amazon Alexa home assistant. Built on the Raspberry Pi using the OpenAI API.

Updated May 19, 2024
Python

kurianbenoy / Indic-Subtitler

Star

Open source subtitling platform 💻 for transcribing and translating videos/audios in Indic languages.

deep-learning nextjs transformers inference webapp speech-recognition openai speech-to-text quantization whisper asr fastapi faster-whisper whisperx vegam-whisper

Updated May 19, 2024
Jupyter Notebook

mydroidandi / commbase-recorder-transmitter-s

Star

A voice recorder and recording transmitter for Commbase

android shell ios-app dash speech-recognition smartphone-interaction speech-to-text stt iphone-app smartphone-app voice-recorder commbase commbase-stt-whisper-reactive-p

Updated May 19, 2024
Shell

mydroidandi / commbase-recorder-transmitter-b

Star

A voice recorder and recording transmitter for Commbase

android bash ios-app speech-recognition smartphone-interaction speech-to-text stt iphone-app smartphone-app voice-recorder commbase commbase-stt-whisper-reactive-p

Updated May 19, 2024
Shell

DmitryRyumin / ICASSP-2023-24-Papers

Star

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated May 19, 2024
Python

mydroidandi / commbase-stt-whisper-reactive-p

Star

A reactive and remote-ready version of STT engine for Commbase

android python ssh raspberry-pi remote-control engine assistant speech-recognition recorder automatic-speech-recognition stt assistive-technology remote-access-tool secure-shell openai-whisper commbase

Updated May 19, 2024
Python

Improve this page

Add a description, image, and links to the speech-recognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the speech-recognition topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

speech-recognition

Here are 4,608 public repositories matching this topic...

TensorSpeech / TensorFlowASR

huggingface / transformers

leon-ai / leon

z430 / keyword-spotting

andybi7676 / reborn-uasr

darmangerd / vubot

4darsh-Dev / MuskanAi

matthiasn / lotti

Chenyme / Chenyme-AAVT

k2-fsa / sherpa

jackwuwei / gptspeaker

th-schmidt / whisply

ggerganov / whisper.cpp

Softcatala / whisper-ctranslate2

judahpaul16 / gpt-home

kurianbenoy / Indic-Subtitler

mydroidandi / commbase-recorder-transmitter-s

mydroidandi / commbase-recorder-transmitter-b

DmitryRyumin / ICASSP-2023-24-Papers

mydroidandi / commbase-stt-whisper-reactive-p

Improve this page

Add this topic to your repo