asr

Here are 1,018 public repositories matching this topic...

swapnil233 / qualsearch-nextjs

Qualitative data analysis software for UX research. User interview tagging, AI-supported analysis, data presentation, etc.

ux transcription asr ux-testing ux-research caqdas diarization ux-analytics deepgram thematic-analysis automatic-speaker-recognition

Updated May 26, 2024
TypeScript

DmitryRyumin / ICASSP-2023-24-Papers

Star

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Updated May 26, 2024
Python

NVIDIA / NeMo

Star

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr multimodal speech-translation large-language-models speaker-diariazation generative-ai

Updated May 25, 2024
Python

MooersLab / bash-whisper-transcription

Star

Bash function to ease the transcription of audio files with OpenAI's whisper.

audio bash automation automatic-speech-recognition speech-to-text beginner-friendly stt whisper automate-the-boring-stuff asr bash-function audio-messages audio-file-trancription

Updated May 25, 2024
Python

R3gm / SoniTranslate

Star

Synchronized Translation for Videos. Video dubbing

text-to-speech translation tts speech-to-text stt audio-processing asr document-translator dubbing diarization automatic-dubbing subtitle-to-speech translate-audio translate-video video-dubbing

Updated May 25, 2024
Python

aosses-tue / fastACI

Star

fastACI toolbox: the MATLAB toolbox for investigating auditory perception using reverse correlation.

aci perception psychophysics asr audition revcorr

Updated May 25, 2024
MATLAB

MahmoudAshraf97 / whisper-diarization

Star

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

speech speech-recognition speech-to-text whisper asr speaker-diarization

Updated May 25, 2024
Jupyter Notebook

mydroidandi / commbase-stt-whisper-proactive-p

Star

A proactive version of STT engine for Commbase

python engine speech-recognition automatic-speech-recognition speech-to-text stt asr commbase libcommbase commbase-stt-whisper-p commbase-stt-vosk-p

Updated May 25, 2024
Python

k2-fsa / sherpa-onnx

Star

Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift

android windows macos linux raspberry-pi ios text-to-speech csharp cpp dotnet speech-to-text aarch64 mfc risc-v asr arm32 onnx vits openkylin

Updated May 25, 2024
C++

unnumsykar / knowledge-transfer-GenAI

Star

how to compress large knowledge base (.mp4, .mp3, .wav) and transfer it into readable, short, summarized form for effective knowledge transfer

asr gpt-4 genai-usecase

Updated May 24, 2024

metame-ai / awesome-audio-plaza

Star

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

awesome tts music-generation asr audio-generation zero-shot-tts awesome-music-generation

Updated May 24, 2024

EricApgar / live-speech-to-text

Star

Live speech to text transcription.

raspberry-pi offline automatic-speech-recognition asr hugging-face

Updated May 24, 2024
Python

deepgram-devs / deepgram-conversational-demo

Star

Deepgram Conversational AI demo

react nextjs tts stt asr deepgram vercel

Updated May 24, 2024
TypeScript

wenet-e2e / wenet

Star

Production First and Production Ready End-to-End Speech Recognition Toolkit

pytorch transformer speech-recognition automatic-speech-recognition production-ready whisper asr conformer e2e-models

Updated May 24, 2024
Python

flozi00 / atra

Star

An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker commands

chatbot speech transformers inference speech-recognition asr llm stable-diffusion

Updated May 23, 2024
Jupyter Notebook

voicegain / platform

Star

Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)

deep-neural-networks ivr speech-to-text rtc transcription asr mrcp

Updated May 23, 2024
HTML

Garvys / rustfst

Star

Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.

Updated May 23, 2024
Rust

blip-radar / vatsim-parser

Star

Parser for a variety of VATSIM-related file formats

vatsim euroscope asr sct topsky-plugin

Updated May 23, 2024
Rust

k2-fsa / sherpa

Star

Speech-to-text server framework with next-gen Kaldi

python cpp websocket pytorch speech-recognition transducer asr ctc end-to-end-asr

Updated May 23, 2024
C++

PaddlePaddle / PaddleSpeech

Star

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated May 23, 2024
Python

Improve this page

Add a description, image, and links to the asr topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the asr topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

asr

Here are 1,018 public repositories matching this topic...

swapnil233 / qualsearch-nextjs

DmitryRyumin / ICASSP-2023-24-Papers

NVIDIA / NeMo

MooersLab / bash-whisper-transcription

R3gm / SoniTranslate

aosses-tue / fastACI

MahmoudAshraf97 / whisper-diarization

mydroidandi / commbase-stt-whisper-proactive-p

k2-fsa / sherpa-onnx

unnumsykar / knowledge-transfer-GenAI

metame-ai / awesome-audio-plaza

EricApgar / live-speech-to-text

deepgram-devs / deepgram-conversational-demo

wenet-e2e / wenet

flozi00 / atra

voicegain / platform

Garvys / rustfst

blip-radar / vatsim-parser

k2-fsa / sherpa

PaddlePaddle / PaddleSpeech

Improve this page

Add this topic to your repo