Qualitative data analysis software for UX research. User interview tagging, AI-supported analysis, data presentation, etc.
Updated May 26, 2024 - TypeScript
ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Bash function to ease the transcription of audio files with OpenAI's Whisper.
Synchronized translation for videos (video dubbing).
fastACI toolbox: the MATLAB toolbox for investigating auditory perception using reverse correlation.
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
A proactive version of STT engine for Commbase
Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
How to compress a large knowledge base (.mp4, .mp3, .wav) into a short, readable summary for effective knowledge transfer
Daily tracking of awesome audio papers, including music generation, zero-shot TTS, ASR, and audio generation
Live speech to text transcription.
Production First and Production Ready End-to-End Speech Recognition Toolkit
An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker commands
Voicegain Enterprise Speech-to-Text Platform (API, Portal, etc.)
Rust re-implementation of OpenFST - library for constructing, combining, optimizing, and searching weighted finite-state transducers (FSTs). A Python binding is also available.
Parser for a variety of VATSIM-related file formats
Speech-to-text server framework with next-gen Kaldi
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.