voice-activity-detection

Star

Here are 132 public repositories matching this topic...

thurti / vad-audio-worklet

Sponsor

Star

Voice Activity Detection (VAD) AudioWorklet

speech vad voice-activity-detection audioworklet audioworkletprocessor

Updated Jun 10, 2024
JavaScript

modelscope / FunASR

Star

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

Updated Jun 10, 2024
Python

gtreshchev / RuntimeAudioImporter

Star

Runtime Audio Importer plugin for Unreal Engine. Importing audio of various formats at runtime.

audio plugin mp3 audio-files audio-player mp3-player vad audio-formats unreal-engine ue4 blueprints audio-converter unreal-engine-4 voice-activity-detection ue4-plugin bink ue5 unreal-engine-5 ue5-plugin

Updated Jun 9, 2024
C++

ina-foss / InaGVAD

Star

Voice activity detection and speaker gender segmentation audiovisual corpus

radio benchmark corpus tv dataset gender audio-segmentation voice-activity-detection gender-prediction speech-dataset gender-bias speech-activity-detection speaker-gender speech-corpus audio-dataset audiovisual-dataset acoustic-diversity gender-representation

Updated Jun 6, 2024
Jupyter Notebook

jim-schwoebel / voice_datasets

Star

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

data voice voice-commands dataset voice-recognition noise voice-chat datasets voice-control voice-conversion voice-assistant voice-activity-detection voice-synthesis audio-datasets voice-computing voice-dataset voice-datasets audio-dataset

Updated Jun 6, 2024

mgonzs13 / whisper_ros

Star

silero-vad + whisper.cpp (speech-to-text) for ROS 2

speech-recognition vad speech-to-text ros2 voice-activity-detection whisper-cpp ggml

Updated Jun 5, 2024
C++

Yifei-ZHAO96 / Tr-VAD

Star

Tr-VAD: An Efficient Transformer based Voice Activity Detection Model

vad voice-activity-detection

Updated Jun 4, 2024
Python

pyannote / pyannote-audio

Star

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

Updated Jun 4, 2024
Jupyter Notebook

kristofferv98 / SemanthaVoiceAssistant

Star

A comprehensive AI companion leveraging advanced semantic analysis, sentiment detection, and voice processing to provide personalized and context-aware interactions using Autogen, semantic-router, and VoiceProcessingToolkit.

Updated Jun 2, 2024
Python

juanmc2005 / diart

Star

A python package to build AI-powered real-time audio applications

real-time deep-learning transcription speaker-diarization streaming-audio voice-activity-detection speaker-embedding

Updated Jun 1, 2024
Python

snakers4 / silero-vad

Star

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-commands pytorch voice-recognition voice-control voice-detection voice-activity-detection onnx

Updated May 30, 2024
Python

nianlonggu / WhisperSeg

Star

Code for ICASSP 2024 paper WhisperSeg: Positive Transfer of the Whisper Speech Transformer to Human and Animal Voice Activity Detection

transformer whisper audio-segmentation voice-activity-detection icassp2024 animal-sound-detection whisperseg

Updated May 27, 2024
Python

IntendedConsequence / vadc

Star

Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech

pytorch vad voice-activity-detection onnxruntime tinygrad silero-vad

Updated May 23, 2024
C++

zhenghuatan / rVADfast

Star

This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

voice-activity-detection

Updated May 21, 2024
Python

Speech-Interaction-Technology-Aalto-U / itsp

Star

Introduction to Speech Processing

speaker-recognition speech-processing speech-analysis voice-activity-detection speech-enhancement speech-modelling speech-coding speech-quality-evaluation

Updated May 11, 2024
Jupyter Notebook

baxtree / subaligner

Star

Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/

Updated May 10, 2024
Python

duj12 / ASR-2Pass

Star

ASR 2Pass onnxruntime and websocket server, based on FunASR(https://github.com/alibaba-damo-academy/FunASR).

websocket speech-recognition inverse-text-normalization voice-activity-detection onnxruntime punctuation-restoration streaming-speech-to-text

Updated May 9, 2024
HTML

noisetorch / NoiseTorch

Star

Real-time microphone noise suppression on Linux.

linux voice pulseaudio hacktoberfest noise-reduction voice-activity-detection voice-activated noise-suppression hacktoberfest2023

Updated Apr 28, 2024
Go

HolgerBovbjerg / SSL-PVAD

Star

A repository for code used to produce the results the ICASSP 2024 paper: "SELF-SUPERVISED PRETRAINING FOR ROBUST PERSONALIZED VOICE ACTIVITY DETECTION IN ADVERSE CONDITIONS"

speech-processing voice-activity-detection self-supervised-learning personalized-machine-learning

Updated Apr 18, 2024
Python

Picovoice / cobra

Star

On-device voice activity detection (VAD) powered by deep learning

speech-recognition vad voice-activity-detection on-device voice-activity voice-activity-detector

Updated Apr 8, 2024
Python

Improve this page

Add a description, image, and links to the voice-activity-detection topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the voice-activity-detection topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

voice-activity-detection

Here are 132 public repositories matching this topic...

thurti / vad-audio-worklet

modelscope / FunASR

gtreshchev / RuntimeAudioImporter

ina-foss / InaGVAD

jim-schwoebel / voice_datasets

mgonzs13 / whisper_ros

Yifei-ZHAO96 / Tr-VAD

pyannote / pyannote-audio

kristofferv98 / SemanthaVoiceAssistant

juanmc2005 / diart

snakers4 / silero-vad

nianlonggu / WhisperSeg

IntendedConsequence / vadc

zhenghuatan / rVADfast

Speech-Interaction-Technology-Aalto-U / itsp

baxtree / subaligner

duj12 / ASR-2Pass

noisetorch / NoiseTorch

HolgerBovbjerg / SSL-PVAD

Picovoice / cobra

Improve this page

Add this topic to your repo