A sound cloning tool with a web interface, using your voice or any sound to record audio
Updated Mar 8, 2024 - Python
A high-quality speech analysis, manipulation and synthesis system
Praat: Doing Phonetics By Computer
General Speech Restoration
This repo summarizes the tutorials, datasets, papers, code and tools for the speech separation and speaker extraction tasks. Pull requests are welcome.
My-Voice Analysis is a Python library for the analysis of voice (simultaneous speech, high entropy) without the need of a transcription. It breaks utterances and detects syllable boundaries, fundamental frequency contours, and formants.
feature extraction from speech signals
A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to those of native speech.
Application of Connectionist Temporal Classification (CTC) for Speech Recognition (TensorFlow 1.0 but compatible with 2.0).
A vocoder framework that has been widely used in the research community since 1999.
INTERSPEECH 2023 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end) to speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
Script to calculate SNR and SDR using Python
Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.
Pitch detection and pitch tracking, voiced/unvoiced detection (VAD)
Toolkit to assess speech impairments in patients with neurological disorders
MATLAB real-time/interactive speech tools. This series is obsolete. SP3ARK is (or will be) the up-to-date series.
Predicting emotions from speech audio samples in American English, German and British English using Support Vector Machine, K-Nearest Neighbor, Random Forest and Recurrent Neural Network models. Analyzing the performance of each model on the dataset.