Alibaba Damo Academy

FunASR Public

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.

Python 3.9k 446

FunClip Public

Open-source, accurate, and easy-to-use video speech recognition & clipping tool, with LLM-based AI clipping integrated.

Python 2.1k 191

3D-Speaker Public

A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization

Python 782 64

KAN-TTS Public

KAN-TTS is a speech-synthesis training framework. Try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

Python 443 71

former3d Public

Python 95 9

SpokenNLP Public

A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.

Python 88 11
