Popular repositories

  1. FunASR (Public)

    A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.

    Python, 3.9k stars, 446 forks

  2. FunClip (Public)

    Open-source, accurate, and easy-to-use video speech recognition and clipping tool, with LLM-based AI clipping integrated.

    Python, 2.1k stars, 191 forks

  3. 3D-Speaker (Public)

    A repository for single- and multi-modal speaker verification, speaker recognition, and speaker diarization.

    Python, 782 stars, 64 forks

  4. KAN-TTS (Public)

    KAN-TTS is a speech-synthesis training framework. Demos are available at https://modelscope.cn/models?page=1&tasks=text-to-speech

    Python, 443 stars, 71 forks

  5. former3d (Public)

    Python, 95 stars, 9 forks

  6. SpokenNLP (Public)

    A wide variety of research projects developed by the SpokenNLP team of Speech Lab, Alibaba Group.

    Python, 88 stars, 11 forks
