VITS-based Voice Conversion focused on simplicity, quality and performance.
-
Updated
May 24, 2024 - Python
VITS-based Voice Conversion focused on simplicity, quality and performance.
Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
An AI-powered Virtual YouTuber (Vtuber) utilizing Google's Gemini language model to create engaging, personalized, and context-aware interactions. This project explores the potential of AI in human-computer interaction and virtual content creation.
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
AI Vtuber是一个由 【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】 驱动的虚拟主播【Live2D/UE/xuniren】,可以在 【Bilibili/抖音/快手/微信视频号/拼多多/斗鱼/YouTube/twitch/TikTok】 直播中与观众实时互动 或 直接在本地进行聊天。它使用TTS技术【edge-tts/VITS/elevenlabs/bark/bert-vits2/睿声】生成回答并可以选择【so-vits-svc/DDSP-SVC】变声;指令协同SD画图。
so-vits-svc fork with realtime support, improved interface and more features.
一个使用C++编写的音频处理软件
RVC CLI enables seamless interaction with Retrieval-based Voice Conversion through commands or HTTP requests.
So-VITS-SVC 本地部署/训练/推理/使用帮助文档 So-VITS-SVC Local Deployment/Training/Inference/Usage Help Document
Easily train a good VC model with voice data <= 10 mins!
A simple VITS HTTP API, developed by extending Moegoe with additional features.
Persian text-to-speech streamlit interface
Core Engine of Singing Voice Conversion & Singing Voice Clone
It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool. 强大的TTS多语言(97种语言)混合文本内容自动分词工具。
pingo智能GPT演示平台
Let your GNOME desktop speak to you. Reads your desktop notifications out-loud with human-like voice using Piper.
Add a description, image, and links to the vits topic page so that developers can more easily learn about it.
To associate your repository with the vits topic, visit your repo's landing page and select "manage topics."