A nearly-live implementation of OpenAI's Whisper.
-
Updated
May 24, 2024 - Python
Whisper is an autoregressive language model developed by OpenAI. It is trained on a large corpus of text using a transformer architecture and is capable of generating high-quality natural language text. Whisper can be used for tasks such as language modeling, text completion, and text generation. It has shown impressive performance on various benchmarks and has been released by OpenAI to encourage research in the field of language modeling. Whisper is not yet available for public use, but it has the potential to transform the field of natural language processing and generate new opportunities for language-based applications.
A nearly-live implementation of OpenAI's Whisper.
A real-time transcription and translation tool implemented in Python based on the fast-whisper library.
An AI-powered Virtual YouTuber (Vtuber) utilizing Google's Gemini language model to create engaging, personalized, and context-aware interactions. This project explores the potential of AI in human-computer interaction and virtual content creation.
a self-hosted webui for 30+ generative ai
(Windows/Linux) Local WebUI with neural network models (LLM, Stable Diffusion, AudioCraft, AudioLDM2, TTS, Bark, Whisper, Demucs, LibreTranslate, ZeroScope2, TripoSR, Shap-E, GLIGEN, Wav2Lip, Roop, Rembg, CodeFormer, Moondream 2) on python (In Gradio interface)
Transcribe is OpenAI's chatGPT based real time transcription, conversation, Language learning platform. It provides live transcripts from microphone and speaker. It generates a suggested conversation response using OpenAI's GPT API. It will read out the responses, simulating a real live conversation in English or another language.
Faster Whisper transcription with CTranslate2
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with any open-source language models, speech recognition models, and multimodal models, whether in the cloud, on-premises, or even on your laptop.
视频转图文 AI 跨平台客户端(win mac linux) electron vite vue3 sqlite3 naive-ui
Swift native on-device speech recognition with Whisper for Apple Silicon
Production First and Production Ready End-to-End Speech Recognition Toolkit
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
A stream-translator fork with VAD based audio slicing & GPT / Gemini translation.
OSINT Platform - Provides image analysis, digital footprints, video transcription and more. Retrieval Augmented Generation (RAG) capable platform
这是一个全自动(音频)视频翻译项目。利用Whisper识别声音,AI大模型翻译字幕,最后合并字幕视频,生成翻译后的视频。
The simplest way to serve AI/ML models in production
Transcribe on your own!
Your personal voice assistant based on OpenAI ChatGPT.
Created by OpenAI
Released August 2021
Latest release 6 months ago