Robust Speech Recognition via Large-Scale Weak Supervision
Multilingual Automatic Speech Recognition with word-level timestamps
A nearly-live implementation of OpenAI's Whisper
Comprehensive Gradio WebUI for audio processing
An Open Source text-to-speech system built by inverting Whisper
A Telegram bot that integrates with OpenAI's official ChatGPT APIs
Speech-AI-Forge is a project developed around TTS generation model
Unlimited, private and free Speech-To-Text program
A Pythonic framework to simplify AI service building
A python tool that uses GPT-4, FFmpeg, and OpenCV
Refractoring ChatBot+LLM, Gpt-3.5-turbo, ChatGPT Bot/Voice Assistant
Generate blog articles from video or audio
A subtitle generator for Japanese Adult Videos.
Run GGUF models easily with a UI or API. One File. Zero Install.
Singing voice change based on whisper, lora for singing voice clone
Video automatic transcribe and translated subtitle generator
Whisper is a file-based time-series database format for Graphite