SOTA Open Source TTS
Code for openai.fm, a demo for the OpenAI Speech API
Cross-platform software for text translation and recognition
A nearly-live implementation of OpenAI's Whisper
A text-to-speech, speech-to-text and speech-to-speech library
A small clipboard reader
Speech to Text to Speech; sends text as OSC messages
Video translation and dubbing tool powered by LLMs
Amica is an open source interface for interactive communication
A TTS that fits in your CPU (and pocket)
A single Gradio + React WebUI with extensions for ACE-Step
Generate audiobooks from EPUBs, PDFs and text with captions
Generate audiobooks from e-books
High-Quality Voice Cloning TTS for 600+ Languages
Towards Human-Sounding Speech
A simple native web interface that uses ChatTTS to synthesize text
Industrial-level controllable zero-shot text-to-speech system
Speech Note Linux app. Note taking, reading and translating
The deep learning toolkit for speech-to-text
EPUB to audiobook converter, optimized for Audiobookshelf
The official Python SDK for the ElevenLabs API
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Offline inference engine for art, real-time voice conversations
Automatically translates the text of a video based on a subtitle file
Multi-lingual large voice generation model, providing inference