State-of-the-art discrete acoustic codec models with 40 or 75 tokens per second
Open-source text-to-speech tool that supports extra-long text
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Automatically translates the text of a video based on a subtitle file
Cross-platform AI language practice app
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model