State-of-the-art TTS model under 25MB
Industrial-level controllable zero-shot text-to-speech system
Open-source multi-speaker long-form text-to-speech model
Open-source framework for intelligent speech interaction
LLM-based Reinforcement Learning audio edit model
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Dia-1.6B generates lifelike English dialogue and vocal expressions