Framework for building neural networks
SOTA discrete acoustic codec models with 40/75 tokens per second
Bailing is a voice dialogue robot similar to GPT-4o
Build Vision Agents quickly with any model or video provider
An Open Source text-to-speech system built by inverting Whisper
Free, high-quality text-to-speech API endpoint to replace OpenAI
Multi-Voice and Prompt-Controlled TTS Engine
Mic speech to text with MX Cinnamon OS ISO
Chinese voice dialogue robot/smart speaker project
A web UI for different audio-related neural networks
Singing Voice Synthesis via Shallow Diffusion Mechanism
Clone a voice in 5 seconds to generate arbitrary speech in real-time
General Speech Restoration
Real-Time State-of-the-Art Speech Synthesis for TensorFlow 2
Conditional Variational Autoencoder with Adversarial Learning
Implementation of a Transformer-based neural network
Generative Adversarial Networks for Efficient and High Fidelity Speech
The open-source virtual assistant for Ubuntu-based Linux distributions
Toolkit for efficient experimentation with Speech Recognition
TensorFlow Implementation of DC-TTS: yet another text-to-speech model