LLM-based reinforcement learning audio editing model
The official Python SDK for the ElevenLabs API
Interface for OuteTTS models
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Sample code and notebooks for Generative AI on Google Cloud
StreamSpeech is a seamless model for offline speech recognition
PyTorch implementation of convolutional neural networks
TensorFlow implementation of DC-TTS: yet another text-to-speech model
Beamforming and Speech Recognition Toolkit