A simple, high-quality voice conversion tool focused on ease of use
Instant voice cloning by MIT and MyShell. Audio foundation model
On-device Speech-to-Intent engine powered by deep learning
Official PyTorch Implementation
Multi-lingual large voice generation model, providing inference
Spark-TTS Inference Code
PersonaPlex code
Repo of Qwen2-Audio chat & pretrained large audio language model
Offline Text To Speech synthesis for python
Offline inference engine for art, real-time voice conversations
Aider is AI pair programming in your terminal
Generate high-definition story short videos with one click using AI
A natural language interface for computers
Fully Local Manus AI. No APIs, No $200 monthly bills
An Open Source text-to-speech system built by inverting Whisper
SOTA Open Source TTS
A TTS model capable of generating ultra-realistic dialogue
Long-form streaming TTS system for multi-speaker dialogue generation
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
LLM-based Reinforcement Learning audio edit model
*VoxShare* is a simple Python-based push-to-talk multicast voice chat
Singing voice change based on whisper, lora for singing voice clone
A webui for different audio related Neural Networks
Kalliope is a framework to create your own personal assistant
Main repository of Project Alice, contains main unit source code