Core ML tools contain supporting tools for Core ML model conversion
1 min voice data can also be used to train a good TTS model
Advancing Open-source World Models
State-of-the-art TTS model under 25MB
SoTA open-source TTS
Qwen3-TTS is an open-source series of TTS models
A high-throughput and memory-efficient inference and serving engine
Awesome multilingual OCR toolkits based on PaddlePaddle
Python inference and LoRA trainer package for the LTX-2 audio–video
Build AI-powered semantic search applications
The official gpt4free repository
A lightweight audio-to-MIDI converter with pitch bend detection
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Interact with your documents using the power of GPT
Open-source multi-speaker long-form text-to-speech model
gpt-4o for windows, macos and linux
High-quality multi-lingual text-to-speech library by MyShell.ai
Interface for OuteTTS models
Open-Source Financial Large Language Models
A Python wrapper you can't refuse
Instant voice cloning by MIT and MyShell. Audio foundation model
Machine learning in Python
Fast stable diffusion on CPU and AI PC
No fortress, purely open ground. OpenManus is Coming
A Family of Open Sourced Music Foundation Models