Open-source multi-speaker long-form text-to-speech model
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Toolkit to help you get started with Spec-Driven Development
Generate short videos with one click using AI LLM
Qwen3-TTS is an open-source series of TTS models
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Text and image to video generation: CogVideoX and CogVideo
SoTA open-source TTS
Open source driver assistance system
Instant voice cloning by MIT and MyShell. Audio foundation model
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
AI-powered video clipping and highlight generation
State-of-the-art TTS model under 25MB
EPUB to audiobook converter, optimized for Audiobookshelf
A simple native web interface that uses ChatTTS to synthesize text
Label Studio is a multi-type data labeling and annotation tool
Autonomous research from idea to paper. Chat an Idea. Get a Paper 🦞
The official Meta Llama 3 GitHub site
Open-source autonomous AI software engineer
A Domain-Fronting Relay that routes traffic though GAS
Open source healthcare AI
SOTA Open Source TTS
TensorFlow is an open source library for machine learning
UI-TARS-desktop version that can operate on your local personal device
Open source AI VTuber platform with voice chat and Live2D avatars