World's first open-source, agentic video production system
A TTS that fits in your CPU (and pocket)
Generating Immersive, Explorable, and Interactive 3D Worlds
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
A sound cloning tool with a web interface, using your voice
Voice Recognition to Text Tool
Framework for building AI-powered interactive digital humans and agent
Open-source multi-speaker long-form text-to-speech model
Qwen-Image is a powerful image generation foundation model
Open source personal AI Assistant for Linux, Windows and Mac
General-purpose image editing model that delivers high-fidelity
An Open Source text-to-speech system built by inverting Whisper
Easy-to-use and powerful NLP library with Awesome model zoo
Han Language Processing
Automated translation solution for visual novels
Open Source Document Management System for Digital Archives
Unified web UI for training and running open models locally
Knowledge Graph Generation from Any Text
The simplest, fastest repository for training/finetuning models
Spark-TTS Inference Code
Multi-lingual large voice generation model, providing inference
Stable Diffusion web UI
AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
A full spaCy pipeline and models for scientific/biomedical documents