Code and models for ICML 2024 paper, NExT-GPT
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Easy-to-use and powerful NLP library with Awesome model zoo
Open source machine learning framework to automate text conversations
Synchronized Translation for Videos
Official Python inference and LoRA trainer package
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Open Source Document Management System for Digital Archives
Generate music based on natural language prompts using LLMs
Unified web UI for training and running open models locally
The open big data serving engine
Large-language-model & vision-language-model based on Linear Attention
Knowledge Graph Generation from Any Text
The python library for real-time communication
Like the macOS say command, but with a modern voice
Moonshot's most powerful AI model
Open-source multi-speaker long-form text-to-speech model
A Multi-Modal World Model for Reconstructing, Generating, Simulation
The most powerful local music generation model
AI suite powered by state-of-the-art models and providing advanced AI
A Web UI for easy subtitle using whisper model
A sound cloning tool with a web interface, using your voice
End-to-end speech processing toolkit
Generate matching and non matching strings based on regex patterns
A high-quality PDF to Markdown tool based on large language model