A TTS model capable of generating ultra-realistic dialogue
A single Gradio + React WebUI with extensions for ACE-Step
NLTK Source
Unleashing 10,000+ Word Generation from Long Context LLMs
Unified Multimodal Understanding and Generation Models
Official python implementation of UTCP. UTCP is an open standard
Provides CTP stock options and Zhongtai Securities XTP
Open source AI VTuber platform with voice chat and Live2D avatars
Offical Implementation for "Recursive Multi-Agent Systems"
Repository containing notebooks of my posts on Medium
Qwen3-ASR is an open-source series of ASR models
Making RAG Simpler with Small and Open-Sourced Language Models
A New Axis of Sparsity for Large Language Models
"Big Model" trains a visual multimodal VLM with 26M parameters
Flexible Photo Recrafting While Preserving Your Identity
Bailing is a voice dialogue robot similar to GPT-4o
Implementation of "MobileCLIP" CVPR 2024
A python tool that uses GPT-4, FFmpeg, and OpenCV
Build a large language model from 0 only with Python foundation
SOTA discrete acoustic codec models with 40/75 tokens per second
PPTAgent: Generating and Evaluating Presentations
Plain python implementations of basic machine learning algorithms
Central interface to connect your LLM's with external data
Ultra-Efficient LLMs on End Device
Advanced NLP with spaCy: A free online course