AI-powered code assistant for Vim. OpenAI and ChatGPT plugin for Vim
A 0.1B Omni model trained from scratch
Spark-TTS Inference Code
Instant voice cloning by MIT and MyShell. Audio foundation model
General-purpose image editing model that delivers high-fidelity
tiktoken is a fast BPE tokeniser for use with OpenAI's models
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Deep Research framework, combining language models with tools
Using AI models to automatically provide commentary and edit videos
Open source personal AI Assistant for Linux, Windows and Mac
Text-space optimizer that trains reusable natural-language skills
Multimodal embedding and reranking models built on Qwen3-VL
HY-Motion model for 3D character animation generation
Han Language Processing
NLP Cloud serves high performance pre-trained or custom models for NER
The most powerful local music generation model
Underthesea - Vietnamese NLP Toolkit
Create videos with Stable Diffusion
Miso TTS is an 8 billion, highly emotive text-to-speech model
Controllable & emotion-expressive zero-shot TTS
Controllable and fast Text-to-Speech for over 7000 languages
LLM
Agent harness to make your slop code well-engineered and beautiful
Code and models for ICML 2024 paper, NExT-GPT
GLM-4-Voice | End-to-End Chinese-English Conversational Model