Official repository for LTX-Video
Personalize Any Characters with a Scalable Diffusion Transformer
Qwen3-TTS is an open-source series of TTS models
Z80-μLM is a 2-bit quantized language model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Qwen-Image is a powerful image generation foundation model
Contexts Optical Compression
Collection of Gemma 3 variants that are trained for performance
From Images to High-Fidelity 3D Assets
Inference framework for 1-bit LLMs
Advanced language and coding AI model
HY-Motion model for 3D character animation generation
GLM-4-Voice | End-to-End Chinese-English Conversational Model
RGBD video generation model conditioned on camera input
Diffusion Bee is the easiest way to run Stable Diffusion locally
Advancing Open-source World Models
Pokee Deep Research Model Open Source Repo
Powerful AI language model (MoE) optimized for efficiency/performance
The official repo of Qwen chat & pretrained large language model
A Customizable Image-to-Video Model based on HunyuanVideo
Agentic, Reasoning, and Coding (ARC) foundation models
A Powerful Native Multimodal Model for Image Generation
Phi-3.5 for Mac: Locally-run Vision and Language Models
Multimodal Diffusion with Representation Alignment
Easy Docker setup for Stable Diffusion with user-friendly UI