High-Resolution Image Synthesis with Latent Diffusion Models
Official inference repo for FLUX.1 models
High-Fidelity and Controllable Generation of Textured 3D Assets
Open-source industrial-grade ASR models
Sharp Monocular Metric Depth in Less Than a Second
Fast stable diffusion on CPU and AI PC
General-purpose image editing model that delivers high-fidelity
gpt-oss-120b and gpt-oss-20b are two open-weight language models
MOSS‑TTS Family open‑source speech and sound generation model
HY-Motion model for 3D character animation generation
Generating Immersive, Explorable, and Interactive 3D Worlds
Advanced language and coding AI model
Powerful AI language model (MoE) optimized for efficiency/performance
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Code for running inference and finetuning with SAM 3 model
AI PPT Track Terminator, the strongest PPT Skill ever
Robust Speech Recognition Across Languages, Dialects
Advancing Open-source World Models
Python inference and LoRA trainer package for the LTX-2 audio–video
Text and image to video generation: CogVideoX and CogVideo
DeepSeek Coder: Let the Code Write Itself
Ultra-Efficient LLMs on End Device
Ling-V2 is a MoE LLM provided and open-sourced by InclusionAI
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Video Object and Interaction Deletion