Official Python inference and LoRA trainer package
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Official inference repo for FLUX.2 models
A Powerful Native Multimodal Model for Image Generation
Qwen-Image is a powerful image generation foundation model
Qwen2.5-VL is a multimodal large language model series
Pushing the Limits of Mathematical Reasoning in Open Language Models
Controllable & emotion-expressive zero-shot TTS
DeepMind model for tracking arbitrary points across videos & robotics
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
LLM-based reinforcement learning audio editing model
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Let us control diffusion models