Official Python inference and LoRA trainer package
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Official inference repo for FLUX.2 models
A Powerful Native Multimodal Model for Image Generation
Pushing the Limits of Mathematical Reasoning in Open Language Models
Qwen-Image is a powerful image generation foundation model
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
Qwen2.5-VL is a multimodal large language model series
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image Generation
Controllable & emotion-expressive zero-shot TTS
DeepMind model for tracking arbitrary points across videos & robotics
LLM-based reinforcement-learning audio editing model
Real-time behaviour synthesis with MuJoCo, using Predictive Control
A CNN model that predicts human joints from RGB images of a person
Let us control diffusion models
An advanced bilingual image editing model with semantic control