RGBD video generation model conditioned on camera input
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
An experimental version of the DeepSeek model
HY-Motion model for 3D character animation generation
Foundation Models for Time Series
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Industrial-level controllable zero-shot text-to-speech system
LTX-Video Support for ComfyUI
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Qwen3-Coder is the code version of Qwen3
Qwen-Image is a powerful image generation foundation model
A Powerful Native Multimodal Model for Image Generation
The official repo of Qwen chat & pretrained large language model
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Qwen2.5-VL is the multimodal large language model series
A Systematic Framework for Interactive World Modeling
Diffusion Transformer with Fine-Grained Chinese Understanding
Uncommon Objects in 3D dataset
Renderer for the harmony response format to be used with gpt-oss
Generating Immersive, Explorable, and Interactive 3D Worlds
Collection of Gemma 3 variants that are trained for performance
Multimodal-Driven Architecture for Customized Video Generation
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Generate Any 3D Scene in Seconds
Global weather forecasting model using graph neural networks and JAX