Generating Immersive, Explorable, and Interactive 3D Worlds
Real-time behaviour synthesis with MuJoCo, using Predictive Control
DeepSeek Coder: Let the Code Write Itself
Generate Any 3D Scene in Seconds
The official repo of Qwen chat & pretrained large language model
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Reference PyTorch implementation and models for DINOv3
Qwen-Image is a powerful image generation foundation model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Pushing the Limits of Mathematical Reasoning in Open Language Models
Industrial-level controllable zero-shot text-to-speech system
Multimodal Diffusion with Representation Alignment
Large Multimodal Models for Video Understanding and Editing
Revolutionizing Database Interactions with Private LLM Technology
High-Resolution Image Synthesis with Latent Diffusion Models
Qwen3-Coder is the code version of Qwen3
A Customizable Image-to-Video Model based on HunyuanVideo
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Foundation Models for Time Series
Hackable and optimized Transformers building blocks
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Capable of understanding text, audio, vision, video
Diversity-driven optimization and large-model reasoning ability
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Renderer for the harmony response format to be used with gpt-oss