ChatGPT interface with better UI
Powerful AI language model (MoE) optimized for efficiency/performance
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model for generalized humanoid robot reasoning and skills
Models for object and human mesh reconstruction
Chat & pretrained large audio language model proposed by Alibaba Cloud
A state-of-the-art open visual language model
Diffusion Transformer with Fine-Grained Chinese Understanding
Real-time behaviour synthesis with MuJoCo, using Predictive Control
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
A Customizable Image-to-Video Model based on HunyuanVideo
RGBD video generation model conditioned on camera input
Qwen-Image is a powerful image generation foundation model
Wan2.2: Open and Advanced Large-Scale Video Generative Model
FAIR Sequence Modeling Toolkit 2
Pokee Deep Research Model Open Source Repo
Chat & pretrained large vision language model
Multimodal-Driven Architecture for Customized Video Generation
Reference PyTorch implementation and models for DINOv3
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
Example Discord bot written in Python that uses the completions API
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
CodeGeeX2: A More Powerful Multilingual Code Generation Model
The official repo of Qwen chat & pretrained large language model
Qwen2.5-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud
Official inference repo for FLUX.2 models