ChatGPT interface with better UI
Powerful AI language model (MoE) optimized for efficiency/performance
NVIDIA Isaac GR00T N1.5 is an open foundation model for generalized humanoid robot reasoning and skills
Models for object and human mesh reconstruction
Example Discord bot written in Python that uses the completions API
Chat & pretrained large audio language model proposed by Alibaba Cloud
A state-of-the-art open visual language model
Official inference repo for FLUX.2 models
Diffusion Transformer with Fine-Grained Chinese Understanding
A Customizable Image-to-Video Model based on HunyuanVideo
RGBD video generation model conditioned on camera input
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Qwen-Image is a powerful image generation foundation model
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
CodeGeeX2: A More Powerful Multilingual Code Generation Model
FAIR Sequence Modeling Toolkit 2
Multimodal-Driven Architecture for Customized Video Generation
Reference PyTorch implementation and models for DINOv3
Pokee Deep Research Model Open Source Repo
Chat & pretrained large vision language model
Fast and Universal 3D reconstruction model for versatile tasks
FlashMLA: Efficient Multi-head Latent Attention Kernels
Qwen2.5-VL is a multimodal large language model series from the Qwen team at Alibaba Cloud