A Family of Open Sourced Music Foundation Models
Official inference repo for FLUX.2 models
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Python inference and LoRA trainer package for the LTX-2 audio–video
gpt-oss-120b and gpt-oss-20b are two open-weight language models
From Images to High-Fidelity 3D Assets
Long-form streaming TTS system for multi-speaker dialogue generation
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Official DeiT repository
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Reference PyTorch implementation and models for DINOv3
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Moonshot's most powerful AI model
CodeGeeX2: A More Powerful Multilingual Code Generation Model
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Industrial-level controllable zero-shot text-to-speech system
The official PyTorch implementation of Google's Gemma models
Inference framework for 1-bit LLMs
Generate Any 3D Scene in Seconds
Inference script for Oasis 500M
Fast and Universal 3D reconstruction model for versatile tasks
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Multimodal embedding and reranking models built on Qwen3-VL
High-resolution models for human tasks
Instructions on how to use the Realtime API on Microcontrollers