Python inference and LoRA trainer package for the LTX-2 audio–video
From Images to High-Fidelity 3D Assets
Long-form streaming TTS system for multi-speaker dialogue generation
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
A Family of Open Sourced Music Foundation Models
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Inference framework for 1-bit LLMs
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A Pragmatic VLA Foundation Model
Multimodal embedding and reranking models built on Qwen3-VL
High-resolution models for human tasks
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Official code for Style Aligned Image Generation via Shared Attention
A library for Multilingual Unsupervised or Supervised word Embeddings
Tencent’s 36-language state-of-the-art translation model