Open-source, high-performance AI model with advanced reasoning
Powerful AI language model (MoE) optimized for efficiency/performance
Open-source multi-speaker long-form text-to-speech model
Qwen3.5 is the large language model series developed by Qwen team
Visual Causal Flow
Python SDK for Claude Agent
Industrial-level controllable zero-shot text-to-speech system
From Images to High-Fidelity 3D Assets
RGBD video generation model conditioned on camera input
Open Source Speech Language Model
Qwen3-ASR is an open-source series of ASR models
Video understanding codebase from FAIR for reproducing video models
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
General-purpose image editing model that delivers high-fidelity
Long-form streaming TTS system for multi-speaker dialogue generation
Audio foundation model excelling in audio understanding
Contexts Optical Compression
Controllable & emotion-expressive zero-shot TTS
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Pushing the Limits of Mathematical Reasoning in Open Language Models
Inference script for Oasis 500M
Foundational Models for State-of-the-Art Speech and Text Translation
Analyze computation-communication overlap in V3/R1
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Pokee Deep Research Model Open Source Repo