AI Suite for upscaling, interpolating & restoring images/videos
Release for Improved Denoising Diffusion Probabilistic Models
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
A Conversational Speech Generation Model
Open Multilingual Multimodal Chat LMs
Dataset of GPT-2 outputs for research in detection, biases, and more
Code for the paper Hybrid Spectrogram and Waveform Source Separation
Official repo for consistency models
Official PyTorch Implementation of "Scalable Diffusion Models"
800,000 step-level correctness labels on LLM solutions to MATH problems
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
A latent text-to-image diffusion model
A collection of high-quality models for the MuJoCo physics engine
PyTorch implementation of MAE
GLIDE: a diffusion-based text-conditional image synthesis model
Generate embeddings from large-scale graph-structured data
Open language model developed by NVIDIA as part of the Nemotron-3 family
Tencent’s 36-language state-of-the-art translation model
High-compute ultra-reasoning model surpassing GPT-5