NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
Implementation of model parallel autoregressive transformers on GPUs
High-Resolution Image Synthesis with Latent Diffusion Models
LLaMA: Open and Efficient Foundation Language Models
Open-source AI suite for Windows with Real-ESRGAN, GFPGAN & RIFE. v4.1
Open-Source Financial Large Language Models!
Open-source, high-performance Mixture-of-Experts large language model
Powerful open source image generation model
A Conversational Speech Generation Model
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open Multilingual Multimodal Chat LMs
An Open Bilingual Chat LLM | Open Source Bilingual Conversation LLM
Janus-Series: Unified Multimodal Understanding and Generation Models
Open-source pre-training implementation of Google's LaMDA in PyTorch
An implementation of model parallel GPT-2 and GPT-3-style models
GLIDE: a diffusion-based text-conditional image synthesis model
Release for Improved Denoising Diffusion Probabilistic Models
Renderer for the harmony response format to be used with gpt-oss