NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Chinese LLaMA-2 & Alpaca-2 Large Model Phase II Project
Chinese LLaMA & Alpaca large language model + local CPU/GPU training
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion
Implementation of model parallel autoregressive transformers on GPUs
High-Resolution Image Synthesis with Latent Diffusion Models
LLaMA: Open and Efficient Foundation Language Models
Open-Source Financial Large Language Models!
Powerful open source image generation model
Open-source, high-performance Mixture-of-Experts large language model
A Conversational Speech Generation Model
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
Open Multilingual Multimodal Chat LMs
An Open Bilingual Chat LLM | Open Source Bilingual Conversation LLM
Janus-Series: Unified Multimodal Understanding and Generation Models
Open-source pre-training implementation of Google's LaMDA in PyTorch
An implementation of model parallel GPT-2 and GPT-3-style models
JetBrains’ 4B parameter code model for completions
Dia-1.6B generates lifelike English dialogue and vocal expressions
Tencent’s 36-language state-of-the-art translation model
OpenAI’s compact 20B open model for fast, agentic, and local use