Massively parallel rigidbody physics simulation
Unified Multimodal Understanding and Generation Models
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Research code artifacts for Code World Model (CWM)
A Customizable Image-to-Video Model based on HunyuanVideo
Deep learning optimization library: makes distributed training easy
The no-nonsense RAG chunking library
A lightweight, powerful framework for multi-agent workflows
Phi-3.5 for Mac: Locally-run Vision and Language Models
Implementation of AudioLM audio generation model in Pytorch
Pokee Deep Research Model Open Source Repo
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
The official Meta Llama 3 GitHub site
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
High-resolution models for human tasks
Official implementation of DreamCraft3D
Renderer for the harmony response format to be used with gpt-oss
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Multimodal-Driven Architecture for Customized Video Generation
The NVIDIA AgentIQ toolkit is an open-source library
Transformers4Rec is a flexible and efficient library
NVIDIA Federated Learning Application Runtime Environment
Documentation for Google's Gen AI site - including Gemini API & Gemma
Examples and guides for using the OpenAI API
A robust, efficient, low-latency speech-to-text library