Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Multilingual Automatic Speech Recognition with word-level timestamps
A system for quickly generating training data with weak supervision
Open platform for training, serving, and evaluating language models
PaddlePaddle End-to-End Development Toolkit
Deep learning optimization library making distributed training easy
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
95% token savings. 155x faster queries. 16 languages
Follow along with my AI Agents Masterclass videos
Open source AI Agents hosted on the oTTomator Live Agent Studio
Framework for building neural networks
Fast and Universal 3D reconstruction model for versatile tasks
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
The Simple Agent Development Kit
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Open-source choice to scale, assess and maintain natural language data
Graph Neural Network Library for PyTorch
An industrial grade federated learning framework
SOTA discrete acoustic codec models with 40/75 tokens per second
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
Global weather forecasting model using graph neural networks and JAX
GPT4V-level open-source multi-modal model based on Llama3-8B
Evals is a framework for evaluating LLMs and LLM systems
NVIDIA Federated Learning Application Runtime Environment