A Powerful Native Multimodal Model for Image Generation
TextWorld is a sandbox learning environment for the training
Framework that is dedicated to making neural data processing
Toolkit for audio, music, and speech generation
Set of tools to assess and improve LLM security
Provides code for running inference with the SegmentAnything Model
Documentation for Google's Gen AI site - including Gemini API & Gemma
Training framework for Stable Baselines3 reinforcement learning agents
Monte Carlo tree search in JAX
The Library for LLM-based multi-agent applications
Virtual AI anchor that combines state-of-the-art technology
Foundational model for human-like, expressive TTS
Sharp Monocular Metric Depth in Less Than a Second
GUI/CLI tool for downloading Xiaohongshu
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Educational framework exploring multi-agent orchestration
Implementation of the Surya Foundation Model for Heliophysics
NVIDIA Federated Learning Application Runtime Environment
PPTAgent: Generating and Evaluating Presentations
Scalable machine learning for time series forecasting
Advanced evolutionary computation library built on top of PyTorch
Implementation of RLHF (Reinforcement Learning with Human Feedback)
Massively parallel rigidbody physics simulation
Official inference library for Mistral models
Multi-Agent daTa geneRation Infra and eXperimentation framework