OpenDILab Decision AI Engine
CLIP: Predict the most relevant text snippet given an image
Technical principles related to large models
An Easy-to-use, Scalable and High-performance RLHF Framework
A library for accelerating Transformer models on NVIDIA GPUs
High-Fidelity and Controllable Generation of Textured 3D Assets
TextWorld is a sandbox learning environment for the training
Robust Speech Recognition via Large-Scale Weak Supervision
Fast multimodal LLM for real-time voice interaction and AI apps
Deep-learning-driven jazz generation using Keras & Theano
Generate couplets with a seq2seq model
Designed for training LLM/VLM agents via RL
Constrained Value Alignment via Safe Reinforcement Learning
Scalable RL solution for advanced reasoning of language models
Unleashing 10,000+ Word Generation from Long Context LLMs
Implementation for MatMul-free LM
The absolute trainer to light up AI agents
Framework for building neural networks
Flax is a neural network library for JAX
An open source library for GPU-accelerated robot learning
4M: Massively Multimodal Masked Modeling
An implementation of a deep learning recommendation model (DLRM)
Self-supervised visual learning using momentum contrast in PyTorch
Memory-efficient and performant finetuning of Mistral's models
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model