Bringing BERT into modernity via both architecture changes and scaling
Scalable RL solution for advanced reasoning of language models
Autoregressive Model Beats Diffusion
Implementation for MatMul-free LM
Build a large language model from 0 only with Python foundation
The absolute trainer to light up AI agents
An Open-Source AI Agent Platform for Financial Analysis using LLMs
Framework for building neural networks
Flax is a neural network library for JAX
An open source library for GPU-accelerated robot learning
4M: Massively Multimodal Masked Modeling
An implementation of a deep learning recommendation model (DLRM)
Self-supervised visual learning using momentum contrast in PyTorch
Memory-efficient and performant finetuning of Mistral's models
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Composable building blocks to build Llama Apps
A Universal Customization Method for Single and Multi Conditioning
Build a machine learning model from a prompt
A python library for self-supervised learning on images
Evaluate your LLM's response with Prometheus and GPT4
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass
Hackable and optimized Transformers building blocks
Message Passing Neural Networks for Molecule Property Prediction
Implementation of DeepLabCut
Tool for visualizing and tracking your machine learning experiments