Train any agents simply by 'talking'
Curated list of datasets and tools for post-training
Project aimed at extracting, exporting, and analyzing chat records
Train multi-step agents for real-world tasks using GRPO
Claude Code is an agentic coding tool that lives in your terminal
Designed for training LLM/VLM agents via RL
TextWorld is a sandbox learning environment for the training
Faster and easier training and deployments
Learning to Reason with Search for LLMs via Reinforcement Learning
A simple, performant and scalable Jax LLM
Training framework for Stable Baselines3 reinforcement learning agents
Deep learning optimization library: makes distributed training easy
A fast TTS architecture with conditional flow matching
"Big Model" trains a visual multimodal VLM with 26M parameters
Retrieval and Retrieval-augmented LLMs
Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real
RF-DETR is a real-time object detection and segmentation
A Next-Generation Training Engine Built for Ultra-Large MoE Models
MLOps tools for managing & orchestrating the ML LifeCycle
Volcano Engine Reinforcement Learning for LLMs
Scalable machine learning for time series forecasting
Turn expensive prompts into cheap fine-tuned models
Roadmap to becoming an Artificial Intelligence Expert in 2022
AI Code Security Anti-Patterns distilled from 150+ sources
Learn how to develop, deploy and iterate on production-grade ML