Reflexion: Language Agents with Verbal Reinforcement Learning
MiroThinker is an open source deep research agent
Harmonized and Coherent Human Image Animation
An end-to-end Data Scientist
Language Model Reinforcement Learning Environments frameworks
SWE-agent takes a GitHub issue and tries to automatically fix it
Constrained Value Alignment via Safe Reinforcement Learning
AgentHandover observes, learns and teaches agents with skills
Blender Model Context Protocol Integration
Designed for training LLM/VLM agents via RL
Recipes to train reward model for RLHF
Scalable RL solution for advanced reasoning of language models
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Robust recipes to align language models with human and AI preferences
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Train multi-step agents for real-world tasks using GRPO
Context data platform for building observable, self-learning AI agents
FAIR Sequence Modeling Toolkit 2
Combination of multiple linters to install as a GitHub Action
Python package for AutoML on Tabular Data with Feature Engineering
SDK for connecting to AWS IoT from a device using Python
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Official Repo for ICML 2024 paper
Desktop piano playable with a PC keyboard, mouse, or MIDI device.
A free and open-source tool to download YouTube videos or playlists as