The first real AI developer
Create UIs for your machine learning model in Python in 3 minutes
GLM-4 series: Open Multilingual Multimodal Chat LMs
Supercharge Your LLM Application Evaluations
Practice implementing softmax, attention, GPT-2 and more
Open source async coding agent that plans, codes, and opens PRs
AI-Powered tool for automated pull request analysis
Seamlessly integrate LLMs into scikit-learn
Constrained Value Alignment via Safe Reinforcement Learning
ClawTeam: Agent Swarm Intelligence (One Command → Full Automation)
Workshop-Level Automated Scientific Discovery via Agentic Tree Search
Reflexion: Language Agents with Verbal Reinforcement Learning
MiroThinker is an open source deep research agent
An end-to-end Data Scientist
Reinforcement learning environment frameworks for language models
Management of Yandex Station and other smart home devices
SWE-agent takes a GitHub issue and tries to automatically fix it
AgentHandover observes, learns and teaches agents with skills
Blender Model Context Protocol Integration
Designed for training LLM/VLM agents via RL
Recipes to train reward models for RLHF
Scalable RL solution for advanced reasoning of language models
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Robust recipes to align language models with human and AI preferences
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation