Supercharge Your LLM Application Evaluations
Inference script for Oasis 500M
Efficiently computes derivatives of NumPy code
Autonomous harness engineering
Python observability platform for tracing apps, metrics, and logs
Supercharge Your Model Training
Your Fully-Automated Personal AI Assistant
AI-Researcher: Autonomous Scientific Innovation
950-line, minimal, extensible LLM inference engine built from scratch
The official PyTorch implementation of Google's Gemma models
Composable transformations of Python+NumPy programs
Build a modern LLM from scratch. Every line commented
Workshop-Level Automated Scientific Discovery via Agentic Tree Search
Visual tool for building, testing, and deploying AI agent workflows
Building an Intelligent Agent from Scratch
Harmonized and Coherent Human Image Animation
An end-to-end Data Scientist
Multi-agent autonomous startup system for Claude Code
Frameworks for language model reinforcement learning environments
Rename anything
High-quality implementations of standard and SOTA methods
Playground and cheatsheet for learning Python
Designed for training LLM/VLM agents via RL
Vertically Unified Agents for Graph Retrieval-Augmented Reasoning
Build multimodal language agents for fast prototyping and production