A Gym environment for web task automation
All-in-one AI companion! Desktop girlfriend + virtual streamer
Hypernetworks that adapt LLMs for specific benchmark tasks
An LLM Compiler for Parallel Function Calling
Claraverse is a opesource privacy focused ecosystem to replace ChatGPT
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Weaving the Digital Agent Galaxy
Semi-Structured Agentic Framework. Workflows build themselves
Run an army of Claude Code, Codex, etc. on your machine
OpenCompass is an LLM evaluation platform
Schema-Guided Reasoning (SGR) has agentic system design
Open-Source Financial Large Language Models
From Vibe Coding to Agentic Engineering
Claude Code opened to any LLM
AI Browser Automation
TigerBot: A multi-language multi-task LLM
Demystify AI agents by building them yourself. Local LLMs
Low-code framework for building custom LLMs, neural networks
State of the art LLM and coding model
Moonshot's most powerful AI model
Modular AI runtime for robots
A high-performance ML model serving framework, offers dynamic batching
Integrating LLMs into structured NLP pipelines
Code for the paper "Evaluating Large Language Models Trained on Code"
LangChain powered shell command generator and runner CLI