A Gym environment for web task automation
Hypernetworks that adapt LLMs for specific benchmark tasks
An LLM Compiler for Parallel Function Calling
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Weaving the Digital Agent Galaxy
Semi-Structured Agentic Framework. Workflows build themselves
OpenCompass is an LLM evaluation platform
Agentic system design based on Schema-Guided Reasoning (SGR)
Open-Source Financial Large Language Models
AI-powered penetration testing assistant using a local LLM on Linux
Designed for text embedding and ranking tasks
TigerBot: A multi-language multi-task LLM
Low-code framework for building custom LLMs and neural networks
Modular AI runtime for robots
A high-performance ML model serving framework that offers dynamic batching
Integrating LLMs into structured NLP pipelines
Code for the paper "Evaluating Large Language Models Trained on Code"
LangChain powered shell command generator and runner CLI
Build multimodal language agents for fast prototyping and production
Make your agents learn from experience
The SOTA Open-Source Browser Agent
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Multilingual sentence & image embeddings with BERT
Take control of your AI agents
Get started with building full-stack agents using Gemini 2.5 and LangGraph