A Gym environment for web task automation
All-in-one AI companion! Desktop girlfriend + virtual streamer
Hypernetworks that adapt LLMs for specific benchmark tasks
An LLM Compiler for Parallel Function Calling
Claraverse is a opesource privacy focused ecosystem to replace ChatGPT
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
Weaving the Digital Agent Galaxy
Semi-Structured Agentic Framework. Workflows build themselves
Run an army of Claude Code, Codex, etc. on your machine
OpenCompass is an LLM evaluation platform
Schema-Guided Reasoning (SGR) has agentic system design
Open-Source Financial Large Language Models
AI-powered penetration testing assistant using local LLM on linux
From Vibe Coding to Agentic Engineering
Claude Code opened to any LLM
AI Browser Automation
TigerBot: A multi-language multi-task LLM
Demystify AI agents by building them yourself. Local LLMs
State of the art LLM and coding model
Low-code framework for building custom LLMs, neural networks
Modular AI runtime for robots
Moonshot's most powerful AI model
A high-performance ML model serving framework, offers dynamic batching
Integrating LLMs into structured NLP pipelines
Code for the paper "Evaluating Large Language Models Trained on Code"