A Gym environment for web task automation
Parallax is a distributed model serving framework
Search all of YouTube from the command line
Easy token price estimates for 400+ LLMs. TokenOps
Deploy your agentic workflows to production
MoBA: Mixture of Block Attention for Long-Context LLMs
A simple, easy-to-hack GraphRAG implementation
Open-source evaluation toolkit of large multi-modality models (LMMs)
The first AI agent that builds permissionless integrations
Open-source model for program synthesis
Cybersecurity AI (CAI), the framework for AI Security
Unified framework for building enterprise RAG pipelines
Run LLM prompts from your shell
Analyzing Hacker News discussions from a decade ago in hindsight
Extension of Google Research’s PaperBanana
Vertically Unified Agents for Graph Retrieval-Augmented Reasoning
Chat with your documents using local AI
LongBench v2 and LongBench (ACL '25 & '24)
Uncertainty Quantification for Language Models, a Python package
Streamlines and simplifies prompt design for both developers
A.S.E (AICGSecEval) is a repository-level AI-generated code security
Towards Efficient Self-Evolving Agent System
Chat with any codebase in under two minutes | Fully local
Local-first semantic code search engine
Learning to Reason with Search for LLMs via Reinforcement Learning