On the Structural Pruning of Large Language Models
Unified KV Cache Compression Methods for Auto-Regressive Models
Python library to compile, build & package AWS Lambda functions
Why use many token when few token do trick
Natural language workflows for AI agents
Pythonic Smart Contract Language for the EVM
Open Agent Harness with a built-in personal agent, Ohmo
Development repository for the Triton language and compiler
Open source RAG framework for building scalable modular AI apps
Universal LLM Deployment Engine with ML Compilation
OSRFramework, the Open Sources Research Framework, is an AGPLv3+ project
Powerful, mature open-source cross-platform game engine for Python
Event-driven networking engine written in Python
Superduper: Integrate AI models and machine learning workflows
Knowledge Graph Generation from Any Text
Library for efficiently connecting and optimizing teams of AI agents
Gracefully face hCaptcha challenges with multimodal LLMs
A lightweight vLLM implementation built from scratch
The Arcade Learning Environment (ALE) -- a platform for AI research
A lightweight data processing framework built on DuckDB and 3FS
ContextGem: Effortless LLM extraction from documents
Reverse engineering Gemini's SynthID detection
A lightweight, powerful framework for multi-agent workflows
Open-source framework for conversational voice AI agents
Open-source LLM-friendly Web Crawler & Scraper