Easy token price estimates for 400+ LLMs. TokenOps
Deploy your agentic worfklows to production
MoBA: Mixture of Block Attention for Long-Context LLMs
the terminal client for Ollama
NeurIPS2025 Spotlight] Quantized Attention
An open-source, modern-design AI training tracking and visualization
A simple, easy-to-hack GraphRAG implementation
The first AI agent that builds permissionless integrations
Open-source model for program synthesis
AirLLM 70B inference with single 4GB GPU
Unified framework for building enterprise RAG pipelines
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Ready-to-run cloud templates for RAG
Ongoing research training transformer models at scale
Public opinion analysis system
Long-form streaming TTS system for multi-speaker dialogue generation
Open-source industrial-grade ASR models
ZAPI by Adopt AI is an open-source Python library
Run LLM prompts from your shell
Quick illustration of how one can easily read books together with LLMs
AI-Powered Wiki Generator for GitHub/Gitlab/Bitbucket Repositories
A simple but powerful self-hosted finance tracker
Analyzing Hacker News discussions from a decade ago in hindsight
A modern selfhosted media management system for your media library
Fast-stable-diffusion + DreamBooth