Powerful AI language model (MoE) optimized for efficiency/performance
Easy token price estimates for 400+ LLMs. TokenOps
This repository contains the official implementation of FastVLM
Bidirectional token-classification model for identifiable info
Token-Efficient AI Agent with same budget, higher intelligence density
Persistent context and multi-instance coordination
14-stage Fusion Pipeline for LLM token compression
Why use many token when few token do trick
Implementation of Phenaki Video, which uses Mask GIT
Real-time Claude Code usage monitor with predictions and warnings
The best way to use Hermes Agent from the web or from your phone
LLM-based Reinforcement Learning audio edit model
Open-source, high-performance AI model with advanced reasoning
Create prompt-friendly codebase digests from any Git repository URL
OpenSpace: Make Your Agents: Smarter, Low-Cost, Self-Evolving
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Compress tool outputs, logs, files, and RAG chunks
Minimal reproduction of OneRec
Real-time multi-AI collaboration: Claude, Codex & Gemini
Provides line-oriented text file editing capabilities
Build your own Cowork, AI Scientist and other SoTA Agents
A Powerful Native Multimodal Model for Image Generation
Large Language Model Text Generation Inference
Offical Implementation for "Recursive Multi-Agent Systems"
MoBA: Mixture of Block Attention for Long-Context LLMs