Easy token price estimates for 400+ LLMs. TokenOps
From nobody to big model (LLM) hero
Deploy your agentic worfklows to production
MoBA: Mixture of Block Attention for Long-Context LLMs
NeurIPS2025 Spotlight] Quantized Attention
General technology for enabling AI capabilities w/ LLMs and MLLMs
The first AI agent that builds permissionless integrations
A python module to repair invalid JSON from LLMs
Unified framework for building enterprise RAG pipelines
One-stop solution for creating your digital avatar from chat history
Chat with your SQL database
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
Open source demo platform where you can easily showcase your AI models
LongBench v2 and LongBench (ACL 25'&24')
Open-weight, large-scale hybrid-attention reasoning model
Text-space optimizer that trains reusable natural-language skills
Play ChatGPT and other LLM with Xiaomi AI Speaker
A Telegram bot for Large Language Models
NBA sports betting using machine learning
How to optimize some algorithm in cuda
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
Adding guardrails to large language models
Seamlessly integrate LLMs into scikit-learn
State-of-the-art Parameter-Efficient Fine-Tuning
AI Agent Evaluator & Red Team Platform