High-performance inference framework for large language models
Accessible large language models via k-bit quantization for PyTorch
Context engineering is the new vibe coding
Utilities intended for use with Llama models
Open-source platform for building enterprise-grade agents
MobileLLM: Optimizing Sub-billion Parameter Language Models
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Open-source large language model family from Tencent Hunyuan
A suite of tools to develop RAG, semantic search, and other AI apps
Flexible and powerful framework for managing multiple AI agents
A program that can do anything to earn money without human operators
A Model Context Protocol (MCP) server
Browse the web, directly from Cursor etc.
Shell command execution server implementing the Model Context Protocol
Develop software autonomously
FlashInfer: Kernel Library for LLM Serving
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Witness the aha moment of VLM with less than $3
Go ahead and axolotl questions
Evaluation suite designed to assess the performance of LLMs
A Modular Simulation Framework and Benchmark for Robot Learning
An API standard for multi-agent reinforcement learning environments
World of apps for benchmarking interactive coding agents
The behavior guidance framework for customer-facing LLM agents
Neural Network Compression Framework for enhanced OpenVINO inference