A middleware to provide an openAI compatible endpoint
Shell command execution server implementing the Model Context Protocol
FlashInfer: Kernel Library for LLM Serving
A unified interface for distributed computing
Python scraper based on AI
Private AI platform for agents, enterprise search and RAG pipelines
Ongoing research training transformer models at scale
The Clay Foundation Model - An open source AI model and interface
The official Python client for the Huggingface Hub
Optimize your code automatically with AI
[NeurIPS 2023 Spotlight] LightZero
A guidance language for controlling large language models
Official inference library for Mistral models
A high performance implementation of HDBSCAN clustering
We write your reusable computer vision tools
Focus on creating classic Python small examples and cases
A Tree Search Library with Flexible API for LLM Inference-Time Scaling
Video-based AI memory library. Store millions of text chunks in MP4
A lightweight, powerful framework for multi-agent workflows
Parse files for optimal RAG
ML engineer that reads papers, trains models, and ships ML models
Why use many token when few token do trick
An Efficient Agentic Model for Computer Use
SGLang is a fast serving framework for large language models
AI-data warehouse to enrich, transform and analyze unstructured data