Expert Parallelism Load Balancer
A bidirectional pipeline parallelism algorithm
Visualize streams of multimodal data
Build better UIs faster
The modern API client that lives in your terminal
Detect backward incompatible migrations for your django project
Management commands to help backup and restore your project database
Opiniated RAG for integrating GenAI in your apps
Package manager and build abstraction tool for FPGA/ASIC development
Package manager based on libdnf and libsolv. Replaces YUM
A middleware to provide an openAI compatible endpoint
A Model Context Protocol (MCP) server
An official Qdrant Model Context Protocol (MCP) server implementation
Browse the web, directly from Cursor etc.
Optimizing inference proxy for LLMs
Witness the aha moment of VLM with less than $3
Evaluation suite designed to assess the performance of LLMs
TextWorld is a sandbox learning environment for the training
An API standard for multi-agent reinforcement learning environments
World of apps for benchmarking interactive coding agent
Create custom engineering agents for your codebase
The behavior guidance framework for customer-facing LLM agents
Neural Network Compression Framework for enhanced OpenVINO
Openai style api for open large language models
Simple, unified interface to multiple Generative AI providers