A high-throughput and memory-efficient inference and serving engine
Web-based Traffic and Security Network Traffic Monitoring
Garnet is a remote cache-store from Microsoft Research
A high-performance inference system for large language models
A solid, high-performance, JDBC connection pool at last
Open-source, scalable, and fault-tolerant MQTT broker
Running large language models on a single GPU
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
Techniques and numbers for estimating a system's performance
Fast JSON parser and validator for Go
C++ library for high performance inference on NVIDIA GPUs
950-line, minimal, extensible LLM inference engine built from scratch
Modern Load Testing as Code
AI memory OS for LLM and Agent systems
Shardeum is an EVM based autoscaling blockchain
The official Rust implementation of the Conflux protocol
Concurrent and multi-stage data ingestion and data processing
Deep learning optimization library: makes distributed training easy
MySQL binlog
Fast and memory-efficient exact attention
A user-space file system for interacting with Google Cloud Storage
Minimal Python framework for fast, scalable AI inference servers
High-performance inference server for text embedding models (API layer)
Parallax is a distributed model serving framework
The async Python driver for MongoDB and Tornado or asyncio