A high-throughput and memory-efficient inference and serving engine
Web-based Traffic and Security Network Traffic Monitoring
Garnet is a remote cache-store from Microsoft Research
Open-source, scalable, and fault-tolerant MQTT broker
A solid, high-performance, JDBC connection pool at last
Running large language models on a single GPU
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation
Techniques and numbers for estimating system's performance
Fast JSON parser and validator for Go
950 line, minimal, extensible LLM inference engine built from scratch
Shardeum is an EVM based autoscaling blockchain
The official Rust implementation of Conflux protocol
Modern Load Testing as Code
AI memory OS for LLM and Agent systems
Concurrent and multi-stage data ingestion and data processing
Deep learning optimization library: makes distributed training easy
MySQL binlog
A user-space file system for interacting with Google Cloud Storage
Fast and memory-efficient exact attention
Minimal Python framework for scalable AI inference servers fast
High-performance inference server for text embeddings models API layer
Parallax is a distributed model serving framework
The async Python driver for MongoDB and Tornado or asyncio
CoreNet: A library for training deep neural networks
Distributed database warehouse for transactions, search and analytics