Minimal Python framework for fast, scalable AI inference servers
Multilingual Document Layout Parsing in a Single Vision-Language Model
A high-performance ML model serving framework that offers dynamic batching
Angular Toastr
The official repository for ERNIE 4.5 and ERNIEKit
GitHub Actions for DigitalOcean - doctl
Capable of understanding text, audio, vision, video
High-Resolution Image Synthesis with Latent Diffusion Models
Serving system for machine learning models
OpenCL integration for Python, plus shiny features
Fast ML inference & training for ONNX models in Rust
A simple, performant and scalable JAX LLM
A massively parallel, high-level programming language
MII makes low-latency and high-throughput inference possible
Open-Source Low-Latency Accelerated Linux WebRTC HTML5 Remote Desktop
Interactive data visualizations and plotting in Julia
Benchmark CPU, GPU, memory, and storage
TensorRT LLM provides users with an easy-to-use Python API
Mooncake is the serving platform for Kimi
TIGRE: Tomographic Iterative GPU-based Reconstruction Toolbox
oneAPI Deep Neural Network Library (oneDNN)
MiniMax-M2, a model built for Max coding & agentic workflows
The Compute Library is a set of computer vision and machine learning
Generate music based on natural language prompts using LLMs
[NeurIPS2025 Spotlight] Quantized Attention