A high-performance image compression microservice based on MCP
Deep learning optimization library: makes distributed training easy
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Neural Network Compression Framework for enhanced OpenVINO
Technical principles related to large models
Let's make video diffusion practical
Unified KV Cache Compression Methods for Auto-Regressive Models
48 kHz stereo neural audio codec for general audio
Redundancy-aware KV Cache Compression for Reasoning Models
The highest-scoring AI memory system ever benchmarked
Implementation of TurboQuant (ICLR 2026)
SOTA discrete acoustic codec models with 40/75 tokens per second
14-stage Fusion Pipeline for LLM token compression
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
Awesome multilingual OCR toolkits based on PaddlePaddle
Data Lake for Deep Learning. Build, manage, and query datasets
AIMET is a library that provides advanced quantization and compression
Claude Code plugin that automatically captures everything Claude does
Koog is the official Kotlin framework for building AI agents
Data and tools for generating and inspecting OLMo pre-training data
Running large language models on a single GPU
A tension reasoning engine over 131 S-class problems
From-scratch PyTorch implementation of Google's TurboQuant
Contexts Optical Compression
AI gateway with token compression for Claude Code, Codex, and more