A high-performance image compression microservice based on MCP
Deep learning optimization library: makes distributed training easy
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Technical principles related to large models
Neural Network Compression Framework for enhanced OpenVINO
Let's make video diffusion practical
Unified KV Cache Compression Methods for Auto-Regressive Models
48 kHz stereo neural audio codec for general audio
Implementation of TurboQuant (ICLR 2026)
Redundancy-aware KV Cache Compression for Reasoning Models
The highest-scoring AI memory system ever benchmarked
SOTA discrete acoustic codec models with 40/75 tokens per second
Claude Code plugin that automatically captures everything Claude does
Awesome multilingual OCR toolkits based on PaddlePaddle
LMDeploy is a toolkit for compressing, deploying, and serving LLMs
14-stage Fusion Pipeline for LLM token compression
Koog is the official Kotlin framework for building AI agents
Data Lake for Deep Learning. Build, manage, and query datasets
AIMET is a library that provides advanced quantization and compression
Data and tools for generating and inspecting OLMo pre-training data
Running large language models on a single GPU
A tension reasoning engine over 131 S-class problems
From-scratch PyTorch implementation of Google's TurboQuant
Contexts Optical Compression
Libraries for applying sparsification recipes to neural networks