Semi-Structured Agentic Framework. Workflows build themselves
Parallax is a distributed model serving framework
Minimal reproduction of OneRec
Redundancy-aware KV Cache Compression for Reasoning Models
The official implementation of RAPTOR
A high-quality PDF to Markdown tool based on large language model
Specify a github or local repo, github pull request
From nobody to big model (LLM) hero
Deploy your agentic worfklows to production
Mastering Applied AI, One Concept at a Time
NeurIPS2025 Spotlight] Quantized Attention
Open-source evaluation toolkit of large multi-modality models (LMMs)
General technology for enabling AI capabilities w/ LLMs and MLLMs
Open-source model for program synthesis
Memory Management Kit for Agents
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Ready-to-run cloud templates for RAG
Cube Studio open source cloud native one-stop machine learning
Long-form streaming TTS system for multi-speaker dialogue generation
Document Index for Vectorless, Reasoning-based RAG
Open-Source Dual-Arm Mobile Robot with Motorized Lift
Harmonized and Coherent Human Image Animation
Latent Collaboration in Multi-Agent Systems
An end-to-end Data Scientist
Speakr is a personal, self-hosted web application