MiroThinker is an open source deep research agent
Bridging LLM and Recommender System
Semi-Structured Agentic Framework. Workflows build themselves
Parallax is a distributed model serving framework
Minimal reproduction of OneRec
Redundancy-aware KV Cache Compression for Reasoning Models
AI-powered tool for efficient abstract and PDF screening
AI-driven multi-agent research assistant automating hypothesis
A high-quality PDF to Markdown tool based on large language model
Specify a github or local repo, github pull request
From nobody to big model (LLM) hero
MoBA: Mixture of Block Attention for Long-Context LLMs
Mastering Applied AI, One Concept at a Time
the terminal client for Ollama
How to optimize some algorithm in cuda
NeurIPS2025 Spotlight] Quantized Attention
A simple, easy-to-hack GraphRAG implementation
Open-source evaluation toolkit of large multi-modality models (LMMs)
General technology for enabling AI capabilities w/ LLMs and MLLMs
Llama Chinese community, real-time aggregation
Production-grade platform for building agentic IM bots
One-stop solution for creating your digital avatar from chat history
Large Language Model Principles and Practice Tutorial from Scratch
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
Ready-to-run cloud templates for RAG