MoBA: Mixture of Block Attention for Long-Context LLMs
Mastering Applied AI, One Concept at a Time
Modular AI runtime for robots
How to optimize algorithms in CUDA
Open-source evaluation toolkit of large multi-modality models (LMMs)
General technology for enabling AI capabilities with LLMs and MLLMs
The first AI agent that builds permissionless integrations
A Python module to repair invalid JSON from LLMs
Cybersecurity AI (CAI), the framework for AI Security
Llama Chinese community, real-time aggregation
Unified framework for building enterprise RAG pipelines
One-stop solution for creating your digital avatar from chat history
Large Language Model Principles and Practice Tutorial from Scratch
Memory Management Kit for Agents
Public opinion analysis system
Cube Studio open-source cloud-native one-stop machine learning
Context database designed specifically for AI Agents
Qwen3-ASR is an open-source series of ASR models
Quick illustration of how one can easily read books together with LLMs
Spark-TTS Inference Code
A frontier, first-principles handbook
Analyzing Hacker News discussions from a decade ago in hindsight
Making RAG Simpler with Small and Open-Sourced Language Models
Marrying Grounding DINO with Segment Anything & Stable Diffusion
Ultimate meta-skill for generating best-in-class Claude Code skills