Modular AI runtime for robots
NeurIPS2025 Spotlight] Quantized Attention
A simple, easy-to-hack GraphRAG implementation
General technology for enabling AI capabilities w/ LLMs and MLLMs
The first AI agent that builds permissionless integrations
Open-source model for program synthesis
Llama Chinese community, real-time aggregation
Unified framework for building enterprise RAG pipelines
One-stop solution for creating your digital avatar from chat history
Open-source AI hackers to find and fix your app’s vulnerabilities
Chat with your SQL database
Quick illustration of how one can easily read books together with LLMs
A frontier, first-principles handbook
Analyzing Hacker News discussions from a decade ago in hindsight
A New Axis of Sparsity for Large Language Models
LLM training in simple, raw C/CUDA
Ling is a MoE LLM provided and open-sourced by InclusionAI
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible
Scalable data pre processing and curation toolkit for LLMs
A high-performance ML model serving framework, offers dynamic batching
PandasAI is a Python library that integrates generative AI
Low-code framework for building custom LLMs, neural networks
Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge
BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
User toolkit for analyzing and interfacing with Large Language Models