Benchmark LLMs by fighting in Street Fighter 3
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
An Efficient Web-enhanced Question Answering System
Implementation for MatMul-free LM
High-performance inference framework for large language models
A Next-Generation Training Engine Built for Ultra-Large MoE Models
A lightweight vLLM implementation built from scratch
Large Audio Language Model built for natural interactions
Advanced techniques for RAG systems
Refer and Ground Anything Anywhere at Any Granularity
Set of tools to assess and improve LLM security
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Research code artifacts for Code World Model (CWM)
Low-latency REST API for serving text-embeddings
OpenCompass is an LLM evaluation platform
I Agent designed to interact with ROS1- and ROS2-based robotics system
A modular Agentic RAG built with LangGraph
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
Repo of Qwen2-Audio chat & pretrained large audio language model
Tensor search for humans
Concatenate a directory full of files into a single prompt
Open-weight, large-scale hybrid-attention reasoning model
Large-language-model & vision-language-model based on Linear Attention
Capable of understanding text, audio, vision, video
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training