Driving with Graph Visual Question Answering
Benchmark LLMs by fighting in Street Fighter 3
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
Implementation for MatMul-free LM
High-performance inference framework for large language models
A Next-Generation Training Engine Built for Ultra-Large MoE Models
A lightweight vLLM implementation built from scratch
Large Audio Language Model built for natural interactions
Advanced techniques for RAG systems
Refer and Ground Anything Anywhere at Any Granularity
Set of tools to assess and improve LLM security
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI
Research code artifacts for Code World Model (CWM)
Low-latency REST API for serving text-embeddings
BISHENG is an open LLM devops platform for next generation apps
OpenCompass is an LLM evaluation platform
Unifying 3D Mesh Generation with Language Models
I Agent designed to interact with ROS1- and ROS2-based robotics system
Multilingual sentence & image embeddings with BERT
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
Repo of Qwen2-Audio chat & pretrained large audio language model
Tensor search for humans
Official Repo for ICML 2024 paper
Concatenate a directory full of files into a single prompt
Open-weight, large-scale hybrid-attention reasoning model