[NeurIPS2025 Spotlight] Quantized Attention
Llama Chinese community, real-time aggregation
RAG Search API
Document Index for Vectorless, Reasoning-based RAG
ZAPI by Adopt AI is an open-source Python library
Enables the best performance on NVIDIA RTX Graphics Cards
CoreNet: A library for training deep neural networks
A Personalized LLM-powered Agent Framework
An AI for Music Generation
Running large language models on a single GPU
Streamlines and simplifies prompt design for both developers
Learning to Reason with Search for LLMs via Reinforcement Learning
A tension reasoning engine over 131 S-class problems
An Efficient Web-enhanced Question Answering System
An agentless approach to automatically solve software development
Empowering Code Generation with OSS-Instruct
A simple, performant and scalable JAX LLM
LISA: Reasoning Segmentation via Large Language Model
Enhances Tesseract OCR output using LLMs (local or API)
Leaderboard Comparing LLM Performance at Producing Hallucinations
Skywork-R1V is an advanced multimodal AI model series
Code and models for ICML 2024 paper, NExT-GPT
LightLLM is a Python-based LLM (Large Language Model) inference
Instruction-tuning LLM with Chinese Medical Knowledge
Robust recipes to align language models with human and AI preferences