A lightweight vLLM implementation built from scratch
Universal LLM Deployment Engine with ML Compilation
Evaluate your LLM's response with Prometheus and GPT4
Test-Time Reinforcement Learning
Minimal reproduction of OneRec
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs
Gracefully face hCaptcha challenge with multimodal llms
Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD
Framework and no-code GUI for fine-tuning LLMs
A.S.E (AICGSecEval) is a repository-level AI-generated code security
Towards Efficient Self-Evolving Agent System
Learning to Reason with Search for LLMs via Reinforcement Learning
Cache-Augmented Generation: A Simple, Efficient Alternative to RAG
An LLM Compiler for Parallel Function Calling
Scalable RL solution for advanced reasoning of language models
A system for agentic LLM-powered data processing and ETL
Automatic question answering for local knowledge bases based on LLM
Tools like web browser, computer access and code runner for LLMs
Gemma open-weight LLM library, from Google DeepMind
Retrieval Augmented Generation (RAG) framework
Open source demo platform where you can easily showcase your AI models
Evals is a framework for evaluating LLMs and LLM systems
Bringing BERT into modernity via both architecture changes and scaling
MobileLLM Optimizing Sub-billion Parameter Language Models
AIConfig is a config-based framework to build generative AI apps