Code and models for ICML 2024 paper, NExT-GPT
Run PyTorch LLMs locally on servers, desktop and mobile
High-performance Inference and Deployment Toolkit for LLMs and VLMs
A system for agentic LLM-powered data processing and ETL
Power CLI and Workflow manager for LLMs (core package)
Examples and tutorials to help developers build AI systems
LightLLM is a Python-based LLM (Large Language Model) inference
Build a large language model from 0 only with Python foundation
CV, NLP, LLM project applications, and advanced engineering deployment
Instruction-tuning LLM with Chinese Medical Knowledge
Robust recipes to align language models with human and AI preferences
Open Source Deep Research Alternative to Reason and Search
Accessible large language models via k-bit quantization for PyTorch
Accelerate local LLM inference and finetuning
Anomaly detection related books, papers, videos, and toolboxes
Retrieval and Retrieval-augmented LLMs
A lightweight vLLM implementation built from scratch
slime is an LLM post-training framework for RL Scaling
Large Audio Language Model built for natural interactions
95% token savings. 155x faster queries. 16 languages
Advanced techniques for RAG systems
Refer and Ground Anything Anywhere at Any Granularity
Set of tools to assess and improve LLM security
MobileLLM Optimizing Sub-billion Parameter Language Models
Ring is a reasoning MoE LLM provided and open-sourced by InclusionAI