Run PyTorch LLMs locally on servers, desktop and mobile
High-performance Inference and Deployment Toolkit for LLMs and VLMs
A system for agentic LLM-powered data processing and ETL
Power CLI and Workflow manager for LLMs (core package)
Examples and tutorials to help developers build AI systems
LightLLM is a Python-based LLM (Large Language Model) inference
Build a large language model from 0 only with Python foundation
CV, NLP, LLM project applications, and advanced engineering deployment
Instruction-tuning LLM with Chinese Medical Knowledge
Robust recipes to align language models with human and AI preferences
An Open-source Framework for Data-centric Language Agents
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Open Source Deep Research Alternative to Reason and Search
Accessible large language models via k-bit quantization for PyTorch
Accelerate local LLM inference and finetuning
Anomaly detection related books, papers, videos, and toolboxes
Retrieval and Retrieval-augmented LLMs
Large-Scale Agentic RL for High-Performance CUDA Kernel Generation
AI assistant based on large models that can actively think and plan
Implement a concise and clear Deep Search Agent from 0
One API call, pull Claude agent, completely sandboxed
CLI tool for configuring and monitoring Claude Code
A minimalist command line knowledge base manager
Long-form streaming TTS system for multi-speaker dialogue generation
slime is an LLM post-training framework for RL Scaling