DepGraph: Towards Any Structural Pruning
High-performance inference framework for large language models
A dataset consisting of 15,140 ChatGPT prompts from Reddit
Code and models for the ICML 2024 paper NExT-GPT
Run PyTorch LLMs locally on servers, desktop and mobile
High-performance Inference and Deployment Toolkit for LLMs and VLMs
A system for agentic LLM-powered data processing and ETL
Power CLI and Workflow manager for LLMs (core package)
Examples and tutorials to help developers build AI systems
LightLLM is a Python-based LLM (Large Language Model) inference
CV, NLP, LLM project applications, and advanced engineering deployment
Instruction-tuning LLM with Chinese Medical Knowledge
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Structured data extraction and instruction calling with ML, LLM
Robust recipes to align language models with human and AI preferences
An Open-source Framework for Data-centric Language Agents
Multimodal Agents as Smartphone Users, an LLM-based multimodal agent
Open Source Deep Research Alternative to Reason and Search
Accelerate local LLM inference and finetuning
Books, papers, videos, and toolboxes related to anomaly detection
Retrieval and Retrieval-augmented LLMs
A lightweight vLLM implementation built from scratch
Implement a concise and clear Deep Search Agent from scratch
Pretrained time-series foundation model developed by Google Research
One API call, pull Claude agent, completely sandboxed