CV, NLP, LLM project applications, and advanced engineering deployment
Accelerate local LLM inference and finetuning
Run PyTorch LLMs locally on servers, desktop and mobile
An elegant PyTorch implementation of transformers
Accessible large language models via k-bit quantization for PyTorch
A straightforward method for training your LLM
Fast Multimodal LLM on Mobile Devices
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
An open-source, modern-design AI training tracking and visualization
Training Large Language Models to Reason in a Continuous Latent Space
Toolkit for conversational AI
A large language model for medical consultation in Chinese
DepGraph: Towards Any Structural Pruning
Build a large language model from scratch with only a Python foundation
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Multilingual sentence & image embeddings with BERT
MobileLLM: Optimizing Sub-billion Parameter Language Models
Tensor search for humans
Inference Llama 2 in one file of pure C
PyTorch library of curated Transformer models and their components
Database system for building simpler and faster AI-powered applications
Run 100B+ language models at home, BitTorrent-style
Inference code for Llama models