OpenDAN is an open source Personal AI OS
Inference Llama 2 in one file of pure C
LLM training in simple, raw C/CUDA
AirLLM 70B inference with single 4GB GPU
Qwen2.5-VL is the multimodal large language model series
High-performance inference framework for large language models
Large Language Model Principles and Practice Tutorial from Scratch
Quick illustration of how one can easily read books together with LLMs
On the Structural Pruning of Large Language Models
State-of-the-art Parameter-Efficient Fine-Tuning
The official implementation of RAPTOR
A simple, performant and scalable Jax LLM
Refer and Ground Anything Anywhere at Any Granularity
Diversity-driven optimization and large-model reasoning ability
Run 100B+ language models at home, BitTorrent-style
Explore large language models in 512MB of RAM
Code for the paper Fine-Tuning Language Models from Human Preferences