OmniRoute is an AI gateway for multi-provider LLMs
LLM training in simple, raw C/CUDA
Self-hosted AI accounting app. LLM analyzer for receipts
AirLLM 70B inference with a single 4GB GPU
Diversity-driven optimization and large-model reasoning ability
Qwen2.5-VL is a multimodal large language model series
The official implementation of RAPTOR
Personal AI Notebooks. Organize files & webpages and generate notes
A simple, performant and scalable Jax LLM
State-of-the-art Parameter-Efficient Fine-Tuning
Collection of tutorials for Prompt Engineering techniques
Apple Intelligence from the command line
A Ruby Implementation of the Model Context Protocol
Large Language Model Principles and Practice Tutorial from Scratch
Quick illustration of how one can easily read books together with LLMs
Dance with Intelligence in Your Code
On the Structural Pruning of Large Language Models
High-performance inference framework for large language models
Refer and Ground Anything Anywhere at Any Granularity
Inference Llama 2 in one file of pure C
Llama 2 Everywhere (L2E)
Run 100B+ language models at home, BitTorrent-style
Explore large language models in 512MB of RAM
Code for the paper Fine-Tuning Language Models from Human Preferences