Run models like Kimi-K2.5, GLM-5, DeepSeek, gpt-oss, Gemma, Qwen etc.
C#/.NET binding of llama.cpp, including LLaMa/GPT model inference
LLM training code for MosaicML foundation models
An elegent pytorch implement of transformers
A multimodal model for brain response prediction
Open source alternative to ChatGPT that runs 100% offline
Oobabooga - The definitive Web UI for local AI, with powerful features
Distribute and run LLMs with a single file
Open-source, high-performance AI model with advanced reasoning
Towards Human-Sounding Speech
Alibaba's high-performance LLM inference engine for diverse apps
Jlama is a modern LLM inference engine for Java
LLM Finetuning with peft
Run Local LLMs on Any Device. Open-source
LLM Frontend for Power Users
LightLLM is a Python-based LLM (Large Language Model) inference
Go ahead and axolotl questions
Code to accompany "A Method for Animating Children's Drawings"
Clippy, now with some AI
Evaluate and compare LLM outputs, catch regressions, improve prompts
Operating LLMs in production
Fast State-of-the-Art Static Embeddings
SGLang is a fast serving framework for large language models
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Easiest and laziest way for building multi-agent LLMs applications