Ongoing research training transformer models at scale
Train a 26M-parameter GPT from scratch in just 2h
Research-oriented chatbot framework
Open-source observability for your LLM application
Framework to easily create LLM powered bots over any dataset
Technical principles related to large models
LLM abstractions that aren't obstructions
Open-source, local-first memory for any tool-capable LLM agent
Integrating LLMs into structured NLP pipelines
Text-space optimizer that trains reusable natural-language skills
A straightforward method for training your LLM
Play ChatGPT and other LLM with Xiaomi AI Speaker
Claude + Obsidian knowledge companion
Machine Learning Journal for Intermediate to Advanced Topics
A Survey of Large Language Models
Language-model investigation agent with a terminal UI
950 line, minimal, extensible LLM inference engine built from scratch
Adding guardrails to large language models
Seamlessly integrate LLMs into scikit-learn
State-of-the-art Parameter-Efficient Fine-Tuning
Build a modern LLM from scratch. Every line commented
Multi-source content processor for NotebookLM
Test-Time Reinforcement Learning
Bridging LLM and Recommender System
Semi-Structured Agentic Framework. Workflows build themselves