Learn How LLM Transformer Models Work with Interactive Visualization
Julia Implementation of Transformer models
Fast inference engine for Transformer models
Build your chatbot within minutes on your favorite device
LLM training in simple, raw C/CUDA
Fully automatic censorship removal for language models
BitNet: Scaling 1-bit Transformers for Large Language Models
Go ahead and axolotl questions
Minimal reproduction of OneRec
Tensor library for machine learning
NeurIPS2025 Spotlight] Quantized Attention
Fast and memory-efficient exact attention
LightLLM is a Python-based LLM (Large Language Model) inference
Flux 2 image generation model pure C inference
Repo for SeedVR2 & SeedVR
Unified Multimodal Understanding and Generation Models
Accelerate local LLM inference and finetuning
The most powerful local music generation model
Node.js tool for optimizing SVG files
A Vue3 component library based on Material Design 2 and 3
Open-source GEO content production system with AI tasks
A minimal, responsive and feature-rich Jekyll theme
A theoretical reconstruction of the Claude Mythos architecture
Bringing BERT into modernity via both architecture changes and scaling
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step