Learn How LLM Transformer Models Work with Interactive Visualization
Julia implementation of Transformer models
Fast inference engine for Transformer models
Build your chatbot within minutes on your favorite device
Fully automatic censorship removal for language models
LLM training in simple, raw C/CUDA
Go ahead and axolotl questions
Tensor library for machine learning
BitNet: Scaling 1-bit Transformers for Large Language Models
Minimal reproduction of OneRec
A theoretical reconstruction of the Claude Mythos architecture
[NeurIPS2025 Spotlight] Quantized Attention
Fast and memory-efficient exact attention
LightLLM is a Python-based LLM (Large Language Model) inference
Flux 2 image generation model pure C inference
Accelerate local LLM inference and finetuning
Repo for SeedVR2 & SeedVR
Unified Multimodal Understanding and Generation Models
The most powerful local music generation model
Node.js tool for optimizing SVG files
Open-source GEO content production system with AI tasks
A Vue3 component library based on Material Design 2 and 3
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Bringing BERT into modernity via both architecture changes and scaling
Python library for portfolio optimization built on top of scikit-learn