Accelerate local LLM inference and finetuning
Hackable and optimized Transformers building blocks
Diffusion Transformer with Fine-Grained Chinese Understanding
A multimodal model for brain response prediction
LLM101n: Let's build a Storyteller
AI-driven neuro-symbolic solver for high-school geometry problems
Video Object and Interaction Deletion
Interpretable prompting and models for NLP
Multimodal Diffusion with Representation Alignment
Robust Speech Recognition via Large-Scale Weak Supervision
Drop-in replacement for standard residual connections in Transformers
A simple but complete full-attention transformer
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Foundation Model for Tabular Data
An RWKV management and startup tool, fully automated, only 8MB
LLM training in simple, raw C/CUDA
Machine Learning Journal for Intermediate to Advanced Topics
Mainly records knowledge and interview questions
Fully automatic censorship removal for language models
Official Python inference and LoRA trainer package
Bringing BERT into modernity via both architecture changes and scaling
TigerBot: A multilingual, multi-task LLM
Inference script for Oasis 500M
HY-Motion model for 3D character animation generation
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model