MiniMax-M2, a model built for Max coding & agentic workflows
Open-weight, large-scale hybrid-attention reasoning model
Tool for exploring and debugging transformer model behaviors
My personal Claude Code configuration
Unified Multimodal Understanding and Generation Models
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Open-source industrial-grade ASR models
Towards self-verifiable mathematical reasoning
Fast and Universal 3D reconstruction model for versatile tasks
ICLR2024 Spotlight: curation/training code, metadata, distribution
A Conversational Speech Generation Model
Python example app from the OpenAI API quickstart tutorial
800,000 step-level correctness labels on LLM solutions to MATH problem
llama.go is like llama.cpp in pure Golang
Learning to Act by Watching Unlabeled Online Videos
PyTorch implementation of YOLOv4
Reproduces results of "Fixing the train-test resolution discrepancy"
React app for inspecting, building and debugging with the Realtime API
Efficient MoE reasoning model for coding and math workloads
Portuguese ASR model fine-tuned on XLSR-53 for 16kHz audio input
Russian ASR model fine-tuned on Common Voice and CSS10 datasets