Open-source multi-speaker long-form text-to-speech model
An experimental version of the DeepSeek model
Hackable and optimized Transformers building blocks
Programmatic access to the AlphaGenome model
Z80-μLM is a 2-bit quantized language model
Achieving 3×+ generation speedup on reasoning tasks
Official DeiT repository
Code release for ConvNeXt V2 model
Learning to Act by Watching Unlabeled Online Videos
Code for reproducing key results in the paper
Code for "Image Generation from Scene Graphs", Johnson et al., CVPR 2018