Open-source multi-speaker long-form text-to-speech model
An experimental version of DeepSeek model
Programmatic access to the AlphaGenome model
Z80-μLM is a 2-bit quantized language model
Achieving 3+ generation speedup on reasoning tasks
Hackable and optimized Transformers building blocks
Official DeiT repository
Code release for ConvNeXt V2 model
Learning to Act by Watching Unlabeled Online Videos
Code for reproducing key results in the paper