Open-source multi-speaker long-form text-to-speech model
An experimental version of the DeepSeek model
Achieving 3x+ generation speedup on reasoning tasks
A CNN model that predicts human joints from RGB images of a person
Official DeiT repository
Python example app from the OpenAI API quickstart tutorial
Code release for the ConvNeXt V2 model
Learning to Act by Watching Unlabeled Online Videos
Facebook AI Research Sequence-to-Sequence Toolkit
Code for reproducing key results in the paper
FP8 Qwen model for efficient multimodal coding and agent tasks
685B-parameter model with improved agent capabilities and consistency
Efficient 8B multimodal model tuned for advanced reasoning tasks