Official DeiT repository
Code for the paper Hybrid Spectrogram and Waveform Source Separation
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
Implementation of model parallel autoregressive transformers on GPUs
Code release for ConvNeXt V2 model
Learning to Act by Watching Unlabeled Online Videos
PyTorch implementation of MAE
Environment generation code for the paper "Emergent Tool Use"
Learning Continuous Signed Distance Functions for Shape Representation
Generate embeddings from large-scale graph-structured data
A library for Multilingual Unsupervised or Supervised word Embeddings
Code for reproducing key results in the paper
Dual LSTM Encoder for Dialog Response Generation
High-compute ultra-reasoning model surpassing model surpassing GPT-5
JetBrains’ 4B parameter code model for completions
High-efficiency reasoning and agentic intelligence model