DeiT (Data-efficient Image Transformers) shows that Vision Transformers can be trained competitively on ImageNet-1k without external data by combining a strong training recipe with knowledge distillation. Its key idea is a distillation strategy built around a learnable “distillation token”: an extra input token, used alongside the class token, whose output is trained to match the teacher's prediction. This lets a transformer learn effectively from a CNN or transformer teacher on a modest-scale dataset. The project provides compact ViT variants (Tiny/Small/Base) with strong accuracy–throughput trade-offs, making transformers practical without massive pretraining. Training relies on carefully tuned augmentations, regularization, and optimization schedules to stabilize learning and improve sample efficiency. The repo offers pretrained checkpoints, reference scripts, and ablation studies that clarify which ingredients matter most for data-efficient ViT training.

Features

  • Data-efficient ViT training that works on ImageNet-1k from scratch
  • Knowledge distillation with a dedicated distillation token
  • Compact model zoo (Tiny/Small/Base) with strong accuracy–speed balance
  • Clear training recipes with augmentations and regularization schedules
  • Pretrained checkpoints and reproducible reference scripts
  • Ablations and guidelines to adapt DeiT to new datasets and tasks
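
The distillation-token idea can be illustrated with a small, dependency-free sketch. It assumes the "hard" distillation variant, in which the class-token head is trained against the ground-truth label and the distillation-token head against the teacher's top-1 prediction, with the two cross-entropy terms weighted equally by default. The function names (hard_distillation_loss, fused_prediction) and the alpha parameter are illustrative only and are not taken from the DeiT repository; averaging the two heads' logits at inference is likewise shown as one simple option, not as the project's definitive behavior.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return [z - log_sum for z in logits]


def cross_entropy(logits, target):
    """Cross-entropy of one sample against an integer class index."""
    return -log_softmax(logits)[target]


def argmax(values):
    """Index of the largest entry."""
    return max(range(len(values)), key=lambda i: values[i])


def hard_distillation_loss(cls_logits, dist_logits, teacher_logits, label, alpha=0.5):
    """Hypothetical helper: combine the two heads' losses.

    cls_logits     -- student output from the class token
    dist_logits    -- student output from the distillation token
    teacher_logits -- output of the (frozen) teacher model
    label          -- ground-truth class index
    alpha          -- weight of the distillation term (assumed 0.5)
    """
    teacher_label = argmax(teacher_logits)  # teacher's hard decision
    loss_cls = cross_entropy(cls_logits, label)
    loss_dist = cross_entropy(dist_logits, teacher_label)
    return (1 - alpha) * loss_cls + alpha * loss_dist


def fused_prediction(cls_logits, dist_logits):
    """Assumed inference rule: average both heads' logits, then take argmax."""
    averaged = [(a + b) / 2 for a, b in zip(cls_logits, dist_logits)]
    return argmax(averaged)


if __name__ == "__main__":
    cls_out = [2.0, 0.0, 1.0]
    dist_out = [0.0, 0.5, 3.0]
    teacher_out = [0.1, 0.2, 4.0]
    print(hard_distillation_loss(cls_out, dist_out, teacher_out, label=0))
    print(fused_prediction(cls_out, dist_out))
```

Keeping the two heads separate lets the student fit the true label and the teacher's decision with different outputs, so the two targets can disagree on a given image without one overwriting the other.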

Categories

AI Models

License

Apache License V2.0

Additional Project Details

Programming Language

Python

Related Categories

Python AI Models

Registered

2025-10-07