TimeSformer is a vision transformer architecture for video that extends self-attention from the patches of a single image to spatiotemporal attention over patches drawn from multiple frames. Its divided attention variant applies temporal attention and spatial attention as separate steps within each block, while the joint variant attends over all patches of all frames at once; both let the model capture appearance and motion cues. Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official PyTorch implementation provides configurations, pretrained models, and training scripts for evaluating or fine-tuning on video datasets. TimeSformer was influential in showing that pure transformer architectures, without convolutional backbones, can perform strongly on video classification tasks. Its flexible attention design allows experimenting with different factorizations (divided space-time, joint, etc.) to trade off compute, memory, and accuracy.
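
The compute trade-off between joint and divided attention can be illustrated with a rough count of how many keys each patch query is compared against. The sketch below is not taken from the TimeSformer codebase: the function names are hypothetical, the clip shape (196 patches per frame, 8 frames) is an assumed example, and the classification token is ignored for simplicity.

```python
def joint_keys_per_query(patches_per_frame: int, num_frames: int) -> int:
    """Joint attention: a query attends to every patch in every frame."""
    return patches_per_frame * num_frames


def divided_keys_per_query(patches_per_frame: int, num_frames: int) -> int:
    """Divided attention: one temporal step over the same patch position
    in all frames, plus one spatial step over all patches in the same frame."""
    return num_frames + patches_per_frame


def total_comparisons(keys_per_query: int, patches_per_frame: int, num_frames: int) -> int:
    """Query-key comparisons per attention block for the whole clip."""
    num_queries = patches_per_frame * num_frames
    return num_queries * keys_per_query


# Assumed example clip: 196 patches per frame (a 14 x 14 grid) and 8 frames.
PATCHES, FRAMES = 196, 8

joint_keys = joint_keys_per_query(PATCHES, FRAMES)
divided_keys = divided_keys_per_query(PATCHES, FRAMES)

joint_total = total_comparisons(joint_keys, PATCHES, FRAMES)
divided_total = total_comparisons(divided_keys, PATCHES, FRAMES)

print(f"joint:   {joint_keys} keys per query, {joint_total} comparisons per block")
print(f"divided: {divided_keys} keys per query, {divided_total} comparisons per block")
```

Under these assumptions the joint cost grows with the product of patches and frames, while the divided cost grows with their sum, which is why divided attention scales better to more frames or higher resolution.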

Features

  • Spatiotemporal transformer attention for video modeling
  • Variants: divided spatial/temporal attention and joint attention schemes
  • PyTorch reference implementation with pretrained weights and scripts
  • Ability to reason about long-range temporal dependencies globally
  • Configurable parameters for patch size, frames, embedding dimension, and head count
  • Support for fine-tuning across video classification and recognition benchmarks
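
How the configurable parameters listed above interact can be sketched with a small configuration object. This is an illustration only: the class name, field names, and default values (224-pixel input, 16-pixel patches, 8 frames, 768-dimensional embeddings, 12 heads) are assumptions chosen for the example and are not the configuration keys of the official implementation. It also assumes a single classification token is prepended to the sequence.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class VideoTransformerConfig:
    """Hypothetical configuration; names and defaults are illustrative."""
    image_size: int = 224   # assumed square input resolution
    patch_size: int = 16    # side length of each square patch
    num_frames: int = 8     # frames sampled per clip
    embed_dim: int = 768    # token embedding dimension
    num_heads: int = 12     # attention heads per block

    def __post_init__(self) -> None:
        # Patches must tile the frame exactly.
        if self.image_size % self.patch_size != 0:
            raise ValueError("image_size must be divisible by patch_size")
        # Each head receives an equal slice of the embedding.
        if self.embed_dim % self.num_heads != 0:
            raise ValueError("embed_dim must be divisible by num_heads")

    @property
    def patches_per_frame(self) -> int:
        side = self.image_size // self.patch_size
        return side * side

    @property
    def head_dim(self) -> int:
        return self.embed_dim // self.num_heads

    @property
    def sequence_length(self) -> int:
        # All patch tokens across frames, plus one assumed classification token.
        return self.num_frames * self.patches_per_frame + 1


config = VideoTransformerConfig()
print(config.patches_per_frame, config.head_dim, config.sequence_length)
```

Doubling `num_frames` or halving `patch_size` in this sketch shows how quickly the token sequence grows, which is the quantity that drives attention memory and compute.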

Categories

Video, AI Models

License

Creative Commons Attribution License

Additional Project Details

Programming Language

Python

Related Categories

Python Video Software, Python AI Models

Registered

2025-10-07