TimeSformer is a vision transformer architecture for video that extends the standard attention mechanism into spatiotemporal attention. The model alternates attention along spatial and temporal dimensions (or designs variants like divided attention) so that it can capture both appearance and motion cues in video. Because the attention is global across frames, TimeSformer can reason about dependencies across long time spans, not just local neighborhoods. The official implementation in PyTorch provides configurations, pretrained models, and training scripts that make it straightforward to evaluate or fine-tune on video datasets. TimeSformer was influential in showing that pure transformer architectures—without convolutional backbones—can perform strongly on video classification tasks. Its flexible attention design allows experimenting with different factoring (spatial-then-temporal, joint, etc.) to trade off compute, memory, and accuracy.

Features

  • Spatiotemporal transformer attention for video modeling
  • Variants: divided spatial/temporal attention and joint attention schemas
  • PyTorch reference implementation with pretrained weights and scripts
  • Ability to reason about long-range temporal dependencies globally
  • Configurable parameters for patch size, frames, embedding dimension, and head count
  • Support for fine-tuning across video classification and recognition benchmarks

Project Samples

Project Activity

See All Activity >

Categories

Video, AI Models

License

Creative Commons Attribution License

Follow TimeSformer

TimeSformer Web Site

Other Useful Business Software
Our Free Plans just got better! | Auth0 Icon
Our Free Plans just got better! | Auth0

With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
Try free now
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of TimeSformer!

Additional Project Details

Programming Language

Python

Related Categories

Python Video Software, Python AI Models

Registered

2025-10-07