...It requires building both the library and a patched version of FFmpeg to fully enable hardware features. Overall, it extends FFmpeg functionality to better suit embedded GPU-powered environments.
Non-local Neural Networks for Video Classification
video-nonlocal-net implements Non-local Neural Networks for video understanding, adding long-range dependency modeling to 2D/3D ConvNet backbones. Non-local blocks compute attention-like responses across all positions in space-time, allowing a feature at one frame and location to aggregate information from distant frames and regions. This formulation improves action recognition and spatiotemporal reasoning, especially for classes requiring context beyond short temporal windows. The repo...