Difference between single frame network and slow fusion (motion-aware) networks
Motion-aware network should in theory benefit from motion information. In fact, the slow fusion networks are more likely to underperform when there is camera motion present, implying they struggle to learn complete invariance across all possible angles and speeds of camera translation and zoom.

0
1
Tags
Deep Learning
Data Science
Related
Time Information Fusion in CNNs
Multiresolution CNNs
Quantitative Findings of the Sports-1M video classification experiments using CNNs and feature histogram baseline models
Difference between single frame network and slow fusion (motion-aware) networks
Training CNN Models for Sports-1M Video Classification
Datasets used for Experimentation
Transfer Learning Experiments on UCF-101