Time Information Fusion in CNNs
Videos are sequences of images, where each image is revealed as time progresses and due to persistence of vision we perceive them as moving images. Videos are represented using 4 dimensional tensors( one channel dimension.one temporal dimension( time ) and two spatial dimension).There are different neural network models that can be used to learn the spatio-temporal features.
- Single-frame
- Early Fusion
- Late Fusion
- Slow Fusion
0
1
Tags
Data Science
Related
Time Information Fusion in CNNs
Multiresolution CNNs
Quantitative Findings of the Sports-1M video classification experiments using CNNs and feature histogram baseline models
Difference between single frame network and slow fusion (motion-aware) networks
Training CNN Models for Sports-1M Video Classification
Datasets used for Experimentation
Transfer Learning Experiments on UCF-101