Concept

Time Information Fusion in CNNs

Videos are sequences of images, where each image is revealed as time progresses and due to persistence of vision we perceive them as moving images. Videos are represented using 4 dimensional tensors( one channel dimension.one temporal dimension( time ) and two spatial dimension).There are different neural network models that can be used to learn the spatio-temporal features.

  • Single-frame
  • Early Fusion
  • Late Fusion
  • Slow Fusion

0

1

Updated 2021-08-19

Tags

Data Science

Learn After