Video classification with CNNs: Using the codec as a spatio-temporal activity sensor

A Chadha, A Abbas… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
We investigate video classification via a two-stream convolutional neural network (CNN)
design that directly ingests information extracted from compressed video bitstreams. Our
approach begins with the observation that all modern video codecs divide the input frames
into macroblocks (MBs). We demonstrate that selective access to MB motion vector (MV)
information within compressed video bitstreams can also provide for selective, motion-
adaptive, MB pixel decoding (aka, MB texture decoding). This in turn allows for the …
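
As an illustration of the motion-adaptive selection the abstract describes, the following minimal sketch marks which macroblocks would have their texture decoded, based on motion vector magnitude. The thresholding rule, the function names, and the per-MB (dx, dy) grid layout are assumptions made for illustration; the snippet does not state the authors' actual selection criterion.

```python
import math

def select_active_macroblocks(motion_vectors, threshold=1.0):
    """Return a boolean grid that is True where an MB's MV magnitude exceeds threshold.

    motion_vectors: 2-D list of (dx, dy) tuples, one per macroblock.
    This layout and the magnitude threshold are hypothetical, not taken from the paper.
    """
    return [
        [math.hypot(dx, dy) > threshold for (dx, dy) in row]
        for row in motion_vectors
    ]

def decoding_fraction(mask):
    """Fraction of macroblocks selected for pixel (texture) decoding."""
    total = sum(len(row) for row in mask)
    active = sum(cell for row in mask for cell in row)
    return active / total if total else 0.0

# Toy 2x3 grid of per-macroblock motion vectors (made-up values).
mvs = [
    [(0, 0), (0, 0), (3, 4)],
    [(0, 1), (2, 2), (0, 0)],
]
mask = select_active_macroblocks(mvs, threshold=1.0)
```

Under this assumed rule, only the macroblocks flagged in `mask` would be passed to the pixel decoder, and `decoding_fraction(mask)` gives a rough estimate of how much texture decoding is still required.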

[CITATION][C] Video Classification With CNNs: Using the Codec as a Spatio-Temporal Activity Sensor

C Aaron - IEEE Transactions on Circuits and Systems for Video …