Y Liu, J Ma, Y Xie, X Yang, X Tao, L Peng, W Gao - Neurocomputing, 2022 - Elsevier
… Next, the context c is passed to a spatial pooling layer to get a feature vector, and then a
fully-connected layer and a multi-class softmax function outputs the probabilities for video action …