X Ai,
VS Sheng, C Li, Z Cui - arXiv preprint arXiv:2208.07216, 2022 - arxiv.org
In order to deal with variant-length long videos, prior works extract multi-modal features and
fuse them to predict students' engagement intensity. In this paper, we present a new end-to …