A systematic literature review on multimodal machine learning: Applications, challenges, gaps and future directions

A Barua, MU Ahmed, S Begum - IEEE Access, 2023 - ieeexplore.ieee.org
Multimodal machine learning (MML) is a tempting multidisciplinary research area where
heterogeneous data from multiple modalities and machine learning (ML) are combined to …

A survey of content-aware video analysis for sports

HC Shih - IEEE Transactions on circuits and systems for video …, 2017 - ieeexplore.ieee.org
Sports data analysis is becoming increasingly large scale, diversified, and shared, but
difficulty persists in rapidly accessing the most crucial information. Previous surveys have …

Bayesian network-based customized highlight generation for broadcast soccer videos

MH Kolekar, S Sengupta - IEEE Transactions on Broadcasting, 2015 - ieeexplore.ieee.org
Sports highlight generation techniques aim at condensing a full-length video to a
significantly shortened version that still preserves the main interesting content of the original …

Crowdsourced time-sync video tagging using temporal and personalized topic modeling

B Wu, E Zhong, B Tan, A Horner, Q Yang - Proceedings of the 20th ACM …, 2014 - dl.acm.org
Time-sync video tagging aims to automatically generate tags for each video shot. It can
improve the user's experience in previewing a video's timeline structure compared to …

Deep unsupervised multi-view detection of video game stream highlights

C Ringer, MA Nicolaou - … of the 13th International Conference on the …, 2018 - dl.acm.org
We consider the problem of automatic highlight-detection in video game streams. Currently,
the vast majority of highlight-detection systems for games are triggered by the occurrence of …

A new multi-modal approach to bib number/text detection and recognition in Marathon images

P Shivakumara, R Raghavendra, L Qin, KB Raja… - Pattern Recognition, 2017 - Elsevier
Bib number/text detection and recognition in Marathon natural images is challenging
because of unconstrained poses created by background and bib number font variations …

VSTAR: visual semantic thumbnails and tAgs revitalization

S Carta, A Giuliani, L Piano, AS Podda… - Expert Systems with …, 2022 - Elsevier
Nowadays, video-sharing portals' popularity has entailed massive growth in data uploads
over the Internet. For several applications (eg, browsing, retrieval, or recommendation of …

Multimodal fusion of speech and text using semi-supervised LDA for indexing lecture videos

M Husain, SM Meena - 2019 National Conference on …, 2019 - ieeexplore.ieee.org
Lecture videos are the most popular learning materials due to their pedagogical benefits.
However, accessing a topic or subtopic of interest requires manual examination of each …

Event detection and highlight detection of broadcasted game videos

WT Chu, YC Chou - Proceedings of the 2nd Workshop on Computational …, 2015 - dl.acm.org
Efficient access of game videos is urgently demanded due to the emergence of live
streaming platforms and the explosive numbers of gamers and viewers. In this work we …

On broadcasted game video analysis: event detection, highlight detection, and highlight forecast

WT Chu, YC Chou - Multimedia Tools and Applications, 2017 - Springer
Efficient access to broadcasted computer game videos is urgently demanded due to the
emergence of live streaming platforms. The popularity of game video streaming builds a big …