Multimodal machine learning is a vibrant multi-disciplinary research field that aims to design computer agents with intelligent capabilities such as understanding, reasoning, and learning …
We propose a novel supervised learning technique for summarizing videos by automatically selecting keyframes or key subshots. Casting the task as a structured prediction problem …
Video contents are inherently heterogeneous. To exploit different feature modalities in a diverse video collection for video summarization, we propose to formulate the task as a multi …
Recent advances in sensor manufacture and computer vision technologies have simulated the applications of intelligent transportation systems, while a key yet under-addressed issue …
S Lal, S Duggal, I Sreedevi - 2019 IEEE Winter Conference on …, 2019 - ieeexplore.ieee.org
Automatically generating the summary of a video is a challenging problem due to its subjective nature. Most of the previous works in the field consider the entire video to extract …
Video abstraction is an interesting topic that aims at briefly representing the entire video stream by producing a short summary either statically or dynamically. In this paper, we …
RF Rachmadi, K Uchimura… - 2016 IEEE Region 10 …, 2016 - ieeexplore.ieee.org
Shared human actions in the video are the biggest problem for video classification system. For example, long jump sports video will share a running action with the long jump or …
Video summarization aims to extract representative frames to retain high-level information. Increasing concerns about privacy issues have been raised because conventional large …
Building multisensory AI systems that learn from multiple sensory inputs such as text, speech, video, real-world sensors, wearable devices, and medical data holds great promise …