P Li, Z Chen, LT Yang, Q Zhang… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Currently, a large number of industrial data, usually referred to as big data, are collected from Internet of Things (IoT). Big data are typically heterogeneous, i.e., each object in big datasets …
Active speaker detection is an important component in video analysis algorithms for applications such as speaker diarization, video re-targeting for meetings, speech …
Biometric recognition is a trending technology that uses unique characteristics data to identify or verify/authenticate security applications. Amidst the classically used biometrics …
Speaker identification refers to the task of localizing the face of a person who has the same identity as the ongoing voice in a video. This task not only requires collective perception …
H Yang, T Wang, L Yin - Proceedings of the 28th ACM international …, 2020 - dl.acm.org
Multimodal facial action units (AU) recognition aims to build models that are capable of processing, correlating, and integrating information from multiple modalities (i.e., 2D images …
The task of searching certain people in videos has seen increasing potential in real-world applications, such as video organization and editing. Most existing approaches are devised …
Smart Internet of Things (smart IoT) have emerged as a transformative computing paradigm recently. This new approach has made great contributions in the area of cyber …
Y Ding, Y Xu, SX Zhang, Y Cong… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Speaker diarization, which is to find the speech segments of specific speakers, has been widely used in human-centered applications such as video conferences or human-computer …
N Ayari, H Abdelkawy, A Chibani… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Endowing ubiquitous robots with cognitive capabilities for recognizing emotions, sentiments, affects, and moods of humans in their context is an important challenge, which requires …