Audio-based automatic speech recognition (ASR) degrades significantly in noisy environments and is particularly vulnerable to interfering speech, as the model cannot …
C Huang, Y Tian, A Kumar… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Humans naturally perceive surrounding scenes by unifying sound and sight in a first-person view. Likewise, machines are advanced to approach human intelligence by learning with …
What will the future be? We wonder! In this survey, we explore the gap between current research in egocentric vision and the ever-anticipated future, where wearable computing …
Augmented reality devices have the potential to enhance human perception and enable other assistive functionalities in complex conversational environments. Effectively capturing …
L McCormack, A Politis, R Gonzalez… - … on Audio, Speech …, 2022 - ieeexplore.ieee.org
This article proposes a parametric signal-dependent method for the task of encoding microphone array signals into Ambisonic signals. The proposed method is presented and …
In a noisy conversation environment such as a dinner party, people often exhibit selective auditory attention, or the ability to focus on a particular speaker while tuning out others …
WN Hsu, T Remez, B Shi… - Proceedings of the …, 2023 - openaccess.thecvf.com
Prior works on improving speech quality with visual input typically study each type of auditory distortion separately (eg, separation, inpainting, video-to-speech) and present …
H Lu, WO Brimijoin - Trends in Hearing, 2022 - journals.sagepub.com
To optimally improve signal-to-noise ratio in noisy environments, a hearing assistance device must correctly identify what is signal and what is noise. Many of the biosignal-based …
It is well known that microphone arrays can be used to enhance a target speaker in a noisy, reverberant environment, with both spatial (eg beamforming) and statistical (eg source …