In this work, we show how to co-train a classifier for active speaker detection using audio-visual data. First, audio Voice Activity Detection (VAD) is used to train a personalized video …
J Wang - US Patent 10,667,069, 2020 - Google Patents
Embodiments of source separation for reverberant environment are disclosed. According to a method, first microphone signals for each individual one of at least one source are …
In this article, we propose a new method for joint cochannel speaker separation and recognition called adaptive-dictionary non-negative matrix deconvolution (DANMD). This …
As children we have all been taught to listen politely to others before speaking ourselves. Unfortunately, sometimes this lesson is forgotten, causing simultaneous or overlapping …
J Wang - US Patent 10,904,688, 2021 - Google Patents
Embodiments of source separation for reverberant environment are disclosed. According to a method, first microphone signals for each individual one of at least one source are …
This thesis describes a number of algorithms used for source separation and speaker recognition, based on a literature study. One of these methods …