Self-supervised deep correlational multi-view clustering

B Xin, S Zeng, X Wang - 2021 International Joint Conference …, 2021 - ieeexplore.ieee.org
In conventional unsupervised multi-view clustering (MVC), learning of representations from
heterogeneous multiview data and its subsequent clustering are often separately optimized …

Articulatory and spectrum information fusion based on deep recurrent neural networks

J Yu, K Markov, T Matsui - IEEE/ACM Transactions on Audio …, 2019 - ieeexplore.ieee.org
Many studies have shown that articulatory features can significantly improve the
performance of automatic speech recognition systems. Unfortunately, such features are not …

Speech recognition using cepstral articulatory features

S Najnin, B Banerjee - Speech Communication, 2019 - Elsevier
Though speech recognition has been widely investigated in the past decades, the role of
articulation in recognition has received scant attention. Recognition accuracy increases …

Speech representation learning: Learning bidirectional encoders with single-view, multi-view, and multi-task methods

Q Tang - arXiv preprint arXiv:2308.00129, 2023 - arxiv.org
This thesis focuses on representation learning for sequence data over time or space, aiming
to improve downstream sequence prediction tasks by using the learned representations …

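Apprentissage auto-supervisé des relations entre sons, gestes articulatoires et unités de la parole pour le contrôle de la production: vers un agent apprenant à parler [Self-supervised learning of the relations between sounds, articulatory gestures, and speech units for production control: towards an agent learning to speak]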

MA Georges - 2023 - theses.hal.science
This thesis aims to study, through modeling and simulation, the mechanisms for
learning the relations between speech sounds, the gestures …

[PDF] Phonemic learning based on articulatory-acoustic speech representations.

H Rasilo - CogSci, 2020 - cognitivesciencesociety.org
Infants learn to imitate and recognize words at an early age, but phonemic awareness
develops at a later age, guided by acquisition of literacy for example. We investigate a …

[PDF] Multi-view representation learning for speech (and language)

K Livescu - lxmls.it.pt
• This is the best mattress I have ever had. It is a perfect combination of firmness and
support. I have never slept better....• I hate this mattress. I can't believe I bought it. It seemed …

[PDF] ディープラーニングに基づくマルチモーダル情報融合の研究 [Research on multimodal information fusion based on deep learning]

于建国, ユージエングオ - 2019 - u-aizu.repo.nii.ac.jp
In order to survive in this complex world, we have to constantly obtain information about
events happening around us. A modality is a particular form of signal, from which we can …