Do neural network weights account for classes centers?

I Kansizoglou, L Bampis… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
The exploitation of deep neural networks (DNNs) as descriptors in feature learning
challenges enjoys apparent popularity over the past few years. The above tendency focuses …

Speech Enhancement: A Survey of Approaches and Applications

S Chhetri, MS Joshi, CV Mahamuni… - … Conference on Edge …, 2023 - ieeexplore.ieee.org
The paper provides a comprehensive overview of speech enhancement techniques and
their applications. It discusses challenges in non-stationary noise, reverberation, and …

[HTML][HTML] Speech technology in healthcare

P Deepa, R Khilar - Measurement: Sensors, 2022 - Elsevier
As the population ages with advances in technology, health monitoring through early
detection is increasing. There are several approaches to the analysis, monitoring and …

Articulatory representation learning via joint factor analysis and neural matrix factorization

J Lian, AW Black, Y Lu, L Goldstein… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Articulatory representation learning is the fundamental research in modeling neural speech
production system. Our previous work has established a deep paradigm to decompose the …

A novel policy for pre-trained deep reinforcement learning for speech emotion recognition

T Rajapakshe, R Rana, S Khalifa, J Liu… - Proceedings of the 2022 …, 2022 - dl.acm.org
Deep Reinforcement Learning (deep RL) has gained tremendous success in gaming but it
has rarely been explored for Speech Emotion Recognition (SER). In the RL literature, policy …

High-fidelity audio generation and representation learning with guided adversarial autoencoder

KN Haque, R Rana, BW Schuller - IEEE Access, 2020 - ieeexplore.ieee.org
Generating high-fidelity conditional audio samples and learning representation from
unlabelled audio data are two challenging problems in machine learning research. Recent …

Deep neural convolutive matrix factorization for articulatory representation decomposition

J Lian, AW Black, L Goldstein… - arXiv preprint arXiv …, 2022 - arxiv.org
Most of the research on data-driven speech representation learning has focused on raw
audios in an end-to-end manner, paying little attention to their internal phonological or …

Artificial intelligence applications in the agriculture 4.0

GAS Megeto, AG Silva, RF Bulgarelli… - Revista Ciência …, 2020 - SciELO Brasil
The usage of digital data is one of the main characteristics of the Agriculture 4.0 era.
Different devices and sensors may be used to capture a variety of types of data that enable …

Perceptual-similarity-aware deep speaker representation learning for multi-speaker generative modeling

Y Saito, S Takamichi… - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org
We propose novel deep speaker representation learning that considers perceptual similarity
among speakers for multi-speaker generative modeling. Following its success in accurate …

Configurable privacy-preserving automatic speech recognition

R Aloufi, H Haddadi, D Boyle - arXiv preprint arXiv:2104.00766, 2021 - arxiv.org
Voice assistive technologies have given rise to far-reaching privacy and security concerns.
In this paper we investigate whether modular automatic speech recognition (ASR) can …