Deep representation learning in speech processing: Challenges, recent advances, and future trends

I Kansizoglou, L Bampis… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

The exploitation of deep neural networks (DNNs) as descriptors in feature learning
challenges enjoys apparent popularity over the past few years. The above tendency focuses …

被引用次数：15 相关文章所有 6 个版本

Speech Enhancement: A Survey of Approaches and Applications

S Chhetri, MS Joshi, CV Mahamuni… - … Conference on Edge …, 2023 - ieeexplore.ieee.org

The paper provides a comprehensive overview of speech enhancement techniques and
their applications. It discusses challenges in non-stationary noise, reverberation, and …

被引用次数：3 相关文章

[HTML] sciencedirect.com

[HTML][HTML] Speech technology in healthcare

P Deepa, R Khilar - Measurement: Sensors, 2022 - Elsevier

As the population ages with advances in technology, health monitoring through early
detection is increasing. There are several approaches to the analysis, monitoring and …

被引用次数：5 相关文章所有 2 个版本

[PDF] arxiv.org

Articulatory representation learning via joint factor analysis and neural matrix factorization

J Lian, AW Black, Y Lu, L Goldstein… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

Articulatory representation learning is the fundamental research in modeling neural speech
production system. Our previous work has established a deep paradigm to decompose the …

被引用次数：5 相关文章所有 6 个版本

[PDF] arxiv.org

A novel policy for pre-trained deep reinforcement learning for speech emotion recognition

T Rajapakshe, R Rana, S Khalifa, J Liu… - Proceedings of the 2022 …, 2022 - dl.acm.org

Deep Reinforcement Learning (deep RL) has gained tremendous success in gaming but it
has rarely been explored for Speech Emotion Recognition (SER). In the RL literature, policy …

被引用次数：15 相关文章所有 6 个版本

[PDF] ieee.org

High-fidelity audio generation and representation learning with guided adversarial autoencoder

KN Haque, R Rana, BW Schuller - IEEE Access, 2020 - ieeexplore.ieee.org

Generating high-fidelity conditional audio samples and learning representation from
unlabelled audio data are two challenging problems in machine learning research. Recent …

被引用次数：17 相关文章所有 8 个版本

[PDF] arxiv.org

Deep neural convolutive matrix factorization for articulatory representation decomposition

J Lian, AW Black, L Goldstein… - arXiv preprint arXiv …, 2022 - arxiv.org

Most of the research on data-driven speech representation learning has focused on raw
audios in an end-to-end manner, paying little attention to their internal phonological or …

被引用次数：7 相关文章所有 9 个版本

[PDF] scielo.br

Artificial intelligence applications in the agriculture 4.0

GAS Megeto, AG Silva, RF Bulgarelli… - Revista Ciência …, 2020 - SciELO Brasil

The usage of digital data is one of the main characteristics of the Agriculture 4.0 era.
Different devices and sensors may be used to capture a variety of types of data that enable …

被引用次数：17 相关文章所有 8 个版本

[PDF] ieee.org

Perceptual-similarity-aware deep speaker representation learning for multi-speaker generative modeling

Y Saito, S Takamichi… - IEEE/ACM Transactions on …, 2021 - ieeexplore.ieee.org

We propose novel deep speaker representation learning that considers perceptual similarity
among speakers for multi-speaker generative modeling. Following its success in accurate …

被引用次数：13 相关文章所有 7 个版本

[PDF] arxiv.org

Configurable privacy-preserving automatic speech recognition

R Aloufi, H Haddadi, D Boyle - arXiv preprint arXiv:2104.00766, 2021 - arxiv.org

Voice assistive technologies have given rise to far-reaching privacy and security concerns.
In this paper we investigate whether modular automatic speech recognition (ASR) can …

被引用次数：12 相关文章所有 5 个版本

高级搜索

QQ 群