Deep transfer learning for automatic speech recognition: Towards better generalization

H Kheddar, Y Himeur, S Al-Maadeed, A Amira… - Knowledge-Based …, 2023 - Elsevier
Automatic speech recognition (ASR) has recently become an important challenge when
using deep learning (DL). It requires large-scale training datasets and high computational …

Survey on AI sustainability: emerging trends on learning algorithms and research challenges

Z Chen, M Wu, A Chan, X Li… - IEEE Computational …, 2023 - ieeexplore.ieee.org
Artificial Intelligence (AI) is a fast-growing research and development (R&D) discipline which
is attracting increasing attention because it promises to bring vast benefits for consumers …

Defensive distillation-based adversarial attack mitigation method for channel estimation using deep learning models in next-generation wireless networks

FO Catak, M Kuzlu, E Catak, U Cali, O Guler - IEEE Access, 2022 - ieeexplore.ieee.org
Future wireless networks (5G and beyond), also known as Next Generation or NextG, are
the vision of forthcoming cellular systems, connecting billions of devices and people …

LLMs Meet Multimodal Generation and Editing: A Survey

Y He, Z Liu, J Chen, Z Tian, H Liu, X Chi, R Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
With the recent advancement in large language models (LLMs), there is a growing interest in
combining LLMs with multimodal learning. Previous surveys of multimodal large language …

Defense against adversarial attacks on hybrid speech recognition using joint adversarial fine-tuning with denoiser

S Joshi, S Kataria, Y Shao, P Zelasko, J Villalba… - arXiv preprint arXiv …, 2022 - arxiv.org
Adversarial attacks are a threat to automatic speech recognition (ASR) systems, and it
becomes imperative to propose defenses to protect them. In this paper, we perform …

There is more than one kind of robustness: Fooling whisper with adversarial examples

R Olivier, B Raj - arXiv preprint arXiv:2210.17316, 2022 - arxiv.org
Whisper is a recent Automatic Speech Recognition (ASR) model displaying impressive
robustness to both out-of-distribution inputs and random noise. In this work, we show that …

Transformers: A Security Perspective

BS Latibari, N Nazari, MA Chowdhury, KI Gubbi… - IEEE …, 2024 - ieeexplore.ieee.org
The Transformers architecture has recently emerged as a revolutionary paradigm in the field
of deep learning, particularly excelling in Natural Language Processing (NLP) and …

Representation learning to classify and detect adversarial attacks against speaker and speech recognition systems

J Villalba, S Joshi, P Żelasko, N Dehak - arXiv preprint arXiv:2107.04448, 2021 - arxiv.org
Adversarial attacks have become a major threat for machine learning applications. There is
a growing interest in studying these attacks in the audio domain, eg, speech and speaker …

Arabic Synonym BERT-based Adversarial Examples for Text Classification

N Alshahrani, S Alshahrani, E Wali… - arXiv preprint arXiv …, 2024 - arxiv.org
Text classification systems have been proven vulnerable to adversarial text examples,
modified versions of the original text examples that are often unnoticed by human eyes, yet …

A Tutorial on Clinical Speech AI Development: From Data Collection to Model Validation

SI Ng, L Xu, I Siegert, N Cummins, NR Benway… - arXiv preprint arXiv …, 2024 - arxiv.org
There has been a surge of interest in leveraging speech as a marker of health for a wide
spectrum of conditions. The underlying premise is that any neurological, mental, or physical …