Using adapters to overcome catastrophic forgetting in end-to-end automatic speech recognition

S Vander Eeckt, H Van Hamme - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Learning a set of tasks in sequence remains a challenge for artificial neural networks, which,
in such scenarios, tend to suffer from Catastrophic Forgetting (CF). The same applies to End …

Sequence-level knowledge distillation for class-incremental end-to-end spoken language understanding

U Cappellazzo, M Yang, D Falavigna… - arXiv preprint arXiv …, 2023 - arxiv.org
The ability to learn new concepts sequentially is a major weakness for modern neural
networks, which hinders their use in non-stationary environments. Their propensity to fit the …

Augmented Memory Replay-based Continual Learning Approaches for Network Intrusion Detection

S Channappayya, BR Tamma - Advances in Neural …, 2024 - proceedings.neurips.cc
Intrusion detection is a form of anomalous activity detection in communication network traffic.
Continual learning (CL) approaches to the intrusion detection task accumulate old …

Online continual learning in keyword spotting for low-resource devices via pooling high-order temporal statistics

U Michieli, PP Parada, M Ozay - arXiv preprint arXiv:2307.12660, 2023 - arxiv.org
Keyword Spotting (KWS) models on embedded devices should adapt fast to new user-
defined words without forgetting previous ones. Embedded devices have limited storage …

Continual learning for on-device speech recognition using disentangled conformers

A Diwan, CF Yeh, WN Hsu, P Tomasello… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Automatic speech recognition research focuses on training and evaluating on static
datasets. Yet, as speech models are increasingly deployed on personal devices, such …

Continual Contrastive Spoken Language Understanding

U Cappellazzo, E Fini, M Yang, D Falavigna… - arXiv preprint arXiv …, 2023 - arxiv.org
Recently, neural networks have shown impressive progress across diverse fields, with
speech processing being no exception. However, recent breakthroughs in this area require …

Leveraging Cross-Utterance Context For ASR Decoding

R Flynn, A Ragni - arXiv preprint arXiv:2306.16903, 2023 - arxiv.org
While external language models (LMs) are often incorporated into the decoding stage of
automated speech recognition systems, these models usually operate with limited context …

Emotion Recognition in the Real-World: Passively Collecting and Estimating Emotions from Natural Speech Data of Individuals with Bipolar Disorder

EM Provost, SH Sperry, J Tavernor… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Emotions provide critical information regarding a person's health and well-being. Therefore,
the ability to track emotion and patterns in emotion over time could provide new …

[PDF][PDF] Episodic Memory For Domain-Adaptable, Robust Speech Emotion Recognition

J Tavernor, M Perez, EM Provost - Interspeech, 2023 - emp.engin.umich.edu
Emotion conveys abundant information that can improve the user experience of various
automated systems, in addition to communicating information important for managing …

Evaluating and Improving Continual Learning in Spoken Language Understanding

M Yang, X Li, U Cappellazzo, S Watanabe… - arXiv preprint arXiv …, 2024 - arxiv.org
Continual learning has emerged as an increasingly important challenge across various
tasks, including Spoken Language Understanding (SLU). In SLU, its objective is to …