Modeling spoken information queries for virtual assistants: Open problems, challenges and opportunities

C Van Gysel - Proceedings of the 46th International ACM SIGIR …, 2023 - dl.acm.org
Virtual assistants are becoming increasingly important speech-driven Information Retrieval
platforms that assist users with various tasks. We discuss open problems and challenges …

Training large-vocabulary neural language models by private federated learning for resource-constrained devices

M Xu, C Song, Y Tian, N Agrawal… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
Federated Learning (FL) is a technique to train models on distributed edge devices with
local data samples. Differential Privacy (DP) can be applied with FL to provide a formal …

Listen, know and spell: Knowledge-infused subword modeling for improving asr performance of oov named entities

N Das, M Sunkara, D Bekal, DH Chau… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Automatic speech recognition (ASR) is increasingly being used in specialized domains such
as medical ASR and news transcription. Owing to the lack of high quality annotated speech …

Synthetic query generation using large language models for virtual assistants

S Sannigrahi, T Fraga-Silva, Y Oualil… - Proceedings of the 47th …, 2024 - dl.acm.org
Virtual Assistants (VAs) are important Information Retrieval platforms that help users
accomplish various tasks through spoken commands. The speech recognition system …

Space-efficient representation of entity-centric query language models

C Van Gysel, M Hannemann, E Pusateri… - arXiv preprint arXiv …, 2022 - arxiv.org
Virtual assistants make use of automatic speech recognition (ASR) to help users answer
entity-centric queries. However, spoken entity recognition is a difficult problem, due to the …

Knowledge Prompt for Whisper: An ASR Entity Correction Approach with Knowledge Base

M Zhang, X Qiao, Y Zhao, C Su, Y Li… - … Conference on Big …, 2023 - ieeexplore.ieee.org
Entity correction is crucial in Automatic Speech TABLE I Recognition (ASR), since erroneous
entities seriously affect our understanding of ASR results. In this paper, in order to correct …

Record deduplication for entity distribution modeling in ASR transcripts

T Huang, CH Hong, C Wivagg, K Shimizu - arXiv preprint arXiv …, 2023 - arxiv.org
Voice digital assistants must keep up with trending search queries. We rely on a speech
recognition model using contextual biasing with a rapidly updated set of entities, instead of …