Modeling spoken information queries for virtual assistants: Open problems, challenges and opportunities

C Van Gysel - Proceedings of the 46th International ACM SIGIR …, 2023 - dl.acm.org
Virtual assistants are becoming increasingly important speech-driven Information Retrieval
platforms that assist users with various tasks. We discuss open problems and challenges …

Synthetic query generation using large language models for virtual assistants

S Sannigrahi, T Fraga-Silva, Y Oualil… - Proceedings of the 47th …, 2024 - dl.acm.org
Virtual Assistants (VAs) are important Information Retrieval platforms that help users
accomplish various tasks through spoken commands. The speech recognition system …

[PDF][PDF] Robust Command Recognition for Lithuanian Air Traffic Control Tower Utterances.

O Ohneiser, SS Sarfjoo, H Helmke, S Shetty… - Interspeech, 2021 - isca-archive.org
The maturity of automatic speech recognition (ASR) systems at controller working positions
is currently a highly relevant technological topic in air traffic control (ATC). However, ATC …

Space-efficient representation of entity-centric query language models

C Van Gysel, M Hannemann, E Pusateri… - arXiv preprint arXiv …, 2022 - arxiv.org
Virtual assistants make use of automatic speech recognition (ASR) to help users answer
entity-centric queries. However, spoken entity recognition is a difficult problem, due to the …

Combining Language Models For Specialized Domains: A Colorful Approach

D Eitan, M Pirchi, N Glazer, S Meital, G Ayach… - arXiv preprint arXiv …, 2023 - arxiv.org
General purpose language models (LMs) encounter difficulties when processing domain-
specific jargon and terminology, which are frequently utilized in specialized fields such as …

Error-driven pruning of language models for virtual assistants

S Gondala, L Verwimp, E Pusateri… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Language models (LMs) for virtual assistants (VAs) are typically trained on large amounts of
data, resulting in prohibitively large models which require excessive memory and/or cannot …

Application-agnostic language modeling for on-device ASR

M Nußbaum-Thom, L Verwimp, Y Oualil - arXiv preprint arXiv:2305.09764, 2023 - arxiv.org
On-device automatic speech recognition systems face several challenges compared to
server-based systems. They have to meet stricter constraints in terms of speed, disk size and …

Predicting entity popularity to improve spoken entity recognition by virtual assistants

C Van Gysel, M Tsagkias, E Pusateri… - Proceedings of the 43rd …, 2020 - dl.acm.org
We focus on improving the effectiveness of a Virtual Assistant (VA) in recognizing emerging
entities in spoken queries. We introduce a method that uses historical user interactions to …

Transformer-based Model for ASR N-Best Rescoring and Rewriting

IE Kang, C Van Gysel, MH Siu - arXiv preprint arXiv:2406.08207, 2024 - arxiv.org
Voice assistants increasingly use on-device Automatic Speech Recognition (ASR) to ensure
speed and privacy. However, due to resource constraints on the device, queries pertaining …

Towards Continual Entity Learning in Language Models for Conversational Agents

RT Gadde, I Bulyko - arXiv preprint arXiv:2108.00082, 2021 - arxiv.org
Neural language models (LM) trained on diverse corpora are known to work well on
previously seen entities, however, updating these models with dynamically changing entities …