Conversational AI for multi-agent communication in Natural Language: Research directions at the Interaction Lab

O Lemon - Ai Communications, 2022 - journals.sagepub.com
Research at the Interaction Lab focuses on human-agent communication using
conversational Natural Language. The ultimate goal is to create systems where humans and …

Best of both worlds: Making high accuracy non-incremental transformer-based disfluency detection incremental

M Rohanian, J Hough - Proceedings of the 59th Annual Meeting …, 2021 - aclanthology.org
While Transformer-based text classifiers pre-trained on large volumes of text have yielded
significant improvements on a wide range of computational linguistics tasks, their …

Hierarchical multi-task natural language understanding for cross-domain conversational AI: HERMIT NLU

A Vanzo, E Bastianelli, O Lemon - arXiv preprint arXiv:1910.00912, 2019 - arxiv.org
We present a new neural architecture for wide-coverage Natural Language Understanding
in Spoken Dialogue Systems. We develop a hierarchical multi-task architecture, which …

Fillers in spoken language understanding: Computational and psycholinguistic perspectives

T Dinkar, C Clavel, I Vasilescu - arXiv preprint arXiv:2301.10761, 2023 - arxiv.org
Disfluencies (ie interruptions in the regular flow of speech), are ubiquitous to spoken
discourse. Fillers (" uh"," um") are disfluencies that occur the most frequently compared to …

A comprehensive evaluation of incremental speech recognition and diarization for conversational AI

A Addlesee, Y Yu, A Eshghi - Proceedings of the 28th International …, 2020 - aclanthology.org
Abstract Automatic Speech Recognition (ASR) systems are increasingly powerful and more
accurate, but also more numerous with several options existing currently as a service (eg …

Teaching BERT to wait: Balancing accuracy and latency for streaming disfluency detection

A Chen, V Zayats, DD Walker, D Padfield - arXiv preprint arXiv …, 2022 - arxiv.org
In modern interactive speech-based systems, speech is consumed and transcribed
incrementally prior to having disfluencies removed. This post-processing step is crucial for …

What helps transformers recognize conversational structure? Importance of context, punctuation, and labels in dialog act recognition

P Żelasko, R Pappagari, N Dehak - Transactions of the Association …, 2021 - direct.mit.edu
Dialog acts can be interpreted as the atomic units of a conversation, more fine-grained than
utterances, characterized by a specific communicative function. The ability to structure a …

Re-framing incremental deep language models for dialogue processing with multi-task learning

M Rohanian, J Hough - arXiv preprint arXiv:2011.06754, 2020 - arxiv.org
We present a multi-task learning framework to enable the training of one universal
incremental dialogue processing model with four tasks of disfluency detection, language …

[PDF][PDF] Evaluation of Automatic Speech Recognition for Conversational Speech in Dutch, English and German: What Goes Missing?

A Lopez, A Liesenfeld… - Proceedings of the 18th …, 2022 - aclanthology.org
As voice user interfaces and conversational agents grow in importance, automatic speech
recognition (ASR) encounters increasingly free-form and informal input data. Conversational …

Computational models of disfluencies: fillers and discourse markers in spoken language understanding

T Dinkar - 2022 - theses.hal.science
People rarely speak in the same manner that they write–they are generally disfluent.
Disfluencies can be defined as interruptions in the regular flow of speech, such as pausing …