Multi-task learning for domain-general spoken disfluency detection in dialogue systems

Conversational AI for multi-agent communication in Natural Language: Research directions at the Interaction Lab

O Lemon - Ai Communications, 2022 - journals.sagepub.com

Research at the Interaction Lab focuses on human-agent communication using
conversational Natural Language. The ultimate goal is to create systems where humans and …

被引用次数：9 相关文章所有 7 个版本

[PDF] aclanthology.org

Best of both worlds: Making high accuracy non-incremental transformer-based disfluency detection incremental

M Rohanian, J Hough - Proceedings of the 59th Annual Meeting …, 2021 - aclanthology.org

While Transformer-based text classifiers pre-trained on large volumes of text have yielded
significant improvements on a wide range of computational linguistics tasks, their …

被引用次数：29 相关文章所有 6 个版本

[PDF] arxiv.org

Hierarchical multi-task natural language understanding for cross-domain conversational AI: HERMIT NLU

A Vanzo, E Bastianelli, O Lemon - arXiv preprint arXiv:1910.00912, 2019 - arxiv.org

We present a new neural architecture for wide-coverage Natural Language Understanding
in Spoken Dialogue Systems. We develop a hierarchical multi-task architecture, which …

被引用次数：44 相关文章所有 8 个版本

[PDF] arxiv.org

Fillers in spoken language understanding: Computational and psycholinguistic perspectives

T Dinkar, C Clavel, I Vasilescu - arXiv preprint arXiv:2301.10761, 2023 - arxiv.org

Disfluencies (ie interruptions in the regular flow of speech), are ubiquitous to spoken
discourse. Fillers (" uh"," um") are disfluencies that occur the most frequently compared to …

被引用次数：11 相关文章所有 5 个版本

[PDF] aclanthology.org

A comprehensive evaluation of incremental speech recognition and diarization for conversational AI

A Addlesee, Y Yu, A Eshghi - Proceedings of the 28th International …, 2020 - aclanthology.org

Abstract Automatic Speech Recognition (ASR) systems are increasingly powerful and more
accurate, but also more numerous with several options existing currently as a service (eg …

被引用次数：25 相关文章所有 5 个版本

[PDF] arxiv.org

Teaching BERT to wait: Balancing accuracy and latency for streaming disfluency detection

A Chen, V Zayats, DD Walker, D Padfield - arXiv preprint arXiv …, 2022 - arxiv.org

In modern interactive speech-based systems, speech is consumed and transcribed
incrementally prior to having disfluencies removed. This post-processing step is crucial for …

被引用次数：14 相关文章所有 6 个版本

[PDF] mit.edu

What helps transformers recognize conversational structure? Importance of context, punctuation, and labels in dialog act recognition

P Żelasko, R Pappagari, N Dehak - Transactions of the Association …, 2021 - direct.mit.edu

Dialog acts can be interpreted as the atomic units of a conversation, more fine-grained than
utterances, characterized by a specific communicative function. The ability to structure a …

被引用次数：14 相关文章所有 9 个版本

[PDF] arxiv.org

Re-framing incremental deep language models for dialogue processing with multi-task learning

M Rohanian, J Hough - arXiv preprint arXiv:2011.06754, 2020 - arxiv.org

We present a multi-task learning framework to enable the training of one universal
incremental dialogue processing model with four tasks of disfluency detection, language …

被引用次数：14 相关文章所有 6 个版本

[PDF] aclanthology.org

[PDF][PDF] Evaluation of Automatic Speech Recognition for Conversational Speech in Dutch, English and German: What Goes Missing?

A Lopez, A Liesenfeld… - Proceedings of the 18th …, 2022 - aclanthology.org

As voice user interfaces and conversational agents grow in importance, automatic speech
recognition (ASR) encounters increasingly free-form and informal input data. Conversational …

被引用次数：9 相关文章所有 5 个版本

[PDF] hal.science

Computational models of disfluencies: fillers and discourse markers in spoken language understanding

T Dinkar - 2022 - theses.hal.science

People rarely speak in the same manner that they write–they are generally disfluent.
Disfluencies can be defined as interruptions in the regular flow of speech, such as pausing …

被引用次数：3 相关文章所有 13 个版本

高级搜索

QQ 群