Espnet-slu: Advancing spoken language understanding through espnet

S Arora, S Dalmia, P Denisov, X Chang… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
As Automatic Speech Processing (ASR) systems are getting better, there is an increasing
interest of using the ASR output to do downstream Natural Language Processing (NLP) …

[图书][B] Neural approaches to conversational information retrieval

J Gao, C Xiong, P Bennett, N Craswell - 2023 - Springer
A conversational information retrieval (CIR) system is an information retrieval (IR) system
with a conversational interface, which allows users to interact with the system to seek …

Few-shot intent classification and slot filling with retrieved examples

D Yu, L He, Y Zhang, X Du, P Pasupat, Q Li - arXiv preprint arXiv …, 2021 - arxiv.org
Few-shot learning arises in important practical scenarios, such as when a natural language
understanding system needs to learn new semantic labels for an emerging, resource-scarce …

Semi-supervised spoken language understanding via self-supervised speech and language model pretraining

CI Lai, YS Chuang, HY Lee, SW Li… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
Much recent work on Spoken Language Understanding (SLU) is limited in at least one of
three ways: models were trained on oracle text input and neglected ASR errors, models …

Policy-driven neural response generation for knowledge-grounded dialogue systems

B Hedayatnia, K Gopalakrishnan, S Kim, Y Liu… - arXiv preprint arXiv …, 2020 - arxiv.org
Open-domain dialogue systems aim to generate relevant, informative and engaging
responses. Seq2seq neural response generation approaches do not have explicit …

Neural generation meets real people: Towards emotionally engaging mixed-initiative conversations

A Paranjape, A See, K Kenealy, H Li, A Hardy… - arXiv preprint arXiv …, 2020 - arxiv.org
We present Chirpy Cardinal, an open-domain dialogue agent, as a research platform for the
2019 Alexa Prize competition. Building an open-domain socialbot that talks to real people is …

A study on the integration of pre-trained ssl, asr, lm and slu models for spoken language understanding

Y Peng, S Arora, Y Higuchi, Y Ueda… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org
Collecting sufficient labeled data for spoken language understanding (SLU) is expensive
and time-consuming. Recent studies achieved promising results by using pre-trained …

Token-level sequence labeling for spoken language understanding using compositional end-to-end models

S Arora, S Dalmia, B Yan, F Metze, AW Black… - arXiv preprint arXiv …, 2022 - arxiv.org
End-to-end spoken language understanding (SLU) systems are gaining popularity over
cascaded approaches due to their simplicity and ability to avoid error propagation. However …

Gunrock 2.0: A user adaptive social conversational system

K Liang, A Chau, Y Li, X Lu, D Yu, M Zhou… - arXiv preprint arXiv …, 2020 - arxiv.org
Gunrock 2.0 is built on top of Gunrock with an emphasis on user adaptation. Gunrock 2.0
combines various neural natural language understanding modules, including named entity …

Two-pass low latency end-to-end spoken language understanding

S Arora, S Dalmia, X Chang, B Yan, A Black… - arXiv preprint arXiv …, 2022 - arxiv.org
End-to-end (E2E) models are becoming increasingly popular for spoken language
understanding (SLU) systems and are beginning to achieve competitive performance to …