Data-efficient paraphrase generation to bootstrap intent classification and slot labeling for new features in task-oriented dialog systems

S Jolly, T Falke, C Tirkaz, D Sorokin - Proceedings of the 28th …, 2020 - aclanthology.org
Recent progress through advanced neural models pushed the performance of task-oriented
dialog systems to almost perfect accuracy on existing benchmark datasets for intent …

Cross-lingual transfer learning with data selection for large-scale spoken language understanding

QNT Do, J Gaspers - 2019 - amazon.science
Typically, spoken language understanding (SLU) models are trained on annotated data
which are costly to gather. Aiming to reduce data needs for bootstrapping a SLU system for a …

From masked language modeling to translation: Non-English auxiliary tasks improve zero-shot spoken language understanding

R Van Der Goot, I Sharaf, A Imankulova… - arXiv preprint arXiv …, 2021 - arxiv.org
The lack of publicly available evaluation data for low-resource languages limits progress in
Spoken Language Understanding (SLU). As key tasks like intent classification and slot filling …

Cross-lingual transfer learning for spoken language understanding

QNT Do, J Gaspers - ICASSP 2019-2019 IEEE International …, 2019 - ieeexplore.ieee.org
Typically, spoken language understanding (SLU) models are trained on annotated data
which are costly to gather. Aiming to reduce data needs for bootstrapping a SLU system for a …

Language model bootstrapping using neural machine translation for conversational speech recognition

S Punjabi, H Arsikere… - 2019 IEEE Automatic …, 2019 - ieeexplore.ieee.org
Building conversational speech recognition systems for new languages is constrained by
the availability of utterances capturing user-device interactions. Data collection is expensive …

Multilingual paraphrase generation for bootstrapping new features in task-oriented dialog systems

S Panda, C Tirkaz, T Falke… - Proceedings of the 3rd …, 2021 - aclanthology.org
The lack of labeled training data for new features is a common problem in rapidly changing
real-world dialog systems. As a solution, we propose a multilingual paraphrase generation …

Investigating Paraphrasing-Based Data Augmentation for Task-Oriented Dialogue Systems

L Vogel, L Flek - International Conference on Text, Speech, and …, 2022 - Springer
With synthetic data generation, the required amount of human-generated training data can
be reduced significantly. In this work, we explore the usage of automatic paraphrasing …

[Book] Multilinguality in knowledge graphs

LA Kaffee - 2023 - books.google.com
Content on the web is predominantly written in English, making it inaccessible to those who
only speak other languages. Knowledge graphs can store multilingual information, facilitate …

Back-translation as strategy to tackle the lack of corpus in natural language generation from semantic representations

MAS Cabezudo, S Mille, TAS Pardo - Proceedings, 2019 - repositorio.usp.br
This paper presents an exploratory study that aims to evaluate the usefulness of
backtranslation in Natural Language Generation (NLG) from semantic representations for …

To what degree can language borders be blurred in BERT-based multilingual spoken language understanding?

Q Do, J Gaspers, T Roding, M Bradford - arXiv preprint arXiv:2011.05007, 2020 - arxiv.org
This paper addresses the question as to what degree a BERT-based multilingual Spoken
Language Understanding (SLU) model can transfer knowledge across languages. Through …