Selecting machine-translated data for quick bootstrapping of a natural language understanding system

Data-efficient paraphrase generation to bootstrap intent classification and slot labeling for new features in task-oriented dialog systems

S Jolly, T Falke, C Tirkaz, D Sorokin - Proceedings of the 28th …, 2020 - aclanthology.org

Recent progress through advanced neural models pushed the performance of task-oriented
dialog systems to almost perfect accuracy on existing benchmark datasets for intent …

被引用次数：23 相关文章所有 3 个版本

[HTML] amazon.science

[HTML][HTML] Cross-lingual transfer learning with data selection for large-scale spoken language understanding

QNT Do, J Gaspers - 2019 - amazon.science

Typically, spoken language understanding (SLU) models are trained on annotated data
which are costly to gather. Aiming to reduce data needs for bootstrapping a SLU system for a …

被引用次数：20 相关文章所有 3 个版本

[PDF] arxiv.org

From masked language modeling to translation: Non-English auxiliary tasks improve zero-shot spoken language understanding

R Van Der Goot, I Sharaf, A Imankulova… - arXiv preprint arXiv …, 2021 - arxiv.org

The lack of publicly available evaluation data for low-resource languages limits progress in
Spoken Language Understanding (SLU). As key tasks like intent classification and slot filling …

被引用次数：14 相关文章所有 12 个版本

[PDF] arxiv.org

Cross-lingual transfer learning for spoken language understanding

QNT Do, J Gaspers - ICASSP 2019-2019 IEEE International …, 2019 - ieeexplore.ieee.org

Typically, spoken language understanding (SLU) models are trained on annotated data
which are costly to gather. Aiming to reduce data needs for bootstrapping a SLU system for a …

被引用次数：21 相关文章所有 3 个版本

[PDF] arxiv.org

Language model bootstrapping using neural machine translation for conversational speech recognition

S Punjabi, H Arsikere… - 2019 IEEE Automatic …, 2019 - ieeexplore.ieee.org

Building conversational speech recognition systems for new languages is constrained by
the availability of utterances capturing user-device interactions. Data collection is expensive …

被引用次数：11 相关文章所有 5 个版本

[PDF] aclanthology.org

Multilingual paraphrase generation for bootstrapping new features in task-oriented dialog systems

S Panda, C Tirkaz, T Falke… - Proceedings of the 3rd …, 2021 - aclanthology.org

The lack of labeled training data for new features is a common problem in rapidly changing
real-world dialog systems. As a solution, we propose a multilingual paraphrase generation …

被引用次数：8 相关文章所有 4 个版本

Investigating Paraphrasing-Based Data Augmentation for Task-Oriented Dialogue Systems

L Vogel, L Flek - International Conference on Text, Speech, and …, 2022 - Springer

With synthetic data generation, the required amount of human-generated training data can
be reduced significantly. In this work, we explore the usage of automatic paraphrasing …

被引用次数：2 相关文章所有 3 个版本

[PDF] soton.ac.uk

[图书][B] Multilinguality in knowledge graphs

LA Kaffee - 2023 - books.google.com

Content on the web is predominantly written in English, making it inaccessible to those who
only speak other languages. Knowledge graphs can store multilingual information, facilitate …

被引用次数：3 相关文章所有 2 个版本

[PDF] usp.br

[PDF][PDF] Back-translation as strategy to tackle the lack of corpus in natural language generation from semantic representations

MAS Cabezudo, S Mille, TAS Pardo - Proceedings, 2019 - repositorio.usp.br

This paper presents an exploratory study that aims to evaluate the usefulness of
backtranslation in Natural Language Generation (NLG) from semantic representations for …

被引用次数：7 相关文章所有 4 个版本

[PDF] arxiv.org

To what degree can language borders be blurred in BERT-based multilingual spoken language understanding?

Q Do, J Gaspers, T Roding, M Bradford - arXiv preprint arXiv:2011.05007, 2020 - arxiv.org

This paper addresses the question as to what degree a BERT-based multilingual Spoken
Language Understanding (SLU) model can transfer knowledge across languages. Through …

被引用次数：5 相关文章所有 6 个版本

高级搜索

QQ 群