Active learning for natural language generation

Y Perlitz, A Gera, M Shmueli-Scheuer… - arXiv preprint arXiv …, 2023 - arxiv.org
The field of Natural Language Generation (NLG) suffers from a severe shortage of labeled
data due to the extremely expensive and time-consuming process involved in manual …

Reinforced active learning for low-resource, domain-specific, multi-label text classification

L Wertz, J Bogojeska, K Mirylenka… - Findings of the …, 2023 - aclanthology.org
Text classification datasets from specialised or technical domains are in high demand,
especially in industrial applications. However, due to the high cost of annotation such …

Evaluating pre-trained Sentence-BERT with class embeddings in active learning for multi-label text classification

L Wertz, J Bogojeska, K Mirylenka… - 2nd Conference of the …, 2022 - digitalcollection.zhaw.ch
The Transformer Language Model is a powerful tool that has been shown to excel at various
NLP tasks and has become the de-facto standard solution thanks to its versatility. In this …

Mind the User! Measures to More Accurately Evaluate the Practical Value of Active Learning Strategies

J Romberg - Proceedings of the 14th International Conference on …, 2023 - aclanthology.org
One solution to limited annotation budgets is active learning (AL), a collaborative process of
human and machine to strategically select a small but informative set of examples. While …

MCVIE: An Effective Batch-Mode Active Learning for Multi-label Text Classification

X Cheng, F Zhou, Q Wang, Y Wang, Y Wang - … International Conference on …, 2023 - Springer
Data labeling for multi-label text is a challenging task in natural language processing, and
active learning has emerged as a promising approach to reduce annotation effort while …

When few-shot fails: low-resource, domain-specific text classification with transformers

L Wertz - 2024 - elib.uni-stuttgart.de
Text classification (TC) is a foundational technique in natural language processing (NLP).
The ability to automatically classify texts into predetermined categories serves a critical role …

[PDF][PDF] Machine-assisted Text Classification of Public Participation Contributions

J Romberg - 2024 - docserv.uni-duesseldorf.de
Engaging citizens in decision-making processes is a widely implemented instrument in
democracies. Such public participation processes serve the goal of achieving a more …

On the Interdependence between Data Selection and Architecture Optimization in Deep Active Learning

P Bajracharya, R Li, L Wang - Transactions on Machine Learning Research - openreview.net
Deep active learning (DAL) studies the optimal selection of labeled data for training deep
neural networks (DNNs). While data selection in traditional active learning is mostly …

Apprentissage actif multi-labels pour des architectures transformers

M Arens - 2024 - theses.hal.science
L'annotation des données est cruciale pour l'apprentissage automatique, notamment dans
les domaines techniques, où la qualité et la quantité des données annotées affectent …

[PDF][PDF] BabyLM Challenge: Curriculum learning based on sentence complexity approximating language acquisition

M Oba, A Haga, A Fukatsu… - Proceedings of the BabyLM …, 2023 - scholar.archive.org
This paper describes our proposed models in the BabyLM Challenge (Warstadt et al., 2023).
The goal of this shared task is to pretrain models efficiently using a developmentally …