The 2024 repronlp shared task on reproducibility of evaluations in nlp: Overview and results

A Belz, C Thomson - Proceedings of the Fourth Workshop on …, 2024 - aclanthology.org
This paper presents an overview of, and the results from, the 2024 Shared Task on
Reproducibility of Evaluations in NLP (ReproNLP'24), following on from three previous …

(Mostly) Automatic Experiment Execution for Human Evaluations of NLP Systems

C Thomson, A Belz - Proceedings of the 17th International Natural …, 2024 - aclanthology.org
Human evaluation is widely considered the most reliable form of evaluation in NLP, but
recent research has shown it to be riddled with mistakes, often as a result of manual …

ReproHum# 0712-01: Reproducing Human Evaluation of Meaning Preservation in Paraphrase Generation

LN Watson, D Gkatzia - Proceedings of the Fourth Workshop on …, 2024 - aclanthology.org
Reproducibility is a cornerstone of scientific research, ensuring the reliability and
generalisability of findings. The ReproNLP Shared Task on Reproducibility of Evaluations in …

ReproHum# 0712-01: Human Evaluation Reproduction Report for “Hierarchical Sketch Induction for Paraphrase Generation”

M Arvan, N Parde - Proceedings of the Fourth Workshop on …, 2024 - aclanthology.org
Human evaluations are indispensable in the development of NLP systems because they
provide direct insights into how effectively these systems meet real-world needs and …

Machine Learning and Open Science: On Risks and Challenges

M Arvan - 2024 - search.proquest.com
Recent years have witnessed substantial growth in Machine Learning (ML) and Natural
Language Processing (NLP), largely fueled by the accessibility and openness of data and …

[PDF][PDF] Improving Robustness and Reproducibility in Validation of Natural Language Generation Systems

JG Corbelle - 2024 - ceur-ws.org
Evaluating properly the goodness of Natural Language Generation (NLG) systems remains
a hot topic that requires further research. There are two major issues to tackle:(i) the lack of …

Natural

E Reiter - Springer
I have been working on natural language generation (NLG), that is using artificial
intelligence techniques to produce texts in English and other human languages, since I got …