C Thomson, A Belz - Proceedings of the 17th International Natural …, 2024 - aclanthology.org
Human evaluation is widely considered the most reliable form of evaluation in NLP, but recent research has shown it to be riddled with mistakes, often as a result of manual …
LN Watson, D Gkatzia - Proceedings of the Fourth Workshop on …, 2024 - aclanthology.org
Reproducibility is a cornerstone of scientific research, ensuring the reliability and generalisability of findings. The ReproNLP Shared Task on Reproducibility of Evaluations in …
M Arvan, N Parde - Proceedings of the Fourth Workshop on …, 2024 - aclanthology.org
Human evaluations are indispensable in the development of NLP systems because they provide direct insights into how effectively these systems meet real-world needs and …
Recent years have witnessed substantial growth in Machine Learning (ML) and Natural Language Processing (NLP), largely fueled by the accessibility and openness of data and …
Evaluating properly the goodness of Natural Language Generation (NLG) systems remains a hot topic that requires further research. There are two major issues to tackle: (i) the lack of …
I have been working on natural language generation (NLG), that is using artificial intelligence techniques to produce texts in English and other human languages, since I got …