Mined bitexts can contain imperfect translations that yield unreliable training signals for Neural Machine Translation (NMT). While filtering such pairs out is known to improve final …
The ARC-NKUA (“Athena” Research Center-National and Kapodistrian University of Athens) submission to the WMT22 General Machine Translation shared task concerns the …
Machine Translation is a crucial task of Natural Language Processing, as it aims to provide a fast and automatic way of translating various types of texts. In recent years, the emergence of …
Y Gao, W Wang, H Ney - arXiv preprint arXiv:1908.09716, 2019 - arxiv.org
Preprocessing pipelines in Natural Language Processing usually involve a step that removes sentences consisting of illegal characters. The definition of illegal characters and …