M Bañón, P Chen, B Haddow, K Heafield… - … Conference of the …, 2020 - research.ed.ac.uk
We report on methods to create the largest publicly available parallel corpora by crawling
the web, using open source software. We empirically compare alternative methods and …