Parallel data, tools and interfaces in OPUS. J Tiedemann Lrec 2012, 2214-2218, 2012 | 2147 | 2012 |
Opensubtitles2016: Extracting large parallel corpora from movie and tv subtitles P Lison, J Tiedemann European Language Resources Association, 2016 | 920 | 2016 |
News from OPUS-A collection of multilingual parallel corpora with tools and interfaces J Tiedemann Recent advances in natural language processing 5, 237-248, 2009 | 753 | 2009 |
OPUS-MT–building open translation services for the world J Tiedemann, S Thottingal Proceedings of the 22nd annual conference of the European Association for …, 2020 | 549 | 2020 |
An analysis of encoder representations in transformer-based machine translation A Raganato, J Tiedemann Proceedings of the 2018 EMNLP workshop BlackboxNLP: analyzing and …, 2018 | 301 | 2018 |
Neural machine translation with extended context J Tiedemann, Y Scherrer arXiv preprint arXiv:1708.05943, 2017 | 280 | 2017 |
The opus corpus-parallel and free: http://logos. uio. no/opus J Tiedemann, L Nygaard Proceedings of the Fourth International Conference on Language Resources and …, 2004 | 233 | 2004 |
Discriminating between similar languages and arabic dialect identification: A report on the third dsl shared task S Malmasi, M Zampieri, N Ljubešić, P Nakov, A Ali, J Tiedemann Proceedings of the third workshop on NLP for similar languages, varieties …, 2016 | 205 | 2016 |
OpenSubtitles2018: Statistical rescoring of sentence alignments in large, noisy parallel corpora P Lison, J Tiedemann, M Kouylekov Proceedings of the Eleventh International Conference on Language Resources …, 2018 | 190 | 2018 |
Findings of the VarDial evaluation campaign 2017 M Zampieri, S Malmasi, N Ljubešić, P Nakov, A Ali, J Tiedemann, ... Proceedings of the fourth workshop on NLP for similar languages, varieties …, 2017 | 179 | 2017 |
The Tatoeba Translation Challenge--Realistic Data Sets for Low Resource and Multilingual MT J Tiedemann arXiv preprint arXiv:2010.06354, 2020 | 162 | 2020 |
Finding synonyms using automatic word alignment and measures of distributional similarity L Van der Plas, J Tiedemann Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions, 866-873, 2006 | 154 | 2006 |
Efficient word alignment with markov chain monte carlo R Östling, J Tiedemann The Prague Bulletin of Mathematical Linguistics 106 (1), 125, 2016 | 151 | 2016 |
Bitext alignment J Tiedemann Morgan & Claypool Publishers, 2011 | 148 | 2011 |
A report on the DSL shared task 2014 M Zampieri, L Tan, N Ljubešić, J Tiedemann Proceedings of the first workshop on applying NLP tools to similar languages …, 2014 | 136 | 2014 |
Language Identification and Morphosyntactic Tagging. The Second VarDial Evaluation Campaign. M Zampieri, S Malmasi, P Nakov, A Ali, S Shon, J Glass, Y Scherrer, ... Santa Fe, USA: Association for Computational Linguistics, 2018 | 126 | 2018 |
Recycling translations: Extraction of lexical data from parallel corpora and their application in natural language processing J Tiedemann Acta Universitatis Upsaliensis, 2003 | 125 | 2003 |
Character-based joint segmentation and POS tagging for Chinese using bidirectional RNN-CRF Y Shao, C Hardmeier, J Tiedemann, J Nivre arXiv preprint arXiv:1704.01314, 2017 | 124 | 2017 |
Identifying idiomatic expressions using automatic word-alignment BV Moirón, J Tiedemann Proceedings of the Workshop on Multi-word-expressions in a multilingual context, 2006 | 121 | 2006 |
Overview of the DSL shared task 2015 M Zampieri, L Tan, N Ljubešić, J Tiedemann, P Nakov Proceedings of the Joint Workshop on Language Technology for Closely Related …, 2015 | 119 | 2015 |