M Gallé - Proceedings of the 2019 conference on empirical …, 2019 - aclanthology.org
Abstract Byte-Pair Encoding (BPE) is an unsupervised sub-word tokenization technique,
commonly used in neural machine translation and other NLP tasks. Its effectiveness makes it …