O Zafrir, G Boudoukh, P Izsak… - 2019 Fifth Workshop on …, 2019 - ieeexplore.ieee.org
Recently, pre-trained Transformer-based [1] language models such as BERT [2] and GPT [3]
have shown great improvement in many Natural Language Processing (NLP) tasks …