Y Liu, Y Zhang, Y Wang, F Hou, J Yuan, J Tian… - arXiv preprint arXiv …, 2021 - arxiv.org
Transformer, an attention-based encoder-decoder model, has already revolutionized the
field of natural language processing (NLP). Inspired by such significant achievements, some …