BERT 模型的主要优化改进方法研究综述

刘欢, 张智雄, 王宇飞 - 数据分析与知识发现, 2021 - manu44.magtech.com.cn
… ] Researchers can focus on pre-training targets optimization and Transformer structure
improvement, and consider choosing the optimization routes according to different application