Z Yang, Z Dai, Y Yang, J Carbonell… - Advances in neural …, 2019 - proceedings.neurips.cc
With the capability of modeling bidirectional contexts, denoising autoencoding based
pretraining like BERT achieves better performance than pretraining approaches based on …