Efficient training of bert by progressively stacking L Gong, D He, Z Li, T Qin, L Wang, T Liu International conference on machine learning, 2337-2346, 2019 | 138 | 2019 |
Joint language semantic and structure embedding for knowledge graph completion J Shen, C Wang, L Gong, D Song arXiv preprint arXiv:2209.08721, 2022 | 35 | 2022 |
Microsoft Research Asia's systems for WMT19 Y Xia, X Tan, F Tian, F Gao, W Chen, Y Fan, L Gong, Y Leng, R Luo, ... arXiv preprint arXiv:1911.06191, 2019 | 26 | 2019 |
Plotcoder: Hierarchical decoding for synthesizing visualization code in programmatic context X Chen, L Gong, A Cheung, D Song Proceedings of the 59th Annual Meeting of the Association for Computational …, 2021 | 18 | 2021 |
Anytime sampling for autoregressive models via ordered autoencoding Y Xu, Y Song, S Garg, L Gong, R Shu, A Grover, S Ermon arXiv preprint arXiv:2102.11495, 2021 | 17 | 2021 |
Mc-bert: Efficient language pre-training via a meta controller Z Xu, L Gong, G Ke, D He, S Zheng, L Wang, J Bian, TY Liu arXiv preprint arXiv:2006.05744, 2020 | 17 | 2020 |
Improved clinical abbreviation expansion via non-sense-based approaches J Kim, L Gong, J Khim, JC Weiss, P Ravikumar Machine Learning for Health, 161-178, 2020 | 8 | 2020 |
AST-T5: Structure-Aware Pretraining for Code Generation and Understanding L Gong, M Elhoushi, A Cheung arXiv preprint arXiv:2401.03003, 2024 | 3 | 2024 |
ADELT: Transpilation Between Deep Learning Frameworks L Gong, J Wang, A Cheung arXiv preprint arXiv:2303.03593, 2023 | 3 | 2023 |
Model-generated pretraining signals improves zero-shot generalization of text-to-text transformers L Gong, C Xiong, X Liu, P Bajaj, Y Xie, A Cheung, J Gao, X Song arXiv preprint arXiv:2305.12567, 2023 | 2 | 2023 |
Evaluation of LLMs on Syntax-Aware Code Fill-in-the-Middle Tasks L Gong, S Wang, M Elhoushi, A Cheung arXiv preprint arXiv:2403.04814, 2024 | 1 | 2024 |