Natural language generation and understanding of big code for AI-assisted programming: A review

MF Wong, S Guo, CN Hang, SW Ho, CW Tan - Entropy, 2023 - mdpi.com
This paper provides a comprehensive review of the literature concerning the utilization of
Natural Language Processing (NLP) techniques, with a particular focus on transformer …

Machine/deep learning for software engineering: A systematic literature review

S Wang, L Huang, A Gao, J Ge, T Zhang… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
Since 2009, the deep learning revolution, which was triggered by the introduction of
ImageNet, has stimulated the synergy between Software Engineering (SE) and Machine …

Unixcoder: Unified cross-modal pre-training for code representation

D Guo, S Lu, N Duan, Y Wang, M Zhou… - arXiv preprint arXiv …, 2022 - arxiv.org
Pre-trained models for programming languages have recently demonstrated great success
on code intelligence. To support both code-related understanding and generation tasks …

Spt-code: Sequence-to-sequence pre-training for learning source code representations

C Niu, C Li, V Ng, J Ge, L Huang, B Luo - Proceedings of the 44th …, 2022 - dl.acm.org
Recent years have seen the successful application of large pre-trained models to code
representation learning, resulting in substantial improvements on many code-related …

An empirical comparison of pre-trained models of source code

C Niu, C Li, V Ng, D Chen, J Ge… - 2023 IEEE/ACM 45th …, 2023 - ieeexplore.ieee.org
While a large number of pre-trained models of source code have been successfully
developed and applied to a variety of software engineering (SE) tasks in recent years, our …

Syncobert: Syntax-guided multi-modal contrastive pre-training for code representation

X Wang, Y Wang, F Mi, P Zhou, Y Wan, X Liu… - arXiv preprint arXiv …, 2021 - arxiv.org
Code representation learning, which aims to encode the semantics of source code into
distributed vectors, plays an important role in recent deep-learning-based models for code …

Exploring the potential of chatgpt in automated code refinement: An empirical study

Q Guo, J Cao, X Xie, S Liu, X Li, B Chen… - Proceedings of the 46th …, 2024 - dl.acm.org
Code review is an essential activity for ensuring the quality and maintainability of software
projects. However, it is a time-consuming and often error-prone task that can significantly …

Multi-target backdoor attacks for code pre-trained models

Y Li, S Liu, K Chen, X Xie, T Zhang, Y Liu - arXiv preprint arXiv …, 2023 - arxiv.org
Backdoor attacks for neural code models have gained considerable attention due to the
advancement of code intelligence. However, most existing works insert triggers into task …

Deep learning meets software engineering: A survey on pre-trained models of source code

C Niu, C Li, B Luo, V Ng - arXiv preprint arXiv:2205.11739, 2022 - arxiv.org
Recent years have seen the successful application of deep learning to software engineering
(SE). In particular, the development and use of pre-trained models of source code has …

Flakify: A black-box, language model-based predictor for flaky tests

S Fatima, TA Ghaleb, L Briand - IEEE Transactions on Software …, 2022 - ieeexplore.ieee.org
Software testing assures that code changes do not adversely affect existing functionality.
However, a test case can be flaky, ie, passing and failing across executions, even for the …