Graph neural networks: foundation, frontiers and applications

L Wu, P Cui, J Pei, L Zhao, X Guo - … of the 28th ACM SIGKDD Conference …, 2022 - dl.acm.org
The field of graph neural networks (GNNs) has seen rapid and incredible strides over the
recent years. Graph neural networks, also known as deep learning on graphs, graph …

A systematic mapping study of source code representation for deep learning in software engineering

HP Samoaa, F Bayram, P Salza, P Leitner - IET Software, 2022 - Wiley Online Library
The usage of deep learning (DL) approaches for software engineering has attracted much
attention, particularly in source code modelling and analysis. However, in order to use DL …

Transformers meet directed graphs

S Geisler, Y Li, DJ Mankowitz… - International …, 2023 - proceedings.mlr.press
Transformers were originally proposed as a sequence-to-sequence model for text but have
become vital for a wide range of modalities, including images, audio, video, and undirected …

Proof artifact co-training for theorem proving with language models

JM Han, J Rute, Y Wu, EW Ayers, S Polu - arXiv preprint arXiv:2102.06203, 2021 - arxiv.org
Labeled data for imitation learning of theorem proving in large libraries of formalized
mathematics is scarce as such libraries require years of concentrated effort by human …

How could neural networks understand programs?

D Peng, S Zheng, Y Li, G Ke, D He… - … on Machine Learning, 2021 - proceedings.mlr.press
Semantic understanding of programs is a fundamental problem for programming language
processing (PLP). Recent works that learn representations of code based on pre-training …

CODE-MVP: Learning to represent source code from multiple views with contrastive pre-training

X Wang, Y Wang, Y Wan, J Wang, P Zhou, L Li… - arXiv preprint arXiv …, 2022 - arxiv.org
Recent years have witnessed increasing interest in code representation learning, which
aims to represent the semantics of source code into distributed vectors. Currently, various …

IR2VEC LLVM IR Based Scalable Program Embeddings

S VenkataKeerthy, R Aggarwal, S Jain… - ACM Transactions on …, 2020 - dl.acm.org
We propose IR2Vec, a Concise and Scalable encoding infrastructure to represent programs
as a distributed embedding in continuous space. This distributed embedding is obtained by …

Machine learning in compilers: Past, present and future

H Leather, C Cummins - 2020 Forum for Specification and …, 2020 - ieeexplore.ieee.org
Writing optimising compilers is difficult. The range of programs that may be presented to the
compiler is huge and the systems on which they run are complex, heterogeneous, non …

Explaining graph neural networks for vulnerability discovery

T Ganz, M Härterich, A Warnecke, K Rieck - Proceedings of the 14th …, 2021 - dl.acm.org
Graph neural networks (GNNs) have proven to be an effective tool for vulnerability discovery
that outperforms learning-based methods working directly on source code. Unfortunately …

Commit2vec: Learning distributed representations of code changes

R Cabrera Lozoya, A Baumann, A Sabetta… - SN Computer …, 2021 - Springer
Deep learning methods have found successful applications in fields like image classification
and natural language processing. They have recently been applied to source code analysis …