Code generation using machine learning: A systematic review

E Dehaerne, B Dey, S Halder, S De Gendt… - Ieee …, 2022 - ieeexplore.ieee.org
Recently, machine learning (ML) methods have been used to create powerful language
models for a broad range of natural language processing tasks. An important subset of this …

Large language models for code: Security hardening and adversarial testing

J He, M Vechev - Proceedings of the 2023 ACM SIGSAC Conference on …, 2023 - dl.acm.org
Large language models (large LMs) are increasingly trained on massive codebases and
used to generate code. However, LMs lack awareness of security and are found to …

Investigating the catastrophic forgetting in multimodal large language models

Y Zhai, S Tong, X Li, M Cai, Q Qu, YJ Lee… - arXiv preprint arXiv …, 2023 - arxiv.org
Following the success of GPT4, there has been a surge in interest in multimodal large
language model (MLLM) research. This line of research focuses on developing general …

Aligning language models with preferences through f-divergence minimization

D Go, T Korbak, G Kruszewski, J Rozen, N Ryu… - arXiv preprint arXiv …, 2023 - arxiv.org
Aligning language models with preferences can be posed as approximating a target
distribution representing some desired behavior. Existing approaches differ both in the …

OpenMedLM: prompt engineering can out-perform fine-tuning in medical question-answering with open-source large language models

J Maharjan, A Garikipati, NP Singh, L Cyrus… - Scientific Reports, 2024 - nature.com
LLMs can accomplish specialized medical knowledge tasks, however, equitable access is
hindered by the extensive fine-tuning, specialized medical data requirement, and limited …

A survey of mamba

H Qu, L Ning, R An, W Fan, T Derr, H Liu, X Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
As one of the most representative DL techniques, Transformer architecture has empowered
numerous advanced models, especially the large language models (LLMs) that comprise …

LLaViLo: Boosting Video Moment Retrieval via Adapter-Based Multimodal Modeling

K Ma, X Zang, Z Feng, H Fang, C Ban… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent studies have explored the potential of large language models (LLMs) for
understanding the semantic information in images. However, the use of LLMs to understand …

On reinforcement learning and distribution matching for fine-tuning language models with no catastrophic forgetting

T Korbak, H Elsahar, G Kruszewski… - Advances in Neural …, 2022 - proceedings.neurips.cc
The availability of large pre-trained models is changing the landscape of Machine Learning
research and practice, moving from a" training from scratch" to a" fine-tuning''paradigm …

Gradient-based constrained sampling from language models

S Kumar, B Paria, Y Tsvetkov - arXiv preprint arXiv:2205.12558, 2022 - arxiv.org
Large pretrained language models generate fluent text but are notoriously hard to
controllably sample from. In this work, we study constrained sampling from such language …

From decoding to meta-generation: Inference-time algorithms for large language models

S Welleck, A Bertsch, M Finlayson… - arXiv preprint arXiv …, 2024 - arxiv.org
One of the most striking findings in modern research on large language models (LLMs) is
that scaling up compute during training leads to better results. However, less attention has …