- 学术资源搜索

Code generation using machine learning: A systematic review

E Dehaerne, B Dey, S Halder, S De Gendt… - Ieee …, 2022 - ieeexplore.ieee.org

Recently, machine learning (ML) methods have been used to create powerful language
models for a broad range of natural language processing tasks. An important subset of this …

被引用次数：76 相关文章所有 8 个版本

[PDF] acm.org

Large language models for code: Security hardening and adversarial testing

J He, M Vechev - Proceedings of the 2023 ACM SIGSAC Conference on …, 2023 - dl.acm.org

Large language models (large LMs) are increasingly trained on massive codebases and
used to generate code. However, LMs lack awareness of security and are found to …

被引用次数：101 相关文章所有 5 个版本

[PDF] arxiv.org

Investigating the catastrophic forgetting in multimodal large language models

Y Zhai, S Tong, X Li, M Cai, Q Qu, YJ Lee… - arXiv preprint arXiv …, 2023 - arxiv.org

Following the success of GPT4, there has been a surge in interest in multimodal large
language model (MLLM) research. This line of research focuses on developing general …

被引用次数：94 相关文章所有 3 个版本

[PDF] arxiv.org

Aligning language models with preferences through f-divergence minimization

D Go, T Korbak, G Kruszewski, J Rozen, N Ryu… - arXiv preprint arXiv …, 2023 - arxiv.org

Aligning language models with preferences can be posed as approximating a target
distribution representing some desired behavior. Existing approaches differ both in the …

被引用次数：64 相关文章所有 6 个版本

[PDF] nature.com

OpenMedLM: prompt engineering can out-perform fine-tuning in medical question-answering with open-source large language models

J Maharjan, A Garikipati, NP Singh, L Cyrus… - Scientific Reports, 2024 - nature.com

LLMs can accomplish specialized medical knowledge tasks, however, equitable access is
hindered by the extensive fine-tuning, specialized medical data requirement, and limited …

被引用次数：13 相关文章所有 3 个版本

[PDF] arxiv.org

A survey of mamba

H Qu, L Ning, R An, W Fan, T Derr, H Liu, X Xu… - arXiv preprint arXiv …, 2024 - arxiv.org

As one of the most representative DL techniques, Transformer architecture has empowered
numerous advanced models, especially the large language models (LLMs) that comprise …

被引用次数：15 相关文章所有 3 个版本

[PDF] thecvf.com

LLaViLo: Boosting Video Moment Retrieval via Adapter-Based Multimodal Modeling

K Ma, X Zang, Z Feng, H Fang, C Ban… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recent studies have explored the potential of large language models (LLMs) for
understanding the semantic information in images. However, the use of LLMs to understand …

被引用次数：14 相关文章所有 3 个版本

[PDF] neurips.cc

On reinforcement learning and distribution matching for fine-tuning language models with no catastrophic forgetting

T Korbak, H Elsahar, G Kruszewski… - Advances in Neural …, 2022 - proceedings.neurips.cc

The availability of large pre-trained models is changing the landscape of Machine Learning
research and practice, moving from a" training from scratch" to a" fine-tuning''paradigm …

被引用次数：54 相关文章所有 6 个版本

[PDF] arxiv.org

Gradient-based constrained sampling from language models

S Kumar, B Paria, Y Tsvetkov - arXiv preprint arXiv:2205.12558, 2022 - arxiv.org

Large pretrained language models generate fluent text but are notoriously hard to
controllably sample from. In this work, we study constrained sampling from such language …

被引用次数：46 相关文章所有 4 个版本

[PDF] arxiv.org

From decoding to meta-generation: Inference-time algorithms for large language models

S Welleck, A Bertsch, M Finlayson… - arXiv preprint arXiv …, 2024 - arxiv.org

One of the most striking findings in modern research on large language models (LLMs) is
that scaling up compute during training leads to better results. However, less attention has …

被引用次数：12 相关文章所有 3 个版本

高级搜索

QQ 群

Code generation using machine learning: A systematic review

Large language models for code: Security hardening and adversarial testing

Investigating the catastrophic forgetting in multimodal large language models

Aligning language models with preferences through f-divergence minimization

OpenMedLM: prompt engineering can out-perform fine-tuning in medical question-answering with open-source large language models

A survey of mamba

LLaViLo: Boosting Video Moment Retrieval via Adapter-Based Multimodal Modeling

On reinforcement learning and distribution matching for fine-tuning language models with no catastrophic forgetting

Gradient-based constrained sampling from language models

From decoding to meta-generation: Inference-time algorithms for large language models

引用