A survey of large language models

WX Zhao, K Zhou, J Li, T Tang, X Wang, Y Hou… - arXiv preprint arXiv …, 2023 - arxiv.org
Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …

Natural language processing models that automate programming will transform chemistry research and teaching

GM Hocky, AD White - Digital discovery, 2022 - pubs.rsc.org
Natural language processing models have emerged that can generate useable software
and automate a number of programming tasks with high fidelity. These tools have yet to …

Tutorial on amortized optimization

B Amos - Foundations and Trends® in Machine Learning, 2023 - nowpublishers.com
Optimization is a ubiquitous modeling tool and is often deployed in settings which
repeatedly solve similar instances of the same problem. Amortized optimization methods …

Out-of-distribution generalization in natural language processing: Past, present, and future

L Yang, Y Song, X Ren, C Lyu, Y Wang… - Proceedings of the …, 2023 - aclanthology.org
Abstract Machine learning (ML) systems in natural language processing (NLP) face
significant challenges in generalizing to out-of-distribution (OOD) data, where the test …

Check your other door! Creating backdoor attacks in the frequency domain

HAAK Hammoud, B Ghanem - arXiv preprint arXiv:2109.05507, 2021 - arxiv.org
Deep Neural Networks (DNNs) are ubiquitous and span a variety of applications ranging
from image classification to real-time object detection. As DNN models become more …

Jiuzhang 2.0: A unified chinese pre-trained language model for multi-task mathematical problem solving

X Zhao, K Zhou, B Zhang, Z Gong, Z Chen… - Proceedings of the 29th …, 2023 - dl.acm.org
Although pre-trained language models~(PLMs) have recently advanced the research
progress in mathematical reasoning, they are not specially designed as a capable multi-task …

Towards constituting mathematical structures for learning to optimize

J Liu, X Chen, Z Wang, W Yin… - … Conference on Machine …, 2023 - proceedings.mlr.press
Abstract Learning to Optimize (L2O), a technique that utilizes machine learning to learn an
optimization algorithm automatically from data, has gained arising attention in recent years …

Unsupervised techniques for generating a standard sample self-explanation answer with knowledge components in a math quiz

R Nakamoto, B Flanagan, Y Dai… - … and Practice in …, 2024 - repository.kulib.kyoto-u.ac.jp
Self-explanation is a widely recognized and effective pedagogical method. Previous
research has indicated that self-explanation can be used to evaluate students' …

Learning to reason with relational abstractions

AJ Nam, M Ren, C Finn, JL McClelland - arXiv preprint arXiv:2210.02615, 2022 - arxiv.org
Large language models have recently shown promising progress in mathematical reasoning
when fine-tuned with human-generated sequences walking through a sequence of solution …

The COVID That Wasn't: Counterfactual Journalism Using GPT

S Hamilton, A Piper - arXiv preprint arXiv:2210.06644, 2022 - arxiv.org
In this paper, we explore the use of large language models to assess human interpretations
of real world events. To do so, we use a language model trained prior to 2020 to artificially …