Mathematical language models: A survey

W Liu, H Hu, J Zhou, Y Ding, J Li, J Zeng, M He… - arXiv preprint arXiv …, 2023 - arxiv.org
In recent years, there has been remarkable progress in leveraging Language Models (LMs),
encompassing Pre-trained Language Models (PLMs) and Large-scale Language Models …

Large language models for mathematical reasoning: Progresses and challenges

J Ahn, R Verma, R Lou, D Liu, R Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Mathematical reasoning serves as a cornerstone for assessing the fundamental cognitive
capabilities of human intelligence. In recent times, there has been a notable surge in the …

Conflictbank: A benchmark for evaluating the influence of knowledge conflicts in llm

Z Su, J Zhang, X Qu, T Zhu, Y Li, J Sun, J Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) have achieved impressive advancements across numerous
disciplines, yet the critical issue of knowledge conflicts, a major source of hallucinations, has …

Is your model really a good math reasoner? evaluating mathematical reasoning with checklist

Z Zhou, S Liu, M Ning, W Liu, J Wang, DF Wong… - arXiv preprint arXiv …, 2024 - arxiv.org
Exceptional mathematical reasoning ability is one of the key features that demonstrate the
power of large language models (LLMs). How to comprehensively define and evaluate the …

Adversarial math word problem generation

R Xie, C Huang, J Wang, B Dhingra - Findings of the Association …, 2024 - aclanthology.org
Large language models (LLMs) have significantly transformed the educational landscape.
As current plagiarism detection tools struggle to keep pace with LLMs' rapid advancements …

Towards robust automated math problem solving: a survey of statistical and deep learning approaches

A Saraf, P Kamat, S Gite, S Kumar, K Kotecha - Evolutionary Intelligence, 2024 - Springer
Automated mathematical problem-solving represents a unique intersection of natural
language processing (NLP) and mathematical reasoning, posing significant challenges in …

Causal Evaluation of Language Models

S Chen, B Peng, M Chen, R Wang, M Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
Causal reasoning is viewed as crucial for achieving human-level machine intelligence.
Recent advances in language models have expanded the horizons of artificial intelligence …

A Security Risk Taxonomy for Prompt-Based Interaction With Large Language Models

E Derner, K Batistič, J Zahálka, R Babuška - IEEE Access, 2024 - ieeexplore.ieee.org
As large language models (LLMs) permeate more and more applications, an assessment of
their associated security risks becomes increasingly necessary. The potential for exploitation …

Target-driven Attack for Large Language Models

C Zhang, M Jin, D Shu, T Wang, D Liu, X Jin - arXiv preprint arXiv …, 2024 - arxiv.org
Current large language models (LLM) provide a strong foundation for large-scale user-
oriented natural language tasks. Many users can easily inject adversarial text or instructions …

Language Models are Symbolic Learners in Arithmetic

C Deng, Z Li, R Xie, R Chang, H Chen - arXiv preprint arXiv:2410.15580, 2024 - arxiv.org
Large Language Models (LLMs) are thought to struggle with arithmetic learning due to the
inherent differences between language modeling and numerical computation, but concrete …