Evaluating and improving tool-augmented computation-intensive math reasoning

B Zhang, K Zhou, X Wei, X Zhao… - Advances in …, 2024 - proceedings.neurips.cc
Chain-of-thought prompting (CoT) and tool augmentation have been validated in recent
work as effective practices for improving large language models (LLMs) to perform step-by …

Bridging the novice-expert gap via models of decision-making: A case study on remediating math mistakes

R Wang, Q Zhang, C Robinson, S Loeb… - Proceedings of the …, 2024 - aclanthology.org
Scaling high-quality tutoring remains a major challenge in education. Due to growing
demand, many platforms employ novice tutors who, unlike experienced educators, struggle …

Mathattack: Attacking large language models towards math solving ability

Z Zhou, Q Wang, M Jin, J Yao, J Ye, W Liu… - Proceedings of the …, 2024 - ojs.aaai.org
With the boom of Large Language Models (LLMs), the research of solving Math Word
Problem (MWP) has recently made great progress. However, there are few studies to …

Exploring the numerical reasoning capabilities of language models: A comprehensive analysis on tabular data

M Akhtar, A Shankarampeta, V Gupta, A Patil… - arXiv preprint arXiv …, 2023 - arxiv.org
Numbers are crucial for various real-world domains such as finance, economics, and
science. Thus, understanding and reasoning with numbers are essential skills for language …

Data augmentation for conversational ai

H Soudani, E Kanoulas, F Hasibi - Proceedings of the 32nd ACM …, 2023 - dl.acm.org
Advancements in conversational systems have revolutionized information access,
surpassing the limitations of single queries. However, developing dialogue systems requires …

Knowledge-Based and Generative-AI-Driven Pedagogical Conversational Agents: A Comparative Study of Grice's Cooperative Principles and Trust

M Wölfel, MB Shirzad, A Reich, K Anderer - Big Data and Cognitive …, 2023 - mdpi.com
The emergence of generative language models (GLMs), such as OpenAI's ChatGPT, is
changing the way we communicate with computers and has a major impact on the …

Do Language Models Exhibit the Same Cognitive Biases in Problem Solving as Human Learners?

A Opedal, A Stolfo, H Shirakami, Y Jiao… - arXiv preprint arXiv …, 2024 - arxiv.org
There is increasing interest in employing large language models (LLMs) as cognitive
models. For such purposes, it is central to understand which cognitive properties are well …

Findings of the AmericasNLP 2024 shared task on the creation of educational materials for indigenous languages

L Chiruzzo, P Denisov, A Molina-Villegas… - Proceedings of the …, 2024 - aclanthology.org
This paper presents the results of the first shared task about the creation of educational
materials for three indigenous languages of the Americas. The task proposes to …

Language models as science tutors

A Chevalier, J Geng, A Wettig, H Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
NLP has recently made exciting progress toward training language models (LMs) with
strong scientific problem-solving skills. However, model development has not focused on …

MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions

Z Liang, D Yu, W Yu, W Yao, Z Zhang, X Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) have demonstrated impressive capabilities in mathematical
problem solving, particularly in single turn question answering formats. However, real world …