Formal mathematics statement curriculum learning

S Polu, JM Han, K Zheng, M Baksys… - arXiv preprint arXiv …, 2022 - arxiv.org
We explore the use of expert iteration in the context of language modeling applied to formal
mathematics. We show that at same compute budget, expert iteration, by which we mean …

Draft, sketch, and prove: Guiding formal theorem provers with informal proofs

AQ Jiang, S Welleck, JP Zhou, W Li, J Liu… - arXiv preprint arXiv …, 2022 - arxiv.org
The formalization of existing mathematical proofs is a notoriously difficult process. Despite
decades of research on automation and proof assistants, writing formal proofs remains …

Thor: Wielding hammers to integrate language models and automated theorem provers

AQ Jiang, W Li, S Tworkowski… - Advances in …, 2022 - proceedings.neurips.cc
In theorem proving, the task of selecting useful premises from a large library to unlock the
proof of a given conjecture is crucially important. This presents a challenge for all theorem …

Deepseekmath: Pushing the limits of mathematical reasoning in open language models

Z Shao, P Wang, Q Zhu, R Xu, J Song, M Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
Mathematical reasoning poses a significant challenge for language models due to its
complex and structured nature. In this paper, we introduce DeepSeekMath 7B, which …

The TPTP problem library and associated infrastructure: from CNF to TH0, TPTP v6. 4.0

G Sutcliffe - Journal of Automated Reasoning, 2017 - Springer
This paper describes the TPTP problem library and associated infrastructure, from its use of
Clause Normal Form (CNF), via the First-Order Form (FOF) and Typed First-order Form …

Evaluating language models for mathematics through interactions

KM Collins, AQ Jiang, S Frieder… - Proceedings of the …, 2024 - National Acad Sciences
There is much excitement about the opportunity to harness the power of large language
models (LLMs) when building problem-solving assistants. However, the standard …

Learning to prove theorems via interacting with proof assistants

K Yang, J Deng - International Conference on Machine …, 2019 - proceedings.mlr.press
Humans prove theorems by relying on substantial high-level reasoning and problem-
specific insights. Proof assistants offer a formalism that resembles human mathematical …

[PDF][PDF] Hammering towards QED

JC Blanchette, C Kaliszyk, LC Paulson… - Journal of Formalized …, 2016 - pure.mpg.de
The main ingredients underlying this approach are efficient automatic theorem provers that
can cope with hundreds of axioms, suitable translations of the proof assistant's logic to the …

A survey of reasoning with foundation models

J Sun, C Zheng, E Xie, Z Liu, R Chu, J Qiu, J Xu… - arXiv preprint arXiv …, 2023 - arxiv.org
Reasoning, a crucial ability for complex problem-solving, plays a pivotal role in various real-
world settings such as negotiation, medical diagnosis, and criminal investigation. It serves …

Extending Sledgehammer with SMT solvers

JC Blanchette, S Böhme, LC Paulson - Journal of automated reasoning, 2013 - Springer
Sledgehammer is a component of Isabelle/HOL that employs resolution-based first-order
automatic theorem provers (ATPs) to discharge goals arising in interactive proofs. It …