Metitarski: Past and future

S Polu, JM Han, K Zheng, M Baksys… - arXiv preprint arXiv …, 2022 - arxiv.org

We explore the use of expert iteration in the context of language modeling applied to formal
mathematics. We show that at same compute budget, expert iteration, by which we mean …

被引用次数：115 相关文章所有 6 个版本

[PDF] arxiv.org

Draft, sketch, and prove: Guiding formal theorem provers with informal proofs

AQ Jiang, S Welleck, JP Zhou, W Li, J Liu… - arXiv preprint arXiv …, 2022 - arxiv.org

The formalization of existing mathematical proofs is a notoriously difficult process. Despite
decades of research on automation and proof assistants, writing formal proofs remains …

被引用次数：90 相关文章所有 5 个版本

[PDF] neurips.cc

Thor: Wielding hammers to integrate language models and automated theorem provers

AQ Jiang, W Li, S Tworkowski… - Advances in …, 2022 - proceedings.neurips.cc

In theorem proving, the task of selecting useful premises from a large library to unlock the
proof of a given conjecture is crucially important. This presents a challenge for all theorem …

被引用次数：60 相关文章所有 11 个版本

[PDF] arxiv.org

Deepseekmath: Pushing the limits of mathematical reasoning in open language models

Z Shao, P Wang, Q Zhu, R Xu, J Song, M Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

Mathematical reasoning poses a significant challenge for language models due to its
complex and structured nature. In this paper, we introduce DeepSeekMath 7B, which …

被引用次数：42 相关文章所有 2 个版本

The TPTP problem library and associated infrastructure: from CNF to TH0, TPTP v6. 4.0

G Sutcliffe - Journal of Automated Reasoning, 2017 - Springer

This paper describes the TPTP problem library and associated infrastructure, from its use of
Clause Normal Form (CNF), via the First-Order Form (FOF) and Typed First-order Form …

被引用次数：245 相关文章所有 2 个版本

[PDF] pnas.org

Evaluating language models for mathematics through interactions

KM Collins, AQ Jiang, S Frieder… - Proceedings of the …, 2024 - National Acad Sciences

There is much excitement about the opportunity to harness the power of large language
models (LLMs) when building problem-solving assistants. However, the standard …

被引用次数：23 相关文章所有 9 个版本

[PDF] mlr.press

Learning to prove theorems via interacting with proof assistants

K Yang, J Deng - International Conference on Machine …, 2019 - proceedings.mlr.press

Humans prove theorems by relying on substantial high-level reasoning and problem-
specific insights. Proof assistants offer a formalism that resembles human mathematical …

被引用次数：133 相关文章所有 14 个版本

[PDF] mpg.de

[PDF][PDF] Hammering towards QED

JC Blanchette, C Kaliszyk, LC Paulson… - Journal of Formalized …, 2016 - pure.mpg.de

The main ingredients underlying this approach are efficient automatic theorem provers that
can cope with hundreds of axioms, suitable translations of the proof assistant's logic to the …

被引用次数：223 相关文章所有 34 个版本

[PDF] arxiv.org

A survey of reasoning with foundation models

J Sun, C Zheng, E Xie, Z Liu, R Chu, J Qiu, J Xu… - arXiv preprint arXiv …, 2023 - arxiv.org

Reasoning, a crucial ability for complex problem-solving, plays a pivotal role in various real-
world settings such as negotiation, medical diagnosis, and criminal investigation. It serves …

被引用次数：18 相关文章所有 2 个版本

[PDF] psu.edu

Extending Sledgehammer with SMT solvers

JC Blanchette, S Böhme, LC Paulson - Journal of automated reasoning, 2013 - Springer

Sledgehammer is a component of Isabelle/HOL that employs resolution-based first-order
automatic theorem provers (ATPs) to discharge goals arising in interactive proofs. It …

被引用次数：312 相关文章所有 33 个版本

高级搜索

QQ 群

Formal mathematics statement curriculum learning

Draft, sketch, and prove: Guiding formal theorem provers with informal proofs

Thor: Wielding hammers to integrate language models and automated theorem provers

Deepseekmath: Pushing the limits of mathematical reasoning in open language models

The TPTP problem library and associated infrastructure: from CNF to TH0, TPTP v6. 4.0

Evaluating language models for mathematics through interactions

Learning to prove theorems via interacting with proof assistants

[PDF][PDF] Hammering towards QED

A survey of reasoning with foundation models

Extending Sledgehammer with SMT solvers

引用