Internal consistency and self-feedback in large language models: A survey

X Liang, S Song, Z Zheng, H Wang, Q Yu, X Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) are expected to respond accurately but often exhibit
deficient reasoning or generate hallucinatory content. To address these issues, studies prefixed …

Chain of thoughtlessness: An analysis of CoT in planning

K Stechly, K Valmeekam, S Kambhampati - arXiv preprint arXiv …, 2024 - arxiv.org
Large language model (LLM) performance on reasoning problems typically does not
generalize out of distribution. Previous work has claimed that this can be mitigated by …

RATIONALYST: Pre-training Process-Supervision for Improving Reasoning

D Jiang, G Wang, Y Lu, A Wang, J Zhang, C Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
The reasoning steps generated by LLMs might be incomplete, as they mimic logical leaps
common in everyday communication found in their pre-training data: underlying rationales …

From Text to Life: On the Reciprocal Relationship between Artificial Life and Large Language Models

E Nisioti, C Glanois, E Najarro, A Dai… - Artificial Life …, 2024 - direct.mit.edu
Large Language Models (LLMs) have taken the field of AI by storm, but their
adoption in the field of Artificial Life (ALife) has been, so far, relatively reserved. In this work …

Direct-Inverse Prompting: Analyzing LLMs' Discriminative Capacity in Self-Improving Generation

JJ Ahn, R Kamoi, L Cheng, R Zhang, W Yin - arXiv preprint arXiv …, 2024 - arxiv.org
Mainstream LLM research has primarily focused on enhancing their generative capabilities.
However, even the most advanced LLMs experience uncertainty in their outputs, often …

Self-Correction is More than Refinement: A Learning Framework for Visual and Language Reasoning Tasks

J He, H Lin, Q Wang, Y Fung, H Ji - arXiv preprint arXiv:2410.04055, 2024 - arxiv.org
While Vision-Language Models (VLMs) have shown remarkable abilities in visual and
language reasoning tasks, they invariably generate flawed responses. Self-correction that …

What's Wrong? Refining Meeting Summaries with LLM Feedback

F Kirstein, T Ruas, B Gipp - arXiv preprint arXiv:2407.11919, 2024 - arxiv.org
Meeting summarization has become a critical task as digital encounters have become
common practice. Large language models (LLMs) show great potential in summarization …

WorldAPIs: The World Is Worth How Many APIs? A Thought Experiment

J Ou, A Uzunoglu, B Van Durme… - arXiv preprint arXiv …, 2024 - arxiv.org
AI systems make decisions in physical environments through primitive actions or
affordances that are accessed via API calls. While deploying AI agents in the real world …

Just say what you want: only-prompting self-rewarding online preference optimization

R Xu, Z Liu, Y Liu, S Yan, Z Wang, Z Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
We address the challenge of online Reinforcement Learning from Human Feedback (RLHF)
with a focus on self-rewarding alignment methods. In online RLHF, obtaining feedback …

Divide-Verify-Refine: Aligning LLM Responses with Complex Instructions

X Zhang, X Tang, H Liu, Z Wu, Q He, D Lee… - arXiv preprint arXiv …, 2024 - arxiv.org
Recent studies show that LLMs, particularly open-source models, struggle to follow complex
instructions with multiple constraints. Despite the importance, methods to improve LLMs' …