ReConcile: Round-table conference improves reasoning via consensus among diverse LLMs

JCY Chen, S Saha, M Bansal - arXiv preprint arXiv:2309.13007, 2023 - arxiv.org
Large Language Models (LLMs) still struggle with complex reasoning tasks. Motivated by
the society of minds (Minsky, 1988), we propose ReConcile, a multi-model multi-agent …

Improving factuality and reasoning in language models through multiagent debate

Y Du, S Li, A Torralba, JB Tenenbaum… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have demonstrated remarkable capabilities in language
generation, understanding, and few-shot learning in recent years. An extensive body of work …

Encouraging divergent thinking in large language models through multi-agent debate

T Liang, Z He, W Jiao, X Wang, Y Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Modern large language models (LLMs) like ChatGPT have shown remarkable performance
on general language tasks but still struggle on complex reasoning tasks, which drives the …

Corex: Pushing the boundaries of complex reasoning through multi-model collaboration

Q Sun, Z Yin, X Li, Z Wu, X Qiu, L Kong - arXiv preprint arXiv:2310.00280, 2023 - arxiv.org
Large Language Models (LLMs) are evolving at an unprecedented pace and have exhibited
considerable capability in the realm of natural language processing (NLP) with world …

Language models with rationality

N Kassner, O Tafjord, A Sabharwal… - arXiv preprint arXiv …, 2023 - arxiv.org
While large language models (LLMs) are proficient at question-answering (QA), it is not
always clear how (or even if) an answer follows from their latent "beliefs". This lack of …

Exchange-of-thought: Enhancing large language model capabilities through cross-model communication

Z Yin, Q Sun, C Chang, Q Guo, J Dai… - Proceedings of the …, 2023 - aclanthology.org
Large Language Models (LLMs) have recently made significant strides in complex
reasoning tasks through the Chain-of-Thought technique. Despite this progress, their …

Can ChatGPT Defend the Truth? Automatic Dialectical Evaluation Elicits LLMs' Deficiencies in Reasoning

B Wang, X Yue, H Sun - arXiv preprint arXiv:2305.13160, 2023 - arxiv.org
We explore testing the reasoning ability of large language models (LLMs), such as
ChatGPT, by engaging with them in a debate-like conversation that probes deeper into their …

Can large language models explore in-context?

A Krishnamurthy, K Harris, DJ Foster, C Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
We investigate the extent to which contemporary Large Language Models (LLMs) can
engage in exploration, a core capability in reinforcement learning and decision making. We …

Adapting LLM agents through communication

K Wang, Y Lu, M Santacroce, Y Gong, C Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent advancements in large language models (LLMs) have shown potential for human-
like agents. To help these agents adapt to new tasks without extensive human supervision …

Is multi-hop reasoning really explainable? Towards benchmarking reasoning interpretability

X Lv, Y Cao, L Hou, J Li, Z Liu, Y Zhang… - arXiv preprint arXiv …, 2021 - arxiv.org
Multi-hop reasoning has been widely studied in recent years to obtain more interpretable
link prediction. However, we find in experiments that many paths given by these models are …