A survey of deep learning for mathematical reasoning

P Lu, L Qiu, W Yu, S Welleck, KW Chang - arXiv preprint arXiv:2212.10535, 2022 - arxiv.org
Mathematical reasoning is a fundamental aspect of human intelligence and is applicable in
various fields, including science, engineering, finance, and everyday life. The development …

MQuAKE: Assessing knowledge editing in language models via multi-hop questions

Z Zhong, Z Wu, CD Manning, C Potts… - arXiv preprint arXiv …, 2023 - arxiv.org
The information stored in large language models (LLMs) falls out of date quickly, and
retraining from scratch is often not an option. This has recently given rise to a range of …

Knowledge conflicts for LLMs: A survey

R Xu, Z Qi, Z Guo, C Wang, H Wang, Y Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org
This survey provides an in-depth analysis of knowledge conflicts for large language models
(LLMs), highlighting the complex challenges they encounter when blending contextual and …

Interactive natural language processing

Z Wang, G Zhang, K Yang, N Shi, W Zhou… - arXiv preprint arXiv …, 2023 - arxiv.org
Interactive Natural Language Processing (iNLP) has emerged as a novel paradigm within
the field of NLP, aimed at addressing limitations in existing frameworks while aligning with …

Consistency analysis of ChatGPT

ME Jang, T Lukasiewicz - arXiv preprint arXiv:2303.06273, 2023 - arxiv.org
ChatGPT has gained huge popularity since its introduction. Its positive aspects have been
reported through many media platforms, and some analyses even showed that ChatGPT …

Conformal language modeling

V Quach, A Fisch, T Schuster, A Yala, JH Sohn… - arXiv preprint arXiv …, 2023 - arxiv.org
We propose a novel approach to conformal prediction for generative language models
(LMs). Standard conformal prediction produces prediction sets--in place of single predictions …

Internal consistency and self-feedback in large language models: A survey

X Liang, S Song, Z Zheng, H Wang, Q Yu, X Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Large language models (LLMs) often exhibit deficient reasoning or generate hallucinations.
To address these, studies prefixed with "Self-", such as Self-Consistency, Self-Improve, and …

Cross-lingual consistency of factual knowledge in multilingual language models

J Qi, R Fernández, A Bisazza - arXiv preprint arXiv:2310.10378, 2023 - arxiv.org
Multilingual large-scale Pretrained Language Models (PLMs) have been shown to store
considerable amounts of factual knowledge, but large variations are observed across …

Human-like few-shot learning via Bayesian reasoning over natural language

K Ellis - Advances in Neural Information Processing …, 2023 - proceedings.neurips.cc
A core tension in models of concept learning is that the model must carefully balance the
tractability of inference against the expressivity of the hypothesis class. Humans, however …

Benchmarking and improving generator-validator consistency of language models

XL Li, V Shrivastava, S Li, T Hashimoto… - arXiv preprint arXiv …, 2023 - arxiv.org
As of September 2023, ChatGPT correctly answers "what is 7+8" with 15, but when asked
"7+8=15, True or False" it responds with "False". This inconsistency between generating …