BeHonest: Benchmarking Honesty of Large Language Models


3 results (0.02 seconds)


Search within citing articles

[PDF] radensa.ru

[PDF] A comprehensive survey of small language models in the era of large language models: Techniques, enhancements, applications, collaboration with LLMs, and …

F Wang, Z Zhang, X Zhang, Z Wu, T Mo, Q Lu… - arXiv preprint arXiv …, 2024 - ai.radensa.ru

Large language models (LLMs) have demonstrated emergent abilities in text generation,
question answering, and reasoning, facilitating various tasks and domains. Despite their …

Cited by 3 · Related articles · All 2 versions

[PDF] arxiv.org

Multilevel interpretability of artificial neural networks: leveraging framework and methods from neuroscience

Z He, J Achterberg, K Collins, K Nejad… - arXiv preprint arXiv …, 2024 - arxiv.org

As deep learning systems are scaled up to many billions of parameters, relating their
internal structure to external behaviors becomes very challenging. Although daunting, this …

Cited by 1 · Related articles · All 2 versions

[PDF] arxiv.org

Benchmarking Large Language Models via Random Variables

Z Hong, H Wu, S Dong, J Dong, Y Xiao… - arXiv preprint arXiv …, 2025 - arxiv.org

With the continuous advancement of large language models (LLMs) in mathematical
reasoning, evaluating their performance in this domain has become a prominent research …
