Sparks of artificial general intelligence: Early experiments with gpt-4

T Wu, S He, J Liu, S Sun, K Liu… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org

ChatGPT, an artificial intelligence generated content (AIGC) model developed by OpenAI,
has attracted world-wide attention for its capability of dealing with challenging language …

Cited by 510 Related articles All 4 versions

[PDF] acm.org

A survey on evaluation of large language models

Y Chang, X Wang, J Wang, Y Wu, L Yang… - ACM Transactions on …, 2024 - dl.acm.org

Large language models (LLMs) are gaining increasing popularity in both academia and
industry, owing to their unprecedented performance in various applications. As LLMs …

Cited by 916 Related articles All 4 versions

[PDF] neurips.cc

Judging llm-as-a-judge with mt-bench and chatbot arena

L Zheng, WL Chiang, Y Sheng… - Advances in …, 2024 - proceedings.neurips.cc

Evaluating large language model (LLM) based chat assistants is challenging due to their
broad capabilities and the inadequacy of existing benchmarks in measuring human …

Cited by 1185 Related articles All 6 versions

[PDF] arxiv.org

A survey of large language models

WX Zhao, K Zhou, J Li, T Tang, X Wang, Y Hou… - arXiv preprint arXiv …, 2023 - arxiv.org

Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …

Cited by 1956 Related articles All 4 versions

[PDF] neurips.cc

Direct preference optimization: Your language model is secretly a reward model

R Rafailov, A Sharma, E Mitchell… - Advances in …, 2024 - proceedings.neurips.cc

While large-scale unsupervised language models (LMs) learn broad world knowledge and
some reasoning skills, achieving precise control of their behavior is difficult due to the …

Cited by 992 Related articles All 9 versions

[PDF] acm.org

Generative agents: Interactive simulacra of human behavior

JS Park, J O'Brien, CJ Cai, MR Morris, P Liang… - Proceedings of the 36th …, 2023 - dl.acm.org

Believable proxies of human behavior can empower interactive applications ranging from
immersive environments to rehearsal spaces for interpersonal communication to prototyping …

Cited by 962 Related articles All 8 versions

[PDF] neurips.cc

Segment everything everywhere all at once

X Zou, J Yang, H Zhang, F Li, L Li… - Advances in …, 2024 - proceedings.neurips.cc

In this work, we present SEEM, a promptable and interactive model for segmenting
everything everywhere all at once in an image. In SEEM, we propose a novel and versatile …

Cited by 316 Related articles All 5 versions

[PDF] neurips.cc

Mathematical capabilities of chatgpt

S Frieder, L Pinchetti, RR Griffiths… - Advances in neural …, 2024 - proceedings.neurips.cc

We investigate the mathematical capabilities of two iterations of ChatGPT (released
9-January-2023 and 30-January-2023) and of GPT-4 by testing them on publicly available …

Cited by 382 Related articles All 10 versions

[PDF] neurips.cc

Camel: Communicative agents for "mind" exploration of large language model society

G Li, H Hammoud, H Itani… - Advances in Neural …, 2023 - proceedings.neurips.cc

The rapid advancement of chat-based language models has led to remarkable progress in
complex task-solving. However, their success heavily relies on human input to guide the …

Cited by 310 Related articles All 8 versions

[PDF] aaai.org

Graph of thoughts: Solving elaborate problems with large language models

M Besta, N Blach, A Kubicek, R Gerstenberger… - Proceedings of the …, 2024 - ojs.aaai.org

We introduce Graph of Thoughts (GoT): a framework that advances prompting
capabilities in large language models (LLMs) beyond those offered by paradigms such as …

Cited by 352 Related articles All 20 versions
