A survey on evaluation of large language models

Y Chang, X Wang, J Wang, Y Wu, L Yang… - ACM Transactions on …, 2024 - dl.acm.org
Large language models (LLMs) are gaining increasing popularity in both academia and
industry, owing to their unprecedented performance in various applications. As LLMs …

Large language models for software engineering: A systematic literature review

X Hou, Y Zhao, Y Liu, Z Yang, K Wang, L Li… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) have significantly impacted numerous domains, notably
including Software Engineering (SE). Nevertheless, a well-rounded understanding of the …

A survey of large language models

WX Zhao, K Zhou, J Li, T Tang, X Wang, Y Hou… - arXiv preprint arXiv …, 2023 - arxiv.org
Language is essentially a complex, intricate system of human expressions governed by
grammatical rules. It poses a significant challenge to develop capable AI algorithms for …

The rise and potential of large language model based agents: A survey

Z Xi, W Chen, X Guo, W He, Y Ding, B Hong… - arXiv preprint arXiv …, 2023 - arxiv.org
For a long time, humanity has pursued artificial intelligence (AI) equivalent to or surpassing
the human level, with AI agents considered a promising vehicle for this pursuit. AI agents are …

Software testing with large language models: Survey, landscape, and vision

J Wang, Y Huang, C Chen, Z Liu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Pre-trained large language models (LLMs) have recently emerged as a breakthrough
technology in natural language processing and artificial intelligence, with the ability to …

Large language models: A survey

S Minaee, T Mikolov, N Nikzad, M Chenaghlu… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) have drawn a lot of attention due to their strong
performance on a wide range of natural language tasks, since the release of ChatGPT in …

Think-on-graph: Deep and responsible reasoning of large language model with knowledge graph

J Sun, C Xu, L Tang, S Wang, C Lin, Y Gong… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have made significant strides in various tasks, yet they often
struggle with complex reasoning and exhibit poor performance in scenarios where …

A survey of safety and trustworthiness of large language models through the lens of verification and validation

X Huang, W Ruan, W Huang, G Jin, Y Dong… - Artificial Intelligence …, 2024 - Springer
Large language models (LLMs) have exploded a new heatwave of AI for their ability to
engage end-users in human-level conversations with detailed and articulate answers across …

Reasoning on graphs: Faithful and interpretable large language model reasoning

L Luo, YF Li, G Haffari, S Pan - arXiv preprint arXiv:2310.01061, 2023 - arxiv.org
Large language models (LLMs) have demonstrated impressive reasoning abilities in
complex tasks. However, they lack up-to-date knowledge and experience hallucinations …

A survey on graph neural networks for time series: Forecasting, classification, imputation, and anomaly detection

M Jin, HY Koh, Q Wen, D Zambon, C Alippi… - arXiv preprint arXiv …, 2023 - arxiv.org
Time series are the primary data type used to record dynamic system measurements and
generated in great volume by both physical sensors and online processes (virtual sensors) …