[HTML][HTML] Evidence of interrelated cognitive-like capabilities in large language models: Indications of artificial general intelligence or achievement?

D Ilić, GE Gignac - Intelligence, 2024 - Elsevier
Large language models (LLMs) are advanced artificial intelligence (AI) systems that can
perform a variety of tasks commonly found in human intelligence tests, such as defining …

Collaborative Performance Prediction for Large Language Models

Q Zhang, F Lyu, X Liu, C Ma - arXiv preprint arXiv:2407.01300, 2024 - arxiv.org
Comprehensively understanding and accurately predicting the performance of large
language models across diverse downstream tasks has emerged as a pivotal challenge in …

Observational Scaling Laws and the Predictability of Language Model Performance

Y Ruan, CJ Maddison, T Hashimoto - arXiv preprint arXiv:2405.10938, 2024 - arxiv.org
Understanding how language model performance varies with scale is critical to benchmark
and algorithm development. Scaling laws are one approach to building this understanding …

Forecastbench: A dynamic benchmark of ai forecasting capabilities

E Karger, H Bastani, C Yueh-Han, Z Jacobs… - arXiv preprint arXiv …, 2024 - arxiv.org
Forecasts of future events are essential inputs into informed decision-making. Machine
learning (ML) systems have the potential to deliver forecasts at scale, but there is no …

Performance Law of Large Language Models

C Wu, R Tang - arXiv preprint arXiv:2408.09895, 2024 - arxiv.org
Guided by the belief of the scaling law, large language models (LLMs) have achieved
impressive performance in recent years. However, scaling law only gives a qualitative …

Analysing the Predictability of Language Model Performance

W Schellaert, F Martínez-Plumed… - ACM Transactions on …, 2024 - dl.acm.org
Can a language model predict for which questions another language model will answer
successfully? We investigate the extent to which performance prediction is possible and …

Neural Scaling Laws for Embodied AI

S Sartor, N Thompson - arXiv preprint arXiv:2405.14005, 2024 - arxiv.org
Scaling laws have driven remarkable progress across machine learning domains like
language modeling and computer vision. However, the exploration of scaling laws in …

100 instances is all you need: predicting the success of a new LLM on unseen data by testing on a few instances

L Pacchiardi, LG Cheke, J Hernández-Orallo - arXiv preprint arXiv …, 2024 - arxiv.org
Predicting the performance of LLMs on individual task instances is essential to ensure their
reliability in high-stakes applications. To do so, a possibility is to evaluate the considered …

[HTML][HTML] An Approach to Predicting Energy Demand Within Automobile Production Using the Temporal Fusion Transformer Model

A Lenk, M Vogt, C Herrmann - Energies, 2024 - mdpi.com
The increasing share of renewable energies within energy systems leads to an increase in
complexity. The growing complexity is due to the diversity of technologies, ongoing …

Large Language Model Evaluation Criteria Framework in Healthcare: Fuzzy MCDM Approach

HM Alabool - SN Computer Science, 2025 - Springer
Abstract Large Language Models (LLMs) gained notable popularity in academia and
industry. It has unprecedented features and performance in many applications. LLMs are …