Inadequacies of large language model benchmarks in the era of generative artificial intelligence

TR McIntosh, T Susnjak, N Arachchilage, T Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid rise in popularity of Large Language Models (LLMs) with emerging capabilities
has spurred public curiosity to evaluate and compare different LLMs, leading many …

Deep configuration performance learning: A systematic survey and taxonomy

J Gong, T Chen - ACM Transactions on Software Engineering and …, 2024 - dl.acm.org
Performance is arguably the most crucial attribute that reflects the quality of a configurable
software system. However, given the increasing scale and complexity of modern software …

A survey on machine learning techniques for source code analysis

T Sharma, M Kechagia, S Georgiou, R Tiwari… - arXiv preprint arXiv …, 2021 - arxiv.org
The advancements in machine learning techniques have encouraged researchers to apply
these techniques to a myriad of software engineering tasks that use source code analysis …

[HTML][HTML] Fairness for machine learning software in education: A systematic mapping study

N Pham, PN Hung, A Nguyen-Duc - Journal of Systems and Software, 2024 - Elsevier
The integration of machine learning (ML) systems into various sectors, notably education,
has great potential to transform business workflows and decision-making processes …

On the use of evaluation measures for defect prediction studies

R Moussa, F Sarro - Proceedings of the 31st ACM SIGSOFT International …, 2022 - dl.acm.org
Software defect prediction research has adopted various evaluation measures to assess the
performance of prediction models. In this paper, we further stress on the importance of the …

[HTML][HTML] A survey on machine learning techniques applied to source code

T Sharma, M Kechagia, S Georgiou, R Tiwari… - Journal of Systems and …, 2024 - Elsevier
The advancements in machine learning techniques have encouraged researchers to apply
these techniques to a myriad of software engineering tasks that use source code analysis …

Do performance aspirations matter for guiding software configuration tuning? an empirical investigation under dual performance objectives

T Chen, M Li - ACM Transactions on Software Engineering and …, 2023 - dl.acm.org
Configurable software systems can be tuned for better performance. Leveraging on some
Pareto optimizers, recent work has shifted from tuning for a single, time-related performance …

How do Android developers improve non-functional properties of software?

J Callan, O Krauss, J Petke, F Sarro - Empirical Software Engineering, 2022 - Springer
Nowadays there is an increased pressure on mobile app developers to take non-functional
properties into account. An app that is too slow or uses much bandwidth will decrease user …

An empirical study on the fairness of pre-trained word embeddings

E Sesari, M Hort, F Sarro - Proceedings of the 4th Workshop on …, 2022 - aclanthology.org
Pre-trained word embedding models are easily distributed and applied, as they alleviate
users from the effort to train models themselves. With widely distributed models, it is …

[PDF][PDF] Search-based software engineering in the era of modern software systems

F Sarro - Proceedings of the IEEE International Conference on …, 2023 - discovery.ucl.ac.uk
Search-Based Software Engineering in the Era of Modern Software Systems Page 1 Search-Based
Software Engineering in the Era of Modern Software Systems Federica Sarro Department of …