On the value of oversampling for deep learning in software defect prediction

R Yedida, T Menzies - IEEE Transactions on Software …, 2021 - ieeexplore.ieee.org
One truism of deep learning is that the automatic feature engineering (seen in the first layers
of those networks) excuses data scientists from performing tedious manual feature …

Evaluating classifiers in SE research: the ECSER pipeline and two replication studies

D Dell'Anna, FB Aydemir, F Dalpiaz - Empirical Software Engineering, 2023 - Springer
Context Automated classifiers, often based on machine learning (ML), are increasingly used
in software engineering (SE) for labelling previously unseen SE data. Researchers have …

Comparative study of random search hyper-parameter tuning for software effort estimation

L Villalobos-Arias, C Quesada-López - Proceedings of the 17th …, 2021 - dl.acm.org
Empirical studies on software effort estimation have employed hyper-parameter tuning
algorithms to improve model accuracy and stability. While these tuners can improve model …

Parameter tuning for software fault prediction with different variants of differential evolution

N Nikravesh, MR Keyvanpour - Expert Systems with Applications, 2024 - Elsevier
The cost of software testing could be reduced if faulty entities were identified prior to the
testing phase, which is possible with software fault prediction (SFP). In most SFP models …

How to find actionable static analysis warnings: A case study with FindBugs

R Yedida, HJ Kang, H Tu, X Yang, D Lo… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Automatically generated static code warnings suffer from a large number of false alarms.
Hence, developers only take action on a small percent of those warnings. To better predict …

How to improve deep learning for software analytics: (a case study with code smell detection)

R Yedida, T Menzies - Proceedings of the 19th International Conference …, 2022 - dl.acm.org
To reduce technical debt and make code more maintainable, it is important to be able to
warn programmers about code smells. State-of-the-art code small detectors use deep …

Enhanced Machine Learning-Based Code Smell Detection Through Hyper-Parameter Optimization

P Sukkasem, C Soomlek - 2023 20th International Joint …, 2023 - ieeexplore.ieee.org
To preserve software quality and maintainability, machine learning-based code smell
detection has been proposed, and the results are promising. This research proposes an …

Learning from Very Little Data: On the Value of Landscape Analysis for Predicting Software Project Health

A Lustosa, T Menzies - ACM Transactions on Software Engineering and …, 2024 - dl.acm.org
When data is scarce, software analytics can make many mistakes. For example, consider
learning predictors for open source project health (eg, the number of closed pull requests in …

Hyper-parameter tuning of classification and regression trees for software effort estimation

L Villalobos-Arias, C Quesada-López… - Trends and Applications …, 2021 - Springer
Classification and regression trees (CART) have been reported to be competitive machine
learning algorithms for software effort estimation. In this work, we analyze the impact of …

Predicting health indicators for open source projects (using hyperparameter optimization)

T Xia, W Fu, R Shu, R Agrawal, T Menzies - Empirical Software …, 2022 - Springer
Software developed on public platform is a source of data that can be used to make
predictions about those projects. While the individual developing activity may be random …