Data quality matters: A case study on data label correctness for security bug report prediction

X Wu, W Zheng, X Xia, D Lo - IEEE Transactions on Software …, 2021 - ieeexplore.ieee.org
In the research of mining software repositories, we need to label a large amount of data to
construct a predictive model. The correctness of the labels will affect the performance of a …

How can i improve my app? classifying user reviews for software maintenance and evolution

S Panichella, A Di Sorbo, E Guzman… - 2015 IEEE …, 2015 - ieeexplore.ieee.org
App Stores, such as Google Play or the Apple Store, allow users to provide feedback on
apps by posting review comments and giving star ratings. These platforms constitute a …

Combining text mining and data mining for bug report classification

Y Zhou, Y Tong, R Gu, H Gall - Journal of Software: Evolution …, 2016 - Wiley Online Library
Bug reports represent an important information source for software construction.
Misclassification of these reports inevitably introduces bias. Manual examinations can help …

The impact of mislabeled changes by szz on just-in-time defect prediction

Y Fan, X Xia, DA Da Costa, D Lo… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
Just-in-Time (JIT) defect prediction-a technique which aims to predict bugs at change level-
has been paid more attention. JIT defect prediction leverages the SZZ approach to identify …

How we refactor and how we document it? On the use of supervised machine learning algorithms to classify refactoring documentation

EA AlOmar, A Peruma, MW Mkaouer… - Expert Systems with …, 2021 - Elsevier
Refactoring is the art of improving the structural design of a software system without altering
its external behavior. Today, refactoring has become a well-established and disciplined …

Toward the automatic classification of self-affirmed refactoring

EA AlOmar, MW Mkaouer, A Ouni - Journal of Systems and Software, 2021 - Elsevier
Abstract The concept of Self-Affirmed Refactoring (SAR) was introduced to explore how
developers document their refactoring activities in commit messages, ie, developers explicit …

On the documentation of refactoring types

EA AlOmar, J Liu, K Addo, MW Mkaouer… - Automated Software …, 2022 - Springer
Commit messages are the atomic level of software documentation. They provide a natural
language description of the code change and its purpose. Messages are critical for software …

Multi-factor duplicate question detection in stack overflow

Y Zhang, D Lo, X Xia, JL Sun - Journal of Computer Science and …, 2015 - Springer
Stack Overflow is a popular on-line question and answer site for software developers to
share their experience and expertise. Among the numerous questions posted in Stack …

Automated classification of software issue reports using machine learning techniques: an empirical study

N Pandey, DK Sanyal, A Hudait, A Sen - Innovations in Systems and …, 2017 - Springer
Software developers, testers and customers routinely submit issue reports to software issue
trackers to record the problems they face in using a software. The issues are then directed to …

Behind the scenes: On the relationship between developer experience and refactoring

EA AlOmar, A Peruma, MW Mkaouer… - Journal of Software …, 2024 - Wiley Online Library
Refactoring is widely recognized as one of the efficient techniques to manage technical debt
and maintain a healthy software project through enforcing best design practices, or coping …