Cross-dataset design discussion mining

A Mahadi, K Tongay, NA Ernst - 2020 IEEE 27th International …, 2020 - ieeexplore.ieee.org
2020 IEEE 27th International Conference on Software Analysis …, 2020ieeexplore.ieee.org
Being able to identify software discussions that are primarily about design—which we call
design mining—can improve documentation and maintenance of software systems. Existing
design mining approaches have good classification performance using natural language
processing (NLP) techniques, but the conclusion stability of these approaches is generally
poor. A classifier trained on a given dataset of software projects has so far not worked well
on different artifacts or different datasets. In this study, we replicate and synthesize these …
Being able to identify software discussions that are primarily about design—which we call design mining—can improve documentation and maintenance of software systems. Existing design mining approaches have good classification performance using natural language processing (NLP) techniques, but the conclusion stability of these approaches is generally poor. A classifier trained on a given dataset of software projects has so far not worked well on different artifacts or different datasets. In this study, we replicate and synthesize these earlier results in a meta—analysis. We then apply recent work in transfer learning for NLP to the problem of design mining. However, for our datasets, these deep transfer learning classifiers perform no better than less complex classifiers. We conclude by discussing some reasons behind the transfer learning approach to design mining.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果