complexFuzzy: A novel clustering method for selecting training instances of cross-project defect prediction
DOI:
https://doi.org/10.7494/csci.2021.22.1.3743Keywords:
cross-project defect prediction, complexFuzzy, training instance selection, fuzzy clusteringAbstract
Over the last decade, researchers have investigated to what extent cross-project defect prediction (CPDP) shows advantages over traditional defect prediction settings. These works do not take training and testing data of defect prediction from the same project. Instead, dissimilar projects are employed. Selecting proper training data plays an important role in terms of the success of CPDP. In this study, a novel clustering method named complexFuzzy is presented for selecting training data of CPDP. The method is developed by determining membership values with the help of some metrics which can be considered as indicators of complexity. First, CPDP combinations are created on 29 different data sets. Subsequently, complexFuzzy is evaluated by considering cluster centers of data sets and comparing some performance measures including area under the curve (AUC) and F-measure. The method is superior to other five comparison algorithms in terms of the distance of cluster centers and prediction performance.Downloads
Download data is not yet available.
Downloads
Published
2021-02-01
How to Cite
Ozturk, M. M. (2021). complexFuzzy: A novel clustering method for selecting training instances of cross-project defect prediction. Computer Science, 22(1). https://doi.org/10.7494/csci.2021.22.1.3743
Issue
Section
Articles
License
Copyright (c) 2021 Computer Science
This work is licensed under a Creative Commons Attribution 4.0 International License.