complexFuzzy: A novel clustering method for selecting training instances of cross-project defect prediction

Authors

DOI:

https://doi.org/10.7494/csci.2021.22.1.3743

Keywords:

cross-project defect prediction, complexFuzzy, training instance selection, fuzzy clustering

Abstract

Over the last decade, researchers have investigated to what extent cross-project defect prediction (CPDP) shows advantages over traditional defect prediction settings. These works do not take training and testing data of defect prediction from the same project. Instead, dissimilar projects are employed. Selecting proper training data plays an important role in terms of the success of CPDP. In this study, a novel clustering method named complexFuzzy is presented for selecting training data of CPDP. The method is developed by determining membership values with the help of some metrics which can be considered as indicators of complexity. First, CPDP combinations are created on 29 different data sets. Subsequently, complexFuzzy is evaluated by considering cluster centers of data sets and comparing some performance measures including area under the curve (AUC) and F-measure. The method is superior to other five comparison algorithms in terms of the distance of cluster centers and prediction performance.

Downloads

Download data is not yet available.

Author Biography

Muhammed Maruf Ozturk, Suleyman Demirel University

M. Maruf Ozturk received his Ph.D. degree from Sakarya University. He is currently working at Computer Engineering Department of Suleyman Demirel University as an Assistant Prof. His main interests lie on machine learning, software quality, green software, and test case prioritization.

Downloads

Published

2021-02-01

How to Cite

Ozturk, M. M. (2021). complexFuzzy: A novel clustering method for selecting training instances of cross-project defect prediction. Computer Science, 22(1). https://doi.org/10.7494/csci.2021.22.1.3743

Issue

Section

Articles