TENFOLD BOOTSTRAP PROCEDURE FOR SUPPORT VECTOR MACHINES

Borislava Vrigazova, Ivan Ivanov

Abstract


Cross-validation is often used to split input data into training and test sets when fitting support vector machines (SVM). The two most commonly used versions are tenfold and leave-one-out cross-validation. Another commonly used resampling method is the random train/test split. The advantage of these methods is that they help avoid overfitting and support model selection. However, they can increase the computational time needed to fit support vector machines as the size of the dataset grows. In this research, we propose an alternative for fitting SVM, which we call the tenfold bootstrap for support vector machines. This resampling procedure can significantly reduce execution time even when the number of observations is large, while preserving the model's accuracy. With this finding, we propose a solution to the problem of slow execution time when fitting support vector machines on big datasets.
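
The abstract does not spell out the resampling steps, so the following is only a minimal sketch of one plausible reading: ten bootstrap rounds, each drawing a training set with replacement and using the observations that were never drawn (the out-of-bag observations) as the test set. The function name tenfold_bootstrap_splits, the out-of-bag test set, and the parameters n_rounds and seed are assumptions made for illustration and are not taken from the paper.

```python
import random


def tenfold_bootstrap_splits(n_samples, n_rounds=10, seed=0):
    """Return n_rounds (train, test) index pairs built by bootstrap resampling.

    Assumption (not from the paper): the test set of each round consists of
    the out-of-bag observations, i.e. those never drawn into the training set.
    """
    rng = random.Random(seed)
    splits = []
    for _ in range(n_rounds):
        # Draw n_samples indices with replacement to form the training set.
        train = [rng.randrange(n_samples) for _ in range(n_samples)]
        in_bag = set(train)
        # Observations that were never drawn become the test set.
        test = [i for i in range(n_samples) if i not in in_bag]
        splits.append((train, test))
    return splits


if __name__ == "__main__":
    for round_no, (train, test) in enumerate(tenfold_bootstrap_splits(100), 1):
        print(round_no, len(train), len(test))
```

Each pair of index lists could then be used to fit an SVM on the training rows and score it on the test rows, averaging the scores over the ten rounds. Unlike leave-one-out cross-validation, the number of model fits in this sketch stays fixed at ten regardless of the number of observations.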



DOI: https://doi.org/10.7494/csci.2020.21.2.3634
