Generalizing Clustering Inferences with ML Augmentation of Ordinal Survey Data

Authors

  • Bhupendera Kumar Jawaharlal Nehru University New Delhi
  • Rajeev Kumar Jawaharlal Nehru University, New Delhi, India

DOI:

https://doi.org/10.7494/csci.2024.25.1.5685

Abstract

In this paper, we attempt to generalize the ability to achieve quality inferences of survey data for a larger population through data augmentation and unification. Data augmentation techniques have proven effective in enhancing models' performance by expanding the dataset's size. We employ ML data augmentation, unification, and clustering techniques. First, we augment the \textit{limited} survey data size using data augmentation technique(s). Next, we carry out data unification, followed by clustering for inferencing.

We took two benchmark survey datasets to demonstrate the effectiveness of augmentation and unification. One is on features of students to be entrepreneurs, and the second is breast cancer survey data. We compare the results of the inference obtained from the raw survey data and the newly converted data. The results of this study indicate that the machine learning approach, data augmentation with the unification of data followed by clustering, can be beneficial for generalizing the inferences drawn from the survey data.

Downloads

Download data is not yet available.

Downloads

Published

2024-03-10

How to Cite

Kumar, B., & Kumar, R. . (2024). Generalizing Clustering Inferences with ML Augmentation of Ordinal Survey Data. Computer Science, 25(1). https://doi.org/10.7494/csci.2024.25.1.5685

Issue

Section

Articles

Most read articles by the same author(s)