JOURNAL

Layanan journal yang disediakan oleh Perpustakaan Universitas Gunadarma

The Impact of Data Re-Sampling on Learning Performance of Class Imbalanced Bankruptcy Prediction Models

Judul Artikel:The Impact of Data Re-Sampling on Learning Performance of Class Imbalanced Bankruptcy Prediction Models
Judul Terbitan:International Journal on Electrical Engineering and Informatics
ISSN:20856830
Bahasa:ENG
Tempat Terbit:Bandung
Tahun:0000
Volume:Vol. 10 Issue 3 0000
Penerbit:School of Electrical Engineering and Informatics
Frekuensi Penerbitan:-
Penulis:Dilip Singh Sisodia and Upasana Verma
Abstraksi:The aim of this paper is to evaluate the effect of data sampling techniques on the performance of learners using real highly imbalanced Spanish bankruptcy dataset. The class imbalance problem refers to the highly uneven distribution of class instances where one class is having most of the instances than others. In the presence of highly skewed data distribution, the performance of classical learners is heavily biased in recognizing the majority class and consequently leads to the performance degradation of quantitative classifier or predictors models. In this paper, six sampling methods such as synthetic minority oversampling technique (SMOTE), Borderline-SMOTE, Safe-level-SMOTE, Random under sampling, random oversampling and condensed nearest neighbor are used with a different individual(SVM, C4.5, and Logistic regression) and ensemble learners(AdaBoostMl, DTBagging, and Random Forests). The different quantitative prediction models are designed by combination data sampling techniques and classical learners. The performance of quantitative prediction models are evaluated using G-Mean and area under the curve (AUC) measures on the real highly imbalanced data set. The result suggest that the performance of oversampling (with LR and DTBagging) and undersampling (with C4.5 and RF) methods are superior as compare to others on this data set.
Kata Kunci:Class imbalance; Ensemble learners; Individual learners; Prediction; Sampling; Bankruptcy Prediction Model; and Performance Evaluation.
Lokasi:p433
Terakreditasi:belum