Accuracy based-stacked ensemble learning model for the prediction of coronary heart disease

Santosini Bhutia, Bichitrananda Patra, Mitrabinda Ray

Abstract


Coronary heart disease (CHD) is the primary cause of silent and noncommunicable deaths. Early detection is essential for slowing the progression of death and saving lives. Medical researchers use machine learning techniques to predict CHD. This article proposes an accuracy based-stacked ensemble learning (AB-SEL) model to predict CHD while minimizing computational time (CT). The dataset undergoes the logistic regression recursive feature elimination (LR-RFE) method to identify the important features. The three strong classifiers, logistic regression (LR), random forest (RF), and AdaBoost, are chosen to build ensemble machine-learning models, including techniques like bagging, majority voting, and stacking, for the Cleveland dataset accessible from Kaggle. Data scaling was done using the normal scalar method, and hyperparameter optimization was done using random search and grid search. Effectiveness is measured by accuracy, precision, recall, F1 score, and CT is validated through 5-fold cross-validation. The suggested approach achieved a 90.16% accuracy rate, required only 0.2 seconds of CT, and yielded an area under the curve (AUC) of 0.892.

Keywords


Heart disease; Machine learning; Ensemble learning; Logistic regression-recursive feature elimination feature selection; Grid search; Random search; 5-fold cross-validation

Full Text:

PDF


DOI: http://doi.org/10.11591/ijai.v13.i4.pp4516-4525

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938 
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

View IJAI Stats