Comparison of various data mining methods for early diagnosis of human cardiology

Abeer Mohammed Shanshool, Enas Mohammed Hussien Saeed, Hasan Hadi Khaleel


Recent healthcare reports indicate clearly an increasing mortality rates worldwide which puts a significant burden on the healthcare sector due to different diseases. Coronary artery diseases (CAD) is one of the main reasons of these uprising death rates since it affects the heart directly. For early diagnosis and treatment of CADs, a swiftly growing technology called data mining has been used to collect and categorize necessary data from patients; age, blood sugar and pressure, a type of thorax pain, cholesterol, and so on. Therefore, this paper adopted four data mining methods; decision tree (DT), logistic regression (LR), random forest (RF), and Naïve Bayes (NB) to achieve the goal. The paper utilized the Cleveland dataset along with Python programming language to compare among the four data mining methods in terms of precision, accuracy, recall, and area under the curve. The results illustrated that NB method has the best accuracy of 89.47% compared with previous studies which will help with accurate, fast and inexpensive diagnosis of CADs.


Cardiology; Classification algorithms; Cleveland dataset; Data mining; Python

Full Text:




  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938 
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

View IJAI Stats