Enhancing breast cancer diagnosis: a comparative analysis of feature selection techniques
Abstract
Breast cancer is a significant contributor to female mortality, emphasizing the importance of early detection. Predicting breast cancer accurately remains a complex challenge within medical data analysis. Machine learning (ML) algorithms offer valuable assistance in decision-making and diagnosis using medical data. Numerous research studies highlight the effectiveness of ML techniques in improving breast cancer prediction. Feature selection plays a pivotal role in data preprocessing, eliminating irrelevant and redundant features to minimize feature count and improve classification accuracy. This study focuses on optimizing breast cancer diagnostics through feature selection methods, specifically genetic algorithms (GA) and particle swarm optimization (PSO). The research involves a comparative analysis of these methods and the application of a diverse set of ML classification techniques, including logistic regression (LR), support vector machine (SVM), decision tree (DT), and ensemble methods like random forest (RF), AdaBoost, and gradient boosting (GB), using a breast cancer dataset. The models' performance is subsequently evaluated using various performance metrics. The experimental findings illustrate that PSO achieved the highest average accuracy, reaching 99.6% when applied to AdaBoost, while GA attained an accuracy rate of 99.5% when employed with both AdaBoost and RF.
Keywords
Breast cancer; Classification; Feature selection; Genetic algorithm; Particle swarm optimization
Full Text:
PDFDOI: http://doi.org/10.11591/ijai.v13.i4.pp4312-4322
Refbacks
- There are currently no refbacks.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938
This journal is published by the Institute of Advanced Engineering and Science (IAES).