Integrating random forest and genetic algorithms for improved kidney disease prediction
Abstract
This work offers a novel method for predicting chronic kidney disease (CKD) by combining random forest (RF) classification with genetic algorithm (GA) to optimize important parameters. The dataset comprises 1,659 patients with 51 clinical parameters. The suggested method emphasizes the optimization of random state values, test size, and essential hyperparameters, such as the number of trees in the forest, the least number of samples needed at a leaf node, and the smallest number of samples necessary to split an internal node. The optimization process is conducted in two stages: the first stage optimizes the random state and test size, while the second stage focuses on hyperparameters. Through extensive simulations over 50 runs, the study demonstrates that the optimized model achieves an accuracy ranging from 0.9451 to 0.9738. The results indicate a maximum increase in accuracy of 2.09%, showcasing the effectiveness of the GA-RF integrated approach in enhancing model performance. This work provides valuable insights into the impact of parameter optimization on machine learning (ML) models, particularly in medical diagnostics, and offers a robust framework for developing highly accurate predictive models.
Keywords
Chronic kidney disease; Genetic algorithm; Machine learning; Optimization; Predictive model; Random forest
Full Text:
PDFDOI: http://doi.org/10.11591/ijai.v14.i4.pp2797-2804
Refbacks
- There are currently no refbacks.
Copyright (c) 2025 Institute of Advanced Engineering and Science
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938
This journal is published by the Institute of Advanced Engineering and Science (IAES).