A two-step intelligent framework for gene expression-based cancer diagnosis
Abstract
DNA microarray technology has advanced cancer diagnosis by enabling large-scale gene expression analysis, yet selecting relevant genes and classifying samples accurately remain challenging. This study introduces two novel methods: the three-stage gene selection (3SGS) method and the statistics classifier (SC). The 3SGS method reduces the dimensionality of gene expression data by eliminating redundant, noisy, and less informative genes, while the SC uses statistical measures of gene expression to classify samples with high accuracy and speed. Evaluated on leukemia, prostate cancer, and colon cancer datasets, the 3SGS method identified minimal yet informative gene subsets, achieving 100% accuracy for leukemia, 99.3% for prostate cancer, and 97% for colon cancer. The SC consistently outperformed traditional models in both accuracy and computational efficiency, completing predictions in under 2 seconds per dataset. Unlike conventional classifiers, it requires no parameter tuning and performs reliably even with small gene sets. Although these results are promising, future work should address multiclass classification and clinical validation to broaden the framework’s applicability. Together, these methods offer a precise and rapid cancer classification framework that supports early diagnosis and personalized treatment strategies across diverse cancer types.
Keywords
Cancer classification; Computer science; Feature selection; Image processing; Machine learning
DOI: http://doi.org/10.11591/ijai.v14.i6.pp4731-4738
Copyright (c) 2025 Sara Haddou Bouazza, Jihad Haddou Bouazza

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938
This journal is published by the Institute of Advanced Engineering and Science (IAES).