A novel method for examining promoters using statistical analysis and artificial intelligence learning

Sinan Salim Mohammed Sheet, Marwa Mawfaq Mohamedsheet Al-Hatab, Maysaloon Abed Qasim

Abstract


Accurately classifying promoters has become a significant focus in bioinformatics research. Although numerous studies have attempted to address this challenge, the performance of existing methods still leaves room for improvement this study, statistical feature analysis has been applied to the features that have been developed in our previous work. This approach extracted additional informative features from basic sequence characteristics and then used them together with the original and newly engineered features. Utilizing statistical feature analysis enhanced key patterns, which lead to an improvement in the accuracy of the promoter classification. Results demonstrated that our proposed method outperforms other models that use only basic features. The value of the area under the curve (AUC) of 0.83958 achieved when using the combined feature set confirmed the effectiveness of our approach. Furthermore, the AUC value reached 1 when these optimized features were used with naive Bayes (NB) classifier, referring to the strength of incorporating statistical analysis into feature design.

Keywords


Area under the curve; Deoxyribonucleic acid; Machine learning; Promoter; Statistical feature analysis

Full Text:

PDF


DOI: http://doi.org/10.11591/ijai.v14.i5.pp%25p

Refbacks

  • There are currently no refbacks.


Copyright (c) 2025 Sinan Salim Mohammed Sheet, Marwa Mawfaq Mohamedsheet Al-Hatab, Maysaloon Abed Qasim

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938 
This journal is published by the Institute of Advanced Engineering and Science (IAES).

View IJAI Stats