Enhancing phishing website detection: a comparative study of SMOTETomek-XGB and SMOTEENN-XGB

Kamal Omari, Ayoub Oukhatar

Abstract


In the evolving landscape of cybersecurity, phishing websites continue to be a persistent threat, challenging detection methods due to the significant class imbalance between phishing and legitimate websites. This study evaluates the effectiveness of two advanced hybrid-resampling techniques SMOTETomek and SMOTEENN integrated with the extreme gradient boosting (XGBoost) classifier to enhance phishing website detection. SMOTETomek combines the synthetic minority over-sampling technique (SMOTE) with Tomek links, creating synthetic examples and eliminating overlapping instances to address dataset imbalance. SMOTEENN, on the other hand, merges SMOTE with edited nearest neighbors (ENN) to improve class balance through synthetic sample generation and noise reduction. The comparative analysis reveals that both methods significantly enhance classification performance, SMOTETomek-XGB consistently outperforms SMOTEENN-XGB across key evaluation metrics, including accuracy, F1 score, recall, and receiver operating characteristic - area under the curve (ROC-AUC), underscoring its superior effectiveness in distinguishing phishing sites from legitimate ones. This study offers practical insights into the application of advanced resampling methods for improving machine learning model performance in cybersecurity.

Keywords


Class imbalance; Phishing website detection; SMOTEENN techniques; SMOTETomek techniques; XGBoost classifier

Full Text:

PDF


DOI: http://doi.org/10.11591/ijai.v15.i3.pp2935-2945

Refbacks

  • There are currently no refbacks.


Copyright (c) 2026 Kamal Omari, Ayoub Oukhatar

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938 
This journal is published by the Institute of Advanced Engineering and Science (IAES).

View IJAI Stats