Predicting university student dropouts in Latin America using machine learning
Abstract
In the university context, student dropout has become one of the most recurring problems, both in the short and long term. The objective of this research was to develop a predictive model using the random forest (RF) algorithm to identify patterns associated with university dropout. To achieve this, the knowledge discovery in databases (KDD) methodology was applied, which encompasses the stages of selection, preprocessing, transformation, data mining, and interpretation of results. The RF model demonstrated superior performance compared to other evaluated models, achieving an accuracy of 87%, a precision of 86%, a recall of 85%, an F1-score of 85%, and an receiver operating characteristic (ROC) area under the curve (AUC) of 0.91, highlighting its high predictive capability compared to other techniques analyzed. Therefore, the application of the proposed model is recommended in various university institutions in order to identify potential dropout cases at an early stage.
Keywords
Decision making; Machine learning; Predictive model; Random forest; Student dropout
Full Text:
PDFDOI: http://doi.org/10.11591/ijai.v15.i1.pp628-641
Refbacks
- There are currently no refbacks.
Copyright (c) 2026 Laberiano Andrade-Arenas, Inoc Rubio Paucar, Margarita Giraldo Retuerto, Cesar Yactayo-Arias

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938
This journal is published by the Institute of Advanced Engineering and Science (IAES).