Interpretable machine learning for academic risk analysis in university students
Abstract
Higher education institutions often grapple with issues related to academic risk among their students. These academic risks encompass low academic performance, study delays, and dropouts. One approach to address these challenges is to predict students’ academic performance as accurately as possible by leveraging advanced computational techniques and utilizing academic and non-academic student data. This research aims to develop a model that accurately identifies students with high potential for academic risk while explaining the contributing factors to this phenomenon in the Faculty of Vocational Studies, Institut Teknologi Sepuluh Nopember (ITS). The prediction model is constructed using the light gradient boosting machine (LightGBM) method and is subsequently interpreted using the Shapley additive explanations (SHAP) value. Additionally, an oversampling method, based on synthetic minority oversampling technique (SMOTE), is implemented to address imbalances in the dataset. The proposed approach achieves 96% and 97% accuracy and specificity rates, respectively. Analysis based on SHAP values reveals that extracurricular activities, choice of major, smoking habit, gender, and friendship circle are among the top five factors impacting students’ academic risk.
Keywords
Academic risk; Interpretable machine learning; LightGBM; Shapley additive explanations; SMOTE
Full Text:
PDFDOI: http://doi.org/10.11591/ijai.v14.i4.pp3089-3098
Refbacks
- There are currently no refbacks.
Copyright (c) 2025 Institute of Advanced Engineering and Science
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938
This journal is published by the Institute of Advanced Engineering and Science (IAES).