Improve malware classifiers performance using cost-sensitive learning for imbalanced dataset

Ikram Ben Abdel Ouahab, Lotfi Elaachak, Mohammed Bouhorma


In recent times, malware visualization has become very popular for malware
classification in cybersecurity. Existing malware features can easily identify
known malware that have been already detected, but they cannot identify new
and infrequent malwares accurately. Moreover, deep learning algorithms
show their power in term of malware classification topic. However, we found
the use of imbalanced data; the Malimg database which contains 25 malware
families don’t have same or near number of images per class. To address these
issues, this paper proposes an effective malware classifier, based on costsensitive deep learning. When performing classification on imbalanced data, some classes get less accuracy than others. Cost-sensitive is meant to solve this issue, however in our case of 25 classes, classical cost-sensitive weights wasn’t effective is giving equal attention to all classes. The proposed approach improves the performance of malware classification, and we demonstrate this improvement using two Convolutional Neural Network models using functional and subclassing programming techniques, based on loss, accuracy, recall and precision.


Convolutional neural network; Cost-sensitive; Cybersecurity; Deep learning; Malware classification

Full Text:




  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938 
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

View IJAI Stats