Abusive comment identification on Indonesian social media data using hybrid deep learning

Tiara Intana Sari, Zalfa Natania Ardilla, Nur Hayatin, Ruhaila Maskat

Abstract


Half of the entire social media users in Indonesia has experienced cyberbullying. Cyberbullying is one of the treatments received as an attack with abusive words. An abusive word is a word or phrase that contained harassment and is expressed be it spoken or in the form of text. This is a serious problem that must be controlled because the act has an impact on the victim's psychology and causes trauma resulting in depression. This study proposed to identify abusive comments from social media in Indonesian language using a deep learning approach. The architecture used is a hybrid model, a combination between recurrent neural network (RNN) and long short-term memory (LSTM). RNN can map the input sequences to fixed-size vectors on hidden vector components and LSTM implemented to overcome gradient vector growth components that have the potential to exist in RNN. The steps carried out include preprocessing, modelling, implementation, and evaluation. The dataset used is indonesian abusive and hate speech from Twitter data. The evaluation result showed that the model proposed produced an f-measure value of 94% with an increase in accuracy of 23%.

Keywords


abusive comments; deep learning; long short-term memory; recurrent neural network; sentiment analysis;

Full Text:

PDF


DOI: http://doi.org/10.11591/ijai.v11.i3.pp895-904

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938 
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

View IJAI Stats