Word embedding for detecting cyberbullying based on recurrent neural networks

Noor Haydar Shaker, Ban N. Dhannoon

Abstract


The phenomenon of cyberbullying has spread and has become one of the biggest problems facing users of social media sites and generated significant adverse effects on society and the victim in particular. Finding appropriate solutions to detect and reduce cyberbullying has become necessary to mitigate its negative impacts on society and the victim. Twitter comments on two datasets are used to detect cyberbullying, the first dataset was the Arabic cyberbullying dataset, and the second was the English cyberbullying dataset. Three different pre-trained global vectors (GloVe) corpora with different dimensions were used on the original and preprocessed datasets to represent the words. Recurrent neural networks (RNN), long short-term memory (LSTM), Bidirectional LSTM (BiLSTM), gated recurrent unit (GRU), and Bidirectional GRU (BiGRU) classifiers utilized, evaluated and compared. The GRU outperform other classifiers on both datasets; its accuracy on the Arabic cyberbullying dataset using the Arabic GloVe corpus of dimension equal to 256D is 87.83%, while the accuracy on the English datasets using 100 D pre-trained GloVe corpus is 93.38%.

Keywords


Deep learning classifiers; Gated recurrent unit; GloVe word embedding; Long short-term memory; Recurrent neural networks;

Full Text:

PDF


DOI: http://doi.org/10.11591/ijai.v13.i1.pp500-508

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938 
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

View IJAI Stats