Boyer Moore String-Match Framework for A Hybrid Short Message Service Spam Filtering Technique

Arnold Adimabua Ojugo, David Ademola Oyemade

Abstract


Advances in technology and the proliferation of mobile device has continued to advance the ubiquitous nature of computing alongside its many prowess and improved features it brings as a disruptive technology to aid information sharing amongst many online users. This popularity, usage and adoption ease, mobility and portability of the mobile smartphone devices have allowed for its acceptability and popularity. Mobile smartphones continue to adopt the use of short messages services accompanied with a scenario for spamming to thrive. Spams are unsolicited message or inappropriate contents. An effective spam filter studies are limited as SMS are 140-bytes, 160-characters and rippled with abbreviation and slangs that further inhibits the effective training of models. The study proposes a string match algorithm used as deep learning ensemble on a hybrid spam filtering technique to normalize noisy features, expand text and use semantic dictionaries of disambiguation to train underlying learning heuristics and effectively classify SMS into legitimate and spam classes. Study uses a profile hidden Markov network to select and train the network structure and employs the deep neural network as a classifier network structure. Model achieves an accuracy of 97% with an error rate of 1.2%.

Keywords


Boyer Moore String Matching; Hybrid algorithm; Spam; Spam filters; String matching; Text processing;



DOI: http://doi.org/10.11591/ijai.v10.i3.pp%25p

Refbacks

  • There are currently no refbacks.


View IJAI Stats

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.