A proposed approach for plagiarism detection in Myanmar Unicode text
Abstract
Around the world, with technology that improves over time, almost everyone can access the internet easily and quickly. With the increase in the use of the internet, the plagiarism of information that is easily available on the internet has also increased. Such plagiarism seriously undermines originality and ethical principles. In order to prevent these incidents, there is plagiarism detection software for many countries and languages, but there is no plagiarism detection software for the Myanmar language yet. In an attempt to fill that gap, this study proposed a deep learning model with Rabin-Karp hash code and Word2vec model and built a plagiarism detection system. Our deep learning model was trained by randomly obtaining information from Myanmar Wikipedia. According to the experiments, our proposed model can effectively detect plagiarism of educational content and information from Myanmar Wikipedia. Moreover, it is possible to distinguish plagiarized texts by rearranging words or substituting words with some synonyms. This study contributes to a broader understanding of the complexities of plagiarism in the Myanmar academic area and highlights the importance of measures to effectively prevent plagiarism. It maintains the credibility of education and promotes a culture that values originality and intellectual integrity.
Keywords
Deep learning; Myanmar Unicode; Natural language processing; Plagiarism detection; Syllables segmentation
Full Text:
PDFDOI: http://doi.org/10.11591/ijai.v14.i2.pp1616-1624
Refbacks
- There are currently no refbacks.
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).