Classification of regional language dialects using convolutional neural network and multilayer perceptron
Abstract
Regional languages are vital for communication and for preserving cultural identity and local heritage. However, globalization and modernization endanger them as national or global languages increasingly take their place. Despite progress in dialect recognition research for certain languages, further studies are needed to improve model performance and to cover less-represented dialects, including those in Indonesia. This study enhances a custom-built dataset for dialect recognition by applying three data augmentation techniques: noise addition, time stretching, and pitch shifting. Using Mel-frequency cepstral coefficients (MFCC) for feature extraction, it evaluates the performance of a convolutional neural network (CNN) and a multilayer perceptron (MLP) in classifying six Indonesian dialects. Results indicate that the CNN outperformed the MLP, achieving 97.92% accuracy, 97.90% recall, 97.97% precision, 97.92% F1-score, and a kappa score of 97.49% with the combined augmentation techniques, setting a foundation for further research.
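For readers less familiar with the reported metrics, the following is a minimal sketch of how accuracy, precision, recall, F1-score, and Cohen's kappa can be derived from a multi-class confusion matrix. It is not the authors' evaluation code. The function name `classification_metrics`, the macro averaging over classes, and the three-class toy matrix are assumptions made for illustration; the abstract does not state which averaging scheme was used, and the toy counts are not data from the study.

```python
from typing import Dict, List


def classification_metrics(confusion: List[List[int]]) -> Dict[str, float]:
    """Compute accuracy, macro precision/recall/F1, and Cohen's kappa.

    confusion[i][j] counts samples whose true class is i and whose
    predicted class is j. Macro averaging is an assumption of this sketch.
    """
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    true_counts = [sum(row) for row in confusion]  # row sums
    pred_counts = [sum(confusion[i][j] for i in range(n)) for j in range(n)]  # column sums

    # Observed agreement: fraction of samples on the diagonal.
    accuracy = sum(confusion[k][k] for k in range(n)) / total

    precisions, recalls, f1s = [], [], []
    for k in range(n):
        tp = confusion[k][k]
        p = tp / pred_counts[k] if pred_counts[k] else 0.0
        r = tp / true_counts[k] if true_counts[k] else 0.0
        f = 2 * p * r / (p + r) if (p + r) else 0.0
        precisions.append(p)
        recalls.append(r)
        f1s.append(f)

    # Expected agreement by chance, from the marginal class frequencies.
    p_e = sum(true_counts[k] * pred_counts[k] for k in range(n)) / (total * total)
    # Cohen's kappa: agreement beyond chance, scaled to at most 1.
    kappa = (accuracy - p_e) / (1.0 - p_e) if p_e != 1.0 else 0.0

    return {
        "accuracy": accuracy,
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
        "kappa": kappa,
    }


if __name__ == "__main__":
    # Hypothetical 3-class confusion matrix (rows: true, columns: predicted).
    toy = [
        [9, 1, 0],
        [0, 10, 0],
        [1, 0, 9],
    ]
    for name, value in classification_metrics(toy).items():
        print(f"{name}: {value:.4f}")
```

Kappa is conventionally a value of at most 1; the abstract reports it as a percentage, so a kappa of 97.49% would correspond to roughly 0.9749 on that scale.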
Keywords
Convolutional neural network; Dialect recognition; Mel-frequency cepstral coefficients; Multilayer perceptron; Regional language dialects
DOI: http://doi.org/10.11591/ijai.v14.i6.pp5017-5026
Copyright (c) 2025 Fahmi B. Marasabessy, Dwiza Riana, Muji Ernawati

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938
This journal is published by the Institute of Advanced Engineering and Science (IAES).