Deep learning for audio signal-based tempo classification scenarios

Muljono Muljono, Pulung Nurtantio Andono, Sari Ayu Wulandari, Harun Al Azies, Muhammad Naufal


This article explains how to determine the tempo of the kendhang, a traditional Indonesian melodic instrument. Technological research on gamelan instruments remains rare, and the novelty of this work lies in identifying kendhang tempo types from the sounds the instrument produces, with the aim of building an automatic system that can recognize the kendhang tempo during a gamelan performance. The experiments classify kendhang tempo into three categories (slow, medium, and fast) using two proposed scenario models: mel frequency cepstral coefficients (MFCC) with a convolutional neural network (CNN) in the first scenario, and a mel spectrogram with a CNN in the second. The data set consists of original kendhang audio that was recorded in real time and later enhanced. Based on the experimental results, the model 1 scenario, which uses MFCC for feature extraction and a CNN for classification, is the best scenario in this research. Compared with the other proposed modeling scenario, model 1 has a level of 97%, an average accuracy, and a gain value of 96.67%, making it a reliable aid for accurate kendhang tempo recognition.
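The two scenarios differ only in the features passed to the CNN, and the relationship between those features can be sketched in a few lines. The following is a minimal, standard-library-only illustration and not the authors' pipeline: the sample rate, frame length, number of mel filters, number of coefficients, and every function name are assumptions chosen for the example, and the CNN classifier itself is not shown. It computes log-mel energies for one frame (one column of a mel spectrogram, as in scenario 2) and then applies a DCT-II to obtain MFCCs (as in scenario 1).

```python
import cmath
import math

def hz_to_mel(f):
    # Common (HTK-style) mel formula; other variants exist.
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def power_spectrum(frame):
    """Naive DFT power spectrum of one Hann-windowed frame (bins 0..N/2)."""
    n = len(frame)
    windowed = [x * (0.5 - 0.5 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spec = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(windowed))
        spec.append(abs(acc) ** 2)
    return spec

def mel_filterbank(n_mels, n_fft, sr):
    """Triangular filters with centers spaced evenly on the mel scale."""
    top = hz_to_mel(sr / 2)
    hz_pts = [mel_to_hz(i * top / (n_mels + 1)) for i in range(n_mels + 2)]
    bin_freqs = [k * sr / n_fft for k in range(n_fft // 2 + 1)]
    bank = []
    for j in range(1, n_mels + 1):
        lo, mid, hi = hz_pts[j - 1], hz_pts[j], hz_pts[j + 1]
        bank.append([max(0.0, min((f - lo) / (mid - lo), (hi - f) / (hi - mid)))
                     for f in bin_freqs])
    return bank

def log_mel(frame, bank):
    """Log-mel energies of one frame: one column of a mel spectrogram."""
    spec = power_spectrum(frame)
    return [math.log(sum(w * p for w, p in zip(row, spec)) + 1e-10)
            for row in bank]

def mfcc(log_mel_frame, n_mfcc):
    """Unnormalized DCT-II of the log-mel energies gives the MFCCs."""
    m = len(log_mel_frame)
    return [sum(e * math.cos(math.pi * k * (i + 0.5) / m)
                for i, e in enumerate(log_mel_frame))
            for k in range(n_mfcc)]

# Hypothetical input: one 256-sample frame of a 1 kHz tone sampled at 8 kHz.
SR, N_FFT, N_MELS, N_MFCC = 8000, 256, 20, 13
frame = [math.sin(2 * math.pi * 1000 * i / SR) for i in range(N_FFT)]
bank = mel_filterbank(N_MELS, N_FFT, SR)
mel_column = log_mel(frame, bank)       # scenario 2 style feature
coeffs = mfcc(mel_column, N_MFCC)       # scenario 1 style feature
```

Stacking such columns over successive frames yields a two-dimensional array in either case, which is the kind of input a CNN can classify. In practice an audio library would normally replace the naive DFT used here.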


Convolutional neural network; Mel spectrogram; Mel frequency cepstral coefficients; Tempo recognition


This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938 
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).
