Overlapped music segmentation using a new effective feature and random forests

Duraid Y. Mohammed, Khamis A. Al-Karawi, Philip Duncan, Francis F. Li


In the field of audio classification, audio signals may be broadly divided into three classes: speech, music and events. Most studies, however, neglect that real audio soundtracks can have any combination of these classes simultaneously. This can result in information loss, thus compromising the knowledge discovery. In this study, a novel feature, “Entrocy”, is proposed for the detection of music in both pure form and overlapping with the other audio classes. Entrocy is defined as the variation of the information (or entropy) in an audio segment over time. Segments, which contain music, were found to have lower Entrocy since there are fewer abrupt changes over time. We have also compared Entrocy with existing music detection features and the entrocy showing a good performance.


Audio content analysis; Audio indexing; Entropy; Music detection; Real world audio classification

Full Text:


DOI: http://doi.org/10.11591/ijai.v8.i2.pp181-189


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938 
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

View IJAI Stats