Predicting the classification of high vowel sound by using artificial neural network: a study in forensic linguistics

Susanto Susanto, Deri Sis Nanda

Abstract


One of the tasks in forensic linguistics, especially forensic phonetics, is evaluating the speech sounds in the recordings. The speech evaluation aims at identifying and verifying speakers to predict if the sound were spoken by the suspect or not. The common problem in the task is determining which acoustic features of the speech sounds are reliable for the speaker identification and verification. The purpose of this research is studying formant frequencies to predict high vowel sounds /i/, and /u/ by using artificial neural network (ANN). Using three various normalization methods (i.e., softmax, z-score and sigmoid), we utilized multilayer perceptron on backpropagation ANN with the architectural models of 4-5-2, 4-10-2 and 4-20-2. The results show that the z-score normalization method provides higher accuracy than the other two in all formations and the 4-10-2 formation has shown the highest accuracy (92.26%).

Keywords


Artificial neural network; Forensic linguistics; Formant frequency; Normalization method; Vowel sound;

Full Text:

PDF


DOI: http://doi.org/10.11591/ijai.v13.i1.pp195-200

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938 
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

View IJAI Stats