Multi-task deep learning for Vietnamese capitalization and punctuation recognition

Phuong-Nhung Nguyen, Nguyen Thu-Hien, Truong-Thang Nguyen, Nguyen Thi Thu-Nga, Nguyen Thi Anh-Phuong, Tuan-Linh Nguyen

Abstract


Speech recognition is the process of converting the speech signal of a particular language into a sequence of corresponding content words in text format. The output text of automatic speech recognition (ASR) systems often lacks struc- ture, such as punctuation, capitalization of the first letter of a sentence, proper nouns, and names of locations. This absence of structure complicates compre- hension and restricts the utility of ASR-generated text in various applications, such as creating movie subtitles, generating transcripts for online meetings, and extracting customer information. Therefore, developing standardization solu- tions for the output text from ASR is necessary to improve the overall quality of ASR systems. In this article, we use the idea of multitask deep learning for the task of capitalization and punctuation recognition (CPR) for the output text of Vietnamese ASR, with the aim of the named entity recognition (NER) task as a supplement to help the CPR model perform better, and proposed to use text-to- speech (TTS) to create a dataset for CPR-NER multitask model training. The experiment results show that the multi-task deep learning model has improved CPR results by 6.2% of F1 score with ASR output and 7.1% on raw text.


Keywords


Capitalization; Deep learning; Multitask learning; Named entity recognition; Punctuation

Full Text:

PDF


DOI: http://doi.org/10.11591/ijai.v14.i2.pp1605-1615

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

IAES International Journal of Artificial Intelligence (IJ-AI)
ISSN/e-ISSN 2089-4872/2252-8938 
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).

View IJAI Stats