Unsupervised acoustic model training: comparing South African English and isiZulu

Kleynhans, Neil; De Wet, Febe; Barnard, Etienne

Unsupervised acoustic model training: comparing South African English and isiZulu

Files

kleynhans-2015-model-training.pdf (111.76 KB)

Date

2015

Authors

Kleynhans, Neil

De Wet, Febe

Barnard, Etienne

Researcher ID

21021287 - Barnard, Etienne

Publisher

IEEE

Abstract

Large amounts of untranscribed audio data are generated every day. These audio resources can be used to develop robust acoustic models that can be used in a variety of speech-based systems. Manually transcribing this data is resource intensive and requires funding, time and expertise. Lightly-supervised training techniques, however, provide a means to rapidly transcribe audio, thus reducing the initial resource investment to begin the modelling process. Our findings suggest that the lightly-supervised training technique works well for English but when moving to an agglutinative language, such as isiZulu, the process fails to achieve the performance seen for English. Additionally, phone-based performances are significantly worse when compared to an approach using word-based language models. These results indicate a strong dependence on large or well-matched text resources for lightly-supervised training techniques.

Keywords

Lightly-supervised training, Unsupervised training, Automatic transcription generation, Audio harvesting, English, isiZulu

Citation

Neil Kleynhans, Febe de Wet and Etienne Barnard, “Unsupervised acoustic model training: comparing South African English and isiZulu”, in Proc. Annual Symp. Pattern Recognition Association of South Africa (PRASA), pp 136 - 141, Port Elizabeth, South Africa, 2015. [http://engineering.nwu.ac.za/multilingual-speech-technologies-must/publications]

URI

http://ieeexplore.ieee.org/document/7359512/
https://researchspace.csir.co.za/dspace/handle/10204/8629
http://hdl.handle.net/10394/26490

Collections

Faculty of Engineering

Full item page

Unsupervised acoustic model training: comparing South African English and isiZulu

Files

Date

Authors

Researcher ID

Supervisors

Journal Title

Journal ISSN

Volume Title

Publisher

Record Identifier

Abstract

Sustainable Development Goals

Description

Keywords

Citation

URI

Collections

Endorsement

Review

Supplemented By

Referenced By