NWU Institutional Repository

Comparing grapheme-based and phoneme-based speech recognition for Afrikaans

Loading...
Thumbnail Image

Date

Journal Title

Journal ISSN

Volume Title

Publisher

PRASA

Abstract

This paper compares the recognition accuracy of a phoneme-based automatic speech recognition system with that of a grapheme-based system, using Afrikaans as case study. The first system is developed using a conventional pronunciation dictionary, while the latter system uses the letters of each word directly as the acoustic units to be modelled. We ensure that the pronunciation dictionary we use is highly accurate and then investigate the extent to which ASR performance degrades when the dictionary is removed. We analyse this effect at different data set sizes and classify the causes of performance degradation. With grapheme-based ASR outperforming phoneme-based ASR in certain word categories, we find that relative error rates are highly dependent on word category, which points towards strategies for compensating for grapheme-based inaccuracies

Description

Citation

Basson, W.D. & Davel, M.H. 2012. Comparing grapheme-based and phoneme-based speech recognition for Afrikaans. Proceedings of the Twenty-Third Annual Symposium of the Pattern Recognition Association of South Africa. Pretoria. p. 144-148. [http://www.prasa.org/]

Endorsement

Review

Supplemented By

Referenced By