Validating smartphone-collected speech corpora
Loading...
Date
Authors
Van Heerden, Carel J.
Barnard, Etienne
Davel, Marelie H.
Journal Title
Journal ISSN
Volume Title
Publisher
SLTU
Abstract
We investigate the effectiveness with which the accuracy of a prompted speech corpus can be validated when minimal additional speech resources are available, and specifically when a language model in the target language is not available. We compare a word-based variant of Goodness of Pronunciation (GOP) with a phone-based dynamic programming (PDP) scoring technique. The first technique uses the acoustic likelihood ratio and the second the optimal alignment between an observed phone string (generated by a speech recogniser) and a reference phone string (obtained from a dictionary) to generate validation scores. We define a new technique to obtain a PDP scoring matrix in a data-driven fashion, examine different ways of using GOP for word scoring, and find that variants of both techniques provide results that are effective for corpus validation.
Description
International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU), Cape Town, South Africa, 7-9 May 2012
Citation
Davel, M.H. & Van Heerden, C.J., et al. 2012. Validating smartphone-collected speech corpora. In: International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU), Cape Town, South Africa, 7-9 May 2012.