NWU Institutional Repository

Validating smartphone-collected speech corpora

Loading...
Thumbnail Image

Date

Authors

Van Heerden, Carel J.
Barnard, Etienne
Davel, Marelie H.

Journal Title

Journal ISSN

Volume Title

Publisher

SLTU

Abstract

We investigate the effectiveness with which the accuracy of a prompted speech corpus can be validated when minimal additional speech resources are available, and specifically when a language model in the target language is not available. We compare a word-based variant of Goodness of Pronunciation (GOP) with a phone-based dynamic programming (PDP) scoring technique. The first technique uses the acoustic likelihood ratio and the second the optimal alignment between an observed phone string (generated by a speech recogniser) and a reference phone string (obtained from a dictionary) to generate validation scores. We define a new technique to obtain a PDP scoring matrix in a data-driven fashion, examine different ways of using GOP for word scoring, and find that variants of both techniques provide results that are effective for corpus validation.

Description

International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU), Cape Town, South Africa, 7-9 May 2012

Citation

Davel, M.H. & Van Heerden, C.J., et al. 2012. Validating smartphone-collected speech corpora. In: International Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU), Cape Town, South Africa, 7-9 May 2012.

Endorsement

Review

Supplemented By

Referenced By