Search

Now showing items 1-9 of 9

The NCHLT Speech Corpus of the South African languages

Barnard, Etienne; Davel, Marelie H.; van Heerden, Charl; De Wet, Febe; Badenhorst, Jaco (Workshop Spoken Language Technologies for Under-resourced Languages (SLTU), 2014)

The NCHLT speech corpus contains wide-band speech from approximately 200 speakers per language, in each of the eleven official languages of South Africa. We describe the design and development processes that were ...

G2P variant prediction techniques for ASR and STD

Davel, Marelie H.; van Heerden, Charl; Barnard, Etienne (Interspeech 2013, 2013)

Introducing pronunciation variants into a lexicon is a balancing act: incorporating necessary variants can improve automatic speech recognition (ASR) and spoken term detection (STD) performance by capturing some of the ...

The South African directory enquiries (SADE) name corpus

Thirion, Jan Willem Frederick; Van Heerden, Charl Johannes; Giwa, Oluwapelumi; Davel, Marelie Hattingh (Springer, 2020)

We present the design and development of a South African directory enquiries (DE) corpus. It contains audio and orthographic transcriptions of a wide range of South African names produced by first language speakers of four ...

Comparing grapheme-based and phoneme-based speech recognition for Afrikaans

Basson, Willem D.; Davel, Marelie H. (PRASA, 2012)

This paper compares the recognition accuracy of a phoneme-based automatic speech recognition system with that of a grapheme-based system, using Afrikaans as case study. The first system is developed using a conventional ...

Efficient harvesting of Internet audio for resource-scarce ASR

Davel, Marelie H.; van Heerden, Charl; Kleynhans, Neil; Barnard, Etienne (Interspeech 2011, 2011)

Spoken recordings that have been transcribed for human reading (e.g. as captions for audiovisual material, or to provide alternative modes of access to recordings) are widely available in many languages. Such recordings ...

Performance analysis of a multilingual directory enquiries application

van Heerden, Charl; Davel, Marelie H.; Barnard, Etienne (Pattern Recognition Association of South Africa and Mechatronics International Conference, 2014)

In a multilingual society such as South Africa, a practical directory enquiries (DE) application should be able to serve users from various language backgrounds with information relating to names in various languages: ...

Number pronunciation in a multilingual environment and implications for an ASR system

Molapo, Raymond; Barnard, Etienne (Pattern Recognition Association of South Africa and Mechatronics International Conference, 2014)

The purpose of this paper is to address the challenges and describe step-by-step solutions faced when developing an automatic speech recognition system in multilingual societies. We give a brief statistical analysis of ...

Introduction to the special issue on processing under-resourced languages

Besacier, L.; Barnard, E.; Karpov, A.; Schultz, T. (Speech Communications, 2014)

The creation of language and acoustic resources, for any given spoken language, is typically a costly task. For example, a large amount of time and money is required to properly create annotated speech corpora for automatic ...

Bilateral G2P accuracy: measuring the effect of variants

Giwa, Oluwapelumi; Davel, Marelie H. (Pattern Recognition Association of South Africa and Mechatronics International Conference, 2017)

Incorporating pronunciation variants in a dictionary is controversial, as this can be either advantageous or detrimental for a speech recognition system. Grapheme-tophoneme (G2P) accuracy can help guide this decision, ...