Search
The NCHLT Speech Corpus of the South African languages
(Workshop Spoken Language Technologies for Under-resourced Languages (SLTU), 2014)
The NCHLT speech corpus contains wide-band speech from approximately
200 speakers per language, in each of the eleven
official languages of South Africa. We describe the design and
development processes that were ...
G2P variant prediction techniques for ASR and STD
(Interspeech 2013, 2013)
Introducing pronunciation variants into a lexicon is a balancing
act: incorporating necessary variants can improve automatic
speech recognition (ASR) and spoken term detection (STD)
performance by capturing some of the ...
The South African directory enquiries (SADE) name corpus
(Springer, 2020)
We present the design and development of a South African directory enquiries (DE) corpus. It contains audio and orthographic transcriptions of a wide range of South African names produced by first language speakers of four ...
Comparing grapheme-based and phoneme-based speech recognition for Afrikaans
(PRASA, 2012)
This paper compares the recognition accuracy of a phoneme-based automatic speech recognition system with that of a grapheme-based system, using Afrikaans as case study. The first system is developed using a conventional ...
Efficient harvesting of Internet audio for resource-scarce ASR
(Interspeech 2011, 2011)
Spoken recordings that have been transcribed for human reading
(e.g. as captions for audiovisual material, or to provide alternative
modes of access to recordings) are widely available in many
languages. Such recordings ...
Performance analysis of a multilingual directory enquiries application
(Pattern Recognition Association of South Africa and Mechatronics International Conference, 2014)
In a multilingual society such as South Africa, a
practical directory enquiries (DE) application should be able to
serve users from various language backgrounds with information
relating to names in various languages: ...
Number pronunciation in a multilingual environment and implications for an ASR system
(Pattern Recognition Association of South Africa and Mechatronics International Conference, 2014)
The purpose of this paper is to address the challenges
and describe step-by-step solutions faced when developing an
automatic speech recognition system in multilingual societies.
We give a brief statistical analysis of ...
Introduction to the special issue on processing under-resourced languages
(Speech Communication, 2014)
The creation of language and acoustic resources, for any given spoken language, is typically a costly task. For example, a large amount of time and money is required to properly create annotated speech corpora for automatic ...
Bilateral G2P accuracy: measuring the effect of variants
(Pattern Recognition Association of South Africa and Mechatronics International Conference, 2017)
Incorporating pronunciation variants in a dictionary
is controversial, as this can be either advantageous or
detrimental for a speech recognition system. Grapheme-to-phoneme
(G2P) accuracy can help guide this decision, ...