Search
Now showing items 1-2 of 2
Collecting and evaluating speech recognition corpora for 11 South African languages
(Springer, 2011)
We describe the Lwazi corpus for automatic speech recognition (ASR), a new telephone speech corpus which contains data from the eleven official languages of South Africa. Because of practical constraints, the amount of ...
Efficient harvesting of Internet audio for resource-scarce ASR
(Interspeech 2011, 2011)
Spoken recordings that have been transcribed for human reading
(e.g. as captions for audiovisual material, or to provide alternative
modes of access to recordings) are widely available in many
languages. Such recordings ...