Search
Now showing items 1-8 of 8
A discourse model of affect for text-to-speech synthesis
(Pattern recognition association of South Africa (PRASA), 2013)
This paper introduces a model of affect to improve prosody in text-to-speech synthesis. It operates on the discourse level of text to predict the underlying linguistic factors that contribute towards emotional appraisal, ...
Towards lecture transcription in resource-scarce environments
(Pattern recognition association of South Africa (PRASA), 2012)
We present progress towards automated Lecture Transcription (LT) in resource scarce environments. Our development has focused on the transcription of lectures in Afrikaans from two faculties at North-West University. A ...
A distributed approach to speech resource collection
(Pattern recognition association of South Africa (PRASA), 2013)
We describe the integration of several tools to enable the end-to-end development of an Automatic Speech Recognition system in a typical under-resourced language. Google App Engine is employed as the core environment for ...
The semi-automated creation of stratified speech corpora
(Pattern recognition association of South Africa (PRASA), 2013)
Smartphones provide an efficient means for the collection of speech data; however, the quality of the corpora created in this fashion is not predictable. We describe an approach that allows us to post-process and rank ...
Improved transition models for cepstral trajectories
(Pattern recognition association of South Africa (PRASA), 2012)
We improve on a piece-wise linear model of the trajectories of Mel Frequency Cepstral Coefficients, which are commonly used as features in Automatic Speech Recognition. For this purpose, we have created a very clean ...
Spoken language identification system adaptation in under-resourced environments
(Pattern recognition association of South Africa (PRASA), 2013)
Speech technologies have matured over the past few decades and have made significant impacts in a variety of fields, from assistive technologies to personal assistants. However, speech system development is a resource ...
Kernel bandwidth estimation for non-parametric density estimation: a comparative study
(Pattern recognition association of South Africa (PRASA), 2013)
We investigate the performance of conventional bandwidth estimators for non- parametric kernel density estimation on a number of representative pattern-recognition tasks, to gain a better understanding of the behaviour of ...
Cross-bandwidth adaptation for ASR systems
(Pattern recognition association of South Africa (PRASA), 2013)
Mismatches between application and training data greatly reduce the performance of automatic speech recognition (ASR) systems. However, collecting suitable amounts of in-domain and application-specific data for training ...