Now showing items 1-4 of 4

    • Collecting and evaluating speech recognition corpora for 11 South African languages 

      Badenhorst, Jaco; Van Heerden, Charl; Barnard, Etienne; Davel, Marelie H. (Springer, 2011)
      We describe the Lwazi corpus for automatic speech recognition (ASR), a new telephone speech corpus which contains data from the eleven official languages of South Africa. Because of practical constraints, the amount of ...
    • Improved transition models for cepstral trajectories 

      Badenhorst, Jaco; Barnard, Etienne; Davel, Marelie H. (Pattern recognition association of South Africa (PRASA), 2012)
      We improve on a piece-wise linear model of the trajectories of Mel Frequency Cepstral Coefficients, which are commonly used as features in Automatic Speech Recognition. For this purpose, we have created a very clean ...
    • A smartphone-based ASR data collection tool for under-resourced languages 

      De Vries, Nic J.; Badenhorst, Jaco; Basson, Willem D.; De Wet, Febe; Barnard, Etienne; De Waal, Alta; Davel, Marelie H. (Elsevier, 2014)
      Acoustic data collection for automatic speech recognition (ASR) purposes is a particularly challenging task when working with under-resourced languages, many of which are found in the developing world. We provide a brief ...
    • Synthetic triphones from trajectory-based feature distributions 

      Badenhorst, Jaco; Davel, Marelie H. (Pattern Recognition Association of South Africa and Mechatronics International Conference, 2015)
      We experiment with a new method to create synthetic models of rare and unseen triphones in order to supplement limited automatic speech recognition (ASR) training data. A trajectory model is used to characterise seen ...