
    The NCHLT Speech Corpus of the South African languages

    barnard-2014-speech-corpus (652.2Kb)
    Date
    2014
    Author
    Barnard, Etienne
    Davel, Marelie H.
    van Heerden, Charl
    De Wet, Febe
    Badenhorst, Jaco
    Abstract
    The NCHLT speech corpus contains wide-band speech from approximately 200 speakers per language, in each of the eleven official languages of South Africa. We describe the design and development processes that were undertaken in order to develop the corpus, and report on associated materials such as orthographic transcriptions and pronunciation dictionaries that were released as part of the corpus. In order to benchmark speech recognition performance on the corpus, we have also developed both phone-recognition and word-recognition systems for all eleven languages; we find that high accuracies can be achieved for these speaker-independent but vocabulary-dependent recognition tasks in all languages.
    URI
    https://researchspace.csir.co.za/dspace/handle/10204/7549
    http://mica.edu.vn/sltu2014/proceedings/28.pdf
    http://hdl.handle.net/10394/26493
    Collections
    • Faculty of Engineering [1136]
    • Faculty of Natural and Agricultural Sciences [4855]

    Copyright © North-West University