    Semi-supervised training for lecture transcription in resource-scarce environments

    Date
    2014
    Author
    De Villiers, Pieter
    Barnard, Etienne
    Van Heerden, Charl
    Abstract
    We present a study where standard semi-supervised training methods are applied in a resource-scarce environment to build lecture transcription systems. Experiments are conducted on two different corpora which one can expect to be available in resource-scarce environments. These include 1) speaker- and domain-specific data where a single South African English lecturer presents the “Operating Systems” course, and 2) Afrikaans speaker-independent and domain non-specific data collected from science and law courses. Different amounts of acoustic and language model data are used for training the respective models. We find that lecture transcription systems in resource-scarce environments can benefit substantially from semi-supervised training methods. We also describe a small, new corpus of spoken lectures which is freely available in the public domain.
    URI
    http://hdl.handle.net/10394/17314
    Collections
    • Faculty of Natural and Agricultural Sciences [4855]

    Copyright © North-West University