NWU Institutional Repository

Towards lecture transcription in resource-scarce environments

dc.contributor.authorDe Villiers, Pieter
dc.contributor.authorJooste, Petri
dc.contributor.authorVan Heerden, Carel J.
dc.contributor.authorBarnard, Etienne
dc.contributor.researchID21281858 - De Villiers, Pieter Theunis
dc.contributor.researchID10080694 - Jooste, Josef Petrus
dc.contributor.researchID11539151 - Van Heerden, Carel Jacobus
dc.contributor.researchID21021287 - Barnard, Etienne
dc.date.accessioned2014-11-04T05:43:19Z
dc.date.available2014-11-04T05:43:19Z
dc.date.issued2012
dc.description.abstractWe present progress towards automated Lecture Transcription (LT) in resource scarce environments. Our development has focused on the transcription of lectures in Afrikaans from two faculties at North-West University. A bootstrapping procedure is followed to filter and select well-aligned segments of speech. These segments are then used to train acoustic models. Initial work towards language modeling for LT in a resource-scarce environment is also presented; manual lecture transcriptions are combined with text mined from other sources such as study guides to train language models. Interpolation results indicate that study guides are a useful resource for language modeling, whereas general text (obtained from a publisher of Afrikaans books) is less useful in this context. Our findings are confirmed by the reduced word error rates (WERs) obtained from our off-line speech-recognition system for Lecture Transcription.en_US
dc.description.urihttp://www.prasa.org/index.php/2012-03-07-10-55-15
dc.identifier.citationDe Villiers, P.T. et al. 2012. Towards lecture transcription in resource-scarce environments. Proceedings of the Twenty-Third Annual Symposium of the Pattern Recognition Association of South Africa. Pretoria. p.138-143. [http://www.prasa.org/]en_US
dc.identifier.isbn978-0-620-54601-0
dc.identifier.urihttp://hdl.handle.net/10394/12123
dc.language.isoenen_US
dc.publisherPattern recognition association of South Africa (PRASA)en_US
dc.subjectLecture transcriptionen_US
dc.subjectAfrikaansen_US
dc.subjectKaldien_US
dc.subjectDynamic programmingen_US
dc.subjectLanguage modelen_US
dc.subjectResource-scarceen_US
dc.titleTowards lecture transcription in resource-scarce environmentsen_US
dc.typeArticleen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
prasa2012-28.pdf
Size:
72.45 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed upon to submission
Description: