Show simple item record

dc.contributor.authorBarnard, Etienne
dc.contributor.authorVan Heerden, Charl
dc.contributor.authorHartmann, William
dc.contributor.authorKarakos, Damianos
dc.contributor.authorSchwartz, Richard
dc.contributor.authorTsakalidis, Stavros
dc.contributor.authorDavel, Marelie H.
dc.date.accessioned2018-03-02T12:50:32Z
dc.date.available2018-03-02T12:50:32Z
dc.date.issued2015
dc.identifier.citationMarelie Davel, Damianos Karakos, Etienne Barnard, Charl van Heerden, Richard Schwartz and Stavros Tsakalidis, William Hartmann, “Exploring minimal pronunciation modeling for low resource languages”, in Proc. Interspeech, pp 538-542, Dresden, Germany, 2015. [http://engineering.nwu.ac.za/multilingual-speech-technologies-must/publications]en_US
dc.identifier.isbn978-1-61499-700-9
dc.identifier.urihttps://books.google.co.za/books?id=-RGhDQAAQBAJ&pg=PA44&lpg=PA44&dq=Exploring+minimal+pronunciation+modeling+for+low+resource+languages&source=bl&ots=wAYDYAm_Ju&sig=ha5BMCtwoEBjHQTAkyauz2wSSEc&hl=en&sa=X&ved=0ahUKEwjFwPDv1M3ZAhUlKsAKHXrICPkQ6AEIODAC#v=onepage&q=Exploring%20minimal%20pronunciation%20modeling%20for%20low%20resource%20languages&f=false
dc.identifier.urihttps://www.lti.cs.cmu.edu/sites/default/files/sitaram%2C%20sunayana.pdf
dc.identifier.urihttp://hdl.handle.net/10394/26488
dc.description.abstractPronunciation lexicons can range from fully graphemic (modeling each word using the orthography directly) to fully phonemic (first mapping each word to a phoneme string). Between these two options lies a continuum of modeling options. We analyze techniques that can improve the accuracy of a graphemic system without requiring significant effort to design or implement. The analysis is performed in the context of the IARPA Babel project, which aims to develop spoken term detection systems for previously unseen languages rapidly, and with minimal human effort. We consider techniques related to letter-to-sound mapping and language-independent syllabification of primarily graphemic systems, and discuss results obtained for six languages: Cebuano, Kazakh, Kurmanji Kurdish, Lithuanian, Telugu and Tok Pisin.en_US
dc.description.sponsorshipThis work was supported by the Intelligence Advanced Research Projects Activity (IARPA) via Department of Defense U.S. Army Research Laboratory contract number W911NF-12- C-0013. The U.S. Government is authorized to reproduce and distribute reprints for Governmental purposes notwithstanding any copyright annotation thereon. Disclaimer: The views and conclusions contained herein are those of the authors and should not be interpreted as necessarily representing the official policies or endorsements, either express or implied, of IARPA, DoD/ARL, or the U.S. Government.en_US
dc.language.isoenen_US
dc.publisherIOS Press Incen_US
dc.subjectSpoken term detectionen_US
dc.subjectGraphemic systemsen_US
dc.subjectPronunciation lexiconsen_US
dc.titleExploring minimal pronunciation modeling for low resource languagesen_US
dc.typePresentationen_US
dc.contributor.researchID23607955 - Davel, Marelie Hattingh
dc.contributor.researchID21021287 - Barnard, Etienne


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record