Rapid development of TTS corpora for four South African languages
dc.contributor.author | Van Niekerk, Daniel R. | |
dc.contributor.author | van Heerden, Charl | |
dc.contributor.author | Kleynhans, Neil | |
dc.contributor.author | Kjartansson, Oddur | |
dc.contributor.author | Jansche, Martin | |
dc.contributor.author | Ha, Linne | |
dc.contributor.author | Davel, Marelie H. | |
dc.date.accessioned | 2018-02-27T10:31:56Z | |
dc.date.available | 2018-02-27T10:31:56Z | |
dc.date.issued | 2017 | |
dc.description.abstract | This paper describes the development of text-to-speech corpora for four South African languages. The approach investigated the possibility of using low-cost methods, including informal recording environments and untrained volunteer speakers. This objective, and the additional future goal of expanding the corpus to increase coverage of South Africa’s 11 official languages, necessitated experimenting with multi-speaker and code-switched data. The process and relevant observations are detailed throughout. The latest versions of the corpora are available for download under an open-source license and will likely see further development and refinement in future. Index Terms: text-to-speech corpus, under-resourced languages | en_US |
dc.description.sponsorship | Google Inc, Interspeech | en_US |
dc.identifier.citation | Daniel Rudolph van Niekerk, Charl van Heerden, Marelie Davel, Neil Kleynhans, Oddur Kjartansson, Martin Jansche and Linne Ha, “Rapid development of TTS corpora for four South African languages”, in Proc. Interspeech, pp 2178-2182, Stockholm, Sweden, 2017. [http://engineering.nwu.ac.za/multilingual-speech-technologies-must/publications] | |
dc.identifier.uri | http://hdl.handle.net/10394/26444 | |
dc.identifier.uri | http://www.demitasse.co.za/~demitasse/pubs/poster_is2017.pdf | |
dc.language.iso | en | en_US |
dc.publisher | Interspeech 2017 | en_US |
dc.subject | TTS corpora | en_US |
dc.subject | multi-speaker and code-switched data | en_US |
dc.subject | under-resourced languages | en_US |
dc.title | Rapid development of TTS corpora for four South African languages | en_US |
dc.type | Presentation | en_US |
Files
Original bundle
- Name: vanniekerk-2017-rapid-dev-tts-corpora.pdf
- Size: 119.5 KB
- Format: Adobe Portable Document Format
- Description: vanniekerk-2017-rapid-dev-tts-corpora
License bundle
- Name: license.txt
- Size: 1.61 KB
- Format: Item-specific license agreed upon to submission