Generating fundamental frequency contours for speech synthesis in Yorùbá

Van Niekerk, Daniel R.; Barnard, Etienne

Date

2013

Authors

Van Niekerk, Daniel R.

Barnard, Etienne

Researcher ID

21022658 - Van Niekerk, Daniël Rudolph
21021287 - Barnard, Etienne

Publisher

International Speech Communication Association (ISCA)

Abstract

We present methods for modelling and synthesising fundamental frequency (F0) contours suitable for application in text-to-speech (TTS) synthesis of Yorùbá (an African tone language). These methods are discussed and compared with a baseline approach using the HMM-based speech synthesis system HTS. Evaluation is done by comparing ten-fold cross validation squared errors on a small corpus of four speakers. We show that the proposed methods are relatively effective at modelling and generating F0 contours in this context, achieving lower error rates than the baseline. These results suggest that our methods will be useful for the generation of improved synthesis of tone in African languages, which has been a challenge to date.

Description

Interspeech, Lyon, France, 25-29 August 2013

Keywords

Speech synthesis, Text-to-speech, Fundamental frequency, Tone language, Under-resourced, Yoruba

Citation

Van Niekerk, D.R. & Barnard, E. 2013. Generating fundamental frequency contours for speech synthesis in Yorùbá. In: Interspeech, Lyon, France, 25-29 August 2013.

URI

http://hdl.handle.net/10394/13497

Collections

Faculty of Engineering
Conference Papers - Vaal Triangle Campus
Faculty of Natural and Agricultural Sciences
