NWU Institutional Repository

Synthetic triphones from trajectory-based feature distributions

Loading...
Thumbnail Image

Date

Journal Title

Journal ISSN

Volume Title

Publisher

Pattern Recognition Association of South Africa and Mechatronics International Conference

Abstract

We experiment with a new method to create synthetic models of rare and unseen triphones in order to supplement limited automatic speech recognition (ASR) training data. A trajectory model is used to characterise seen transitions at the spectral level, and these models are then used to create features for unseen or rare triphones. We find that a fairly restricted model (piece-wise linear with three line segments per channel of a diphone transition) is able to represent training data quite accurately. We report on initial results when creating additional triphones for a single-speaker data set, finding small but significant gains, especially when adding additional samples of rare (rather than unseen) triphones.

Description

Citation

Jaco Badenhorst and Marelie Davel, “Synthetic triphones from trajectory-based feature distributions”, in Proc. a22, Port Elizabeth, South Africa, 2015. [http://engineering.nwu.ac.za/multilingual-speech-technologies-must/publications]

Endorsement

Review

Supplemented By

Referenced By