Transformation of LF parameters for speech synthesis of emotion: regression trees

GOBL, CHRISTER; YANUSHEVSKAYA, IRENA

This item is covered by a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 Internationa. Click to find out more

File Type:

PDF

Item Type:

Conference Paper

Date:

2008

Author:

GOBL, CHRISTER

YANUSHEVSKAYA, IRENA

Citation:

Tooher, M., Yanushevskaya, I. and Gobl, C., Transformation of LF parameters for speech synthesis of emotion: regression trees, Proceedings of the 4th International Conference on Speech Prosody, International Conference on Speech Prosody, Campinas, Brazil, ISCA, 2008, 705 - 708

Download Item:

(Published (publisher's copy) - Peer Reviewed) 184.5Kb

Abstract:

This paper outlines an approach to modelling the dynamics of voice source parameters as observed in the analysis of emotional portrayals, by a male speaker of Hiberno-English. The emotions portrayed were happy, angry, sad, bored, and surprised, as well as neutral. The voice source parameters extracted from emotionally coloured repetitions of a short utterance ? by means of inverse filtering followed by source model matching ? were modelled using classification and regression trees. Regression trees were built using the voice source parameters of the neutral repetition of the same short utterance, in order to transform the voice source parameters from neutral to one of the five emotions. Re-synthesis of emotion-portraying utterances using transformed voice source parameter dynamics resulted in synthesised utterances which were confirmed by listening tests to represent the targeted emotion categories. The results suggest that the addition of dynamic voice source information in parametric synthesis of emotion will improve the quality of emotion synthesis.

URI:

http://hdl.handle.net/2262/39515

Author's Homepage:

http://people.tcd.ie/cegobl
http://people.tcd.ie/yanushi

Description:

PUBLISHED
Campinas, Brazil

Author: GOBL, CHRISTER; YANUSHEVSKAYA, IRENA

Other Titles:

Proceedings of the 4th International Conference on Speech Prosody
International Conference on Speech Prosody

Publisher:

ISCA

Type of material:

Conference Paper

URI:

http://hdl.handle.net/2262/39515

Collections

Availability:

Full text available

Keywords:

Computer science

Metadata

Show full item record

Licences:

Original License

Browse

My Account