Transformation of LF parameters for speech synthesis of emotion: regression trees
Citation:
Tooher, M., Yanushevskaya, I. and Gobl, C., Transformation of LF parameters for speech synthesis of emotion: regression trees, Proceedings of the 4th International Conference on Speech Prosody, International Conference on Speech Prosody, Campinas, Brazil, ISCA, 2008, 705 - 708Download Item:
Abstract:
This paper outlines an approach to modelling the dynamics of
voice source parameters as observed in the analysis of
emotional portrayals, by a male speaker of Hiberno-English.
The emotions portrayed were happy, angry, sad, bored, and
surprised, as well as neutral. The voice source parameters
extracted from emotionally coloured repetitions of a short
utterance ? by means of inverse filtering followed by source
model matching ? were modelled using classification and
regression trees. Regression trees were built using the voice
source parameters of the neutral repetition of the same short
utterance, in order to transform the voice source parameters
from neutral to one of the five emotions. Re-synthesis of
emotion-portraying utterances using transformed voice source
parameter dynamics resulted in synthesised utterances which
were confirmed by listening tests to represent the targeted
emotion categories. The results suggest that the addition of
dynamic voice source information in parametric synthesis of
emotion will improve the quality of emotion synthesis.
Author's Homepage:
http://people.tcd.ie/cegoblhttp://people.tcd.ie/yanushi
Description:
PUBLISHEDCampinas, Brazil
Author: GOBL, CHRISTER; YANUSHEVSKAYA, IRENA
Other Titles:
Proceedings of the 4th International Conference on Speech ProsodyInternational Conference on Speech Prosody
Publisher:
ISCAType of material:
Conference PaperAvailability:
Full text availableKeywords:
Computer scienceMetadata
Show full item recordLicences: