Yanushevskaya, I., Gobl, C. and Ní Chasaide, A., Voice quality and f0 cues for affect expression: implications for synthesis, Proceedings of the 9th European Conference on Speech Communication and Technology, INTERSPEECH 2005, Lisbon, Sept. 4-8, 2005, 1849-1852
Abstract:
Synthesised stimuli were used to investigate how two notionally
separable dimensions of tone-of-voice – voice quality and
fundamental frequency – are involved in the expression of
affect. Listeners were presented with three series of stimuli:
(1) stimuli exemplifying different voice qualities, (2) stimuli
all with modal voice quality but with different affect-related f0
contours, and (3) stimuli incorporating variation in both voice
quality and affect-related f0 contours. A total of 15 stimuli
were rated for 12 different affective attributes. Voice quality
differentiation appears to account for the highest affect ratings
overall, as indicated by the scores obtained for stimuli series
(1) and (3). The relatively weaker affect signalling of stimuli
differentiated by f0 alone corroborates findings in [2]. It also
suggests that for the generation of expressive, affectively
coloured speech synthesis, it is not sufficient to manipulate
only f0; we also need to capture the voice quality dimension
of the voice source.