Presentation 2012/12/13
Reactive Control of Expressive Speech Synthesis Using Kinect Skeleton Tracking
ROBERT A.J. CLARK, MAGDALENA ANNA KONKIEWICZ, MARIA ASTRINAK, JUNICHI YAMAGISHI
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Naturally expressive speech is important for an increasing number of real-world speech synthesis applications, including augmentative and alternative communication aids and entertainment-based applications. One of the important challenges facing speech synthesis development today is how to produce reactive expressive speech, that is, speech where various aspects of the way in which it is said can be controlled in real time as the speech is produced. This is a challenge both in terms of the adaptability and latency of speech synthesis systems and in terms of how to provide a control mechanism for different situations. To explore these issues and raise awareness of them, we present a reactive speech synthesiser in which pitch and duration are controlled by hand movement via the skeleton tracking of a Microsoft Kinect sensor. We see that manipulating pitch and duration in real time is possible (and fun), but that it is difficult to produce meaningful expressiveness without an underlying model that allows a high-level representation of expressiveness to be used.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) HMM-based speech synthesis / reactive control / expressive speech synthesis / Kinect / skeleton tracking
Paper # SLP-94
Date of Issue

Conference Information
Committee SP
Conference Date 2012/12/13 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Speech (SP)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Reactive Control of Expressive Speech Synthesis Using Kinect Skeleton Tracking
Sub Title (in English)
Keyword(1) HMM-based speech synthesis
Keyword(2) reactive control
Keyword(3) expressive speech synthesis
Keyword(4) Kinect
Keyword(5) skeleton tracking
1st Author's Name ROBERT A.J. CLARK
1st Author's Affiliation The Centre for Speech Technology Research, University of Edinburgh, Informatics Forum, 10 Crichton Street, Edinburgh EH8 9AB, United Kingdom
2nd Author's Name MAGDALENA ANNA KONKIEWICZ
2nd Author's Affiliation The Centre for Speech Technology Research, University of Edinburgh, Informatics Forum, 10 Crichton Street, Edinburgh EH8 9AB, United Kingdom
3rd Author's Name MARIA ASTRINAK
3rd Author's Affiliation Faculté Polytechnique de Mons (FPMs), Department of Electrical Engineering, TCTS Lab, University of Mons, 31 Boulevard Dolez, B-7000 Mons, Belgium
4th Author's Name JUNICHI YAMAGISHI
4th Author's Affiliation The Centre for Speech Technology Research, University of Edinburgh, Informatics Forum, 10 Crichton Street, Edinburgh EH8 9AB, United Kingdom
Date 2012/12/13
Paper # SLP-94
Volume (vol) vol.112
Number (no) 369
Page pp.-
#Pages 4
Date of Issue