Presentation 2008-12-10
Segmentation of Spoken Language into unit of Utterance Fragment using Acoustics Features
Katsuyoshi SETOYAMA, Hideki KASHIOKA, Nick CAMPBELL,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) It is common for speech synthesis technology to process each sentence as one single and independent unit. However, in human speech production, it is perhaps unusual to process a long utterance as a single discrete unit, and typically a series of short utterance fragments is produced in such cases. Such a fragmentary short utterance is assumed to be a minimal discourse unit, and it is proposed here that similar chunks should be used as the basic units for speech synthesis in order to speed-up the calculation processing. In this paper, the acoustic features of such utterance fragments is modeled by HMM, and the paper reports on the result of an experimental the segmentation of a natural speech corpus into optimal units for processing as utterance fragments according to the the model. The ESP-C casual conversation speech corpus was used as material for the experiment.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Utterance fragments / Dialogue Corpus / Spontaneous Speech Synthesis / Acoustics features / Speech Segmentation
Paper # NLC2008-35,SP2008-90
Date of Issue

Conference Information
Committee NLC
Conference Date 2008/12/2(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Segmentation of Spoken Language into unit of Utterance Fragment using Acoustics Features
Sub Title (in English)
Keyword(1) Utterance fragments
Keyword(2) Dialogue Corpus
Keyword(3) Spontaneous Speech Synthesis
Keyword(4) Acoustics features
Keyword(5) Speech Segmentation
1st Author's Name Katsuyoshi SETOYAMA
1st Author's Affiliation Nara Institute of Science and Technology()
2nd Author's Name Hideki KASHIOKA
2nd Author's Affiliation Nara Institute of Science and Technology:National Institute of Information and Communications Technology:Advanced Telecommunications Research Institute International
3rd Author's Name Nick CAMPBELL
3rd Author's Affiliation Nara Institute of Science and Technology:National Institute of Information and Communications Technology:Advanced Telecommunications Research Institute International
Date 2008-12-10
Paper # NLC2008-35,SP2008-90
Volume (vol) vol.108
Number (no) 337
Page pp.pp.-
#Pages 6
Date of Issue