Presentation | 2008-12-10 Segmentation of Spoken Language into unit of Utterance Fragment using Acoustics Features Katsuyoshi SETOYAMA, Hideki KASHIOKA, Nick CAMPBELL |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | It is common for speech synthesis technology to process each sentence as a single, independent unit. However, in human speech production, it is perhaps unusual to process a long utterance as a single discrete unit, and typically a series of short utterance fragments is produced in such cases. Such a fragmentary short utterance is assumed to be a minimal discourse unit, and it is proposed here that similar chunks should be used as the basic units for speech synthesis in order to speed up processing. In this paper, the acoustic features of such utterance fragments are modeled by an HMM, and the paper reports the results of an experimental segmentation of a natural speech corpus into optimal units for processing as utterance fragments according to the model. The ESP-C casual conversation speech corpus was used as material for the experiment. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Utterance fragments / Dialogue Corpus / Spontaneous Speech Synthesis / Acoustics features / Speech Segmentation |
Paper # | NLC2008-35,SP2008-90 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2008/12/2 (1 day) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Segmentation of Spoken Language into unit of Utterance Fragment using Acoustics Features |
Sub Title (in English) | |
Keyword(1) | Utterance fragments |
Keyword(2) | Dialogue Corpus |
Keyword(3) | Spontaneous Speech Synthesis |
Keyword(4) | Acoustics features |
Keyword(5) | Speech Segmentation |
1st Author's Name | Katsuyoshi SETOYAMA |
1st Author's Affiliation | Nara Institute of Science and Technology |
2nd Author's Name | Hideki KASHIOKA |
2nd Author's Affiliation | Nara Institute of Science and Technology:National Institute of Information and Communications Technology:Advanced Telecommunications Research Institute International |
3rd Author's Name | Nick CAMPBELL |
3rd Author's Affiliation | Nara Institute of Science and Technology:National Institute of Information and Communications Technology:Advanced Telecommunications Research Institute International |
Date | 2008-12-10 |
Paper # | NLC2008-35,SP2008-90 |
Volume (vol) | vol.108 |
Number (no) | 337 |
Page | pp.- |
#Pages | 6 |
Date of Issue |