音響的特徴を用いた話し言葉の断片発話単位への分割(音声合成・声質変換,第10回音声言語シンポジウム)

瀬戸山 勝義; 柏岡 秀紀; キャンベル ニック

講演名	2008-12-10 音響的特徴を用いた話し言葉の断片発話単位への分割(音声合成・声質変換,第10回音声言語シンポジウム) 瀬戸山勝義, 柏岡秀紀, キャンベルニック,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	現在までの音声合成技術は文を一単位として処理することが多かった.しかし,実対話において,人間は長い発話文を一度に処理することは稀であり,多くの場合,短い断片的な発話を用いる.このような短い断片的な発話を断片発話とし,音声合成の計算処理単位として用いる事を探案する.本稿では,HMMにより断片発話の音響的特徴をモデル化し,そのモデルを用いた断片発話単位へのセグメンテーション実験を行なった結果を報告する.実験には,トピックフリーの雑談対話音声を収録したESP-Cコーパスを用いた.
抄録(英)	It is common for speech synthesis technology to process each sentence as one single and independent unit. However, in human speech production, it is perhaps unusual to process a long utterance as a single discrete unit, and typically a series of short utterance fragments is produced in such cases. Such a fragmentary short utterance is assumed to be a minimal discourse unit, and it is proposed here that similar chunks should be used as the basic units for speech synthesis in order to speed-up the calculation processing. In this paper, the acoustic features of such utterance fragments is modeled by HMM, and the paper reports on the result of an experimental the segmentation of a natural speech corpus into optimal units for processing as utterance fragments according to the the model. The ESP-C casual conversation speech corpus was used as material for the experiment.
キーワード(和)	断片発話 / 対話コーパス / 話し言葉音声合成 / 音響的特徴 / セグメンテーション
キーワード(英)	Utterance fragments / Dialogue Corpus / Spontaneous Speech Synthesis / Acoustics features / Speech Segmentation
資料番号	NLC2008-35,SP2008-90
発行日

研究会情報
研究会	NLC
開催期間	2008/12/2(から1日開催)
開催地（和）
開催地（英）
テーマ（和）
テーマ（英）
委員長氏名（和）
委員長氏名（英）
副委員長氏名（和）
副委員長氏名（英）
幹事氏名（和）
幹事氏名（英）
幹事補佐氏名（和）
幹事補佐氏名（英）

講演論文情報詳細
申込み研究会	Natural Language Understanding and Models of Communication (NLC)
本文の言語	JPN
タイトル（和）	音響的特徴を用いた話し言葉の断片発話単位への分割(音声合成・声質変換,第10回音声言語シンポジウム)
サブタイトル（和）
タイトル（英）	Segmentation of Spoken Language into unit of Utterance Fragment using Acoustics Features
サブタイトル（和）
キーワード(1)（和/英）	断片発話 / Utterance fragments
キーワード(2)（和/英）	対話コーパス / Dialogue Corpus
キーワード(3)（和/英）	話し言葉音声合成 / Spontaneous Speech Synthesis
キーワード(4)（和/英）	音響的特徴 / Acoustics features
キーワード(5)（和/英）	セグメンテーション / Speech Segmentation
第 1 著者氏名（和/英）	瀬戸山勝義 / Katsuyoshi SETOYAMA
第 1 著者所属（和/英）	奈良先端科学技術大学院大学情報科学研究科 Nara Institute of Science and Technology
第 2 著者氏名（和/英）	柏岡秀紀 / Hideki KASHIOKA
第 2 著者所属（和/英）	奈良先端科学技術大学院大学情報科学研究科:情報通信研究機構知識創成コミュニケーション研究センター:国際電気通信基礎技術研究所音声言語コミュニケーション研究所 Nara Institute of Science and Technology:National Institute of Information and Communications Technology:Advanced Telecommunications Research Institute International
第 3 著者氏名（和/英）	キャンベルニック / Nick CAMPBELL
第 3 著者所属（和/英）	奈良先端科学技術大学院大学情報科学研究科:情報通信研究機構知識創成コミュニケーション研究センター:国際電気通信基礎技術研究所音声言語コミュニケーション研究所 Nara Institute of Science and Technology:National Institute of Information and Communications Technology:Advanced Telecommunications Research Institute International
発表年月日	2008-12-10
資料番号	NLC2008-35,SP2008-90
巻番号（vol）	vol.108
号番号（no）	337
ページ範囲	pp.-
ページ数	6
発行日