階層型合成音声制御記述言語MSCL : (Multi-layered Speech/Sound Synthesis Control Language)の構想

水野 理; 中嶌 信弥

講演名	1997/5/22 階層型合成音声制御記述言語MSCL : (Multi-layered Speech/Sound Synthesis Control Language)の構想水野理, 中嶌信弥,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	言語外的情報を表現でき,かつ,継続時間長などをテキストベースできめ細かに制御できる階層型合成音声制御記述言語MSCLを提索する. 日常の音声によるコミュニケーションでは,単純に言語情報を相手に伝えるだけでなく,ニュアンス/話者の感情/聞き手に対する要望の度合など種々の言語外情報も重要な役割を果たしている. 合成音声による音声インタフェースに関しては,現在のところ朗続調的音声であり,利用者へのテキストレベル以上の情報の伝達は難しい. 音声合成によって何らかのマルチメデイアコンテンツ作成をおこなう場合などは,単調なものとなりがちで,魅力あるものとはいえない. また,合成音声とアニメーションなどとの組合せ/同期を考えた場合,音声の継続時間長のきめ細かな制御が必要であるが、現行の合成音声制御系では不可能に近い. そこで,本報告では,"協調"や"怒り"といった意味的なレベルから,音韻レベルまで記述できる多階層の言語体系を検討している.
抄録(英)	This paper proposes a new synthetic speech/sound control language, Multi-layered Speech/Sound Synthesis Control Language (MSCL), which enables TTS systems to synthesize several modes of speech. A spoken dialogue communicates not only verbal information but also non-verbal information : nuances, mental states and speaker's attitudes. These are important in passing information effectively. Current text-to-speech (TTS) systems that output recitation voice cannot pass non-verbal information. When multimedia contents are output using, in part, a TTS system, the contents are monotonous and unattractive. Furthermore, a TTS system without speech duration control makes it difficult to synchronize the voice to facial animations. MSCL can express non-verbal information and control prosodic features in detail. MSCL is a multi-layered linguistic system and encompasses both the semantic levellayer and the phonetic level layer.
キーワード(和)	MSCL / 階層構造 / 韻律特性 / 言語外情報
キーワード(英)	MSCL / multi-layer / prosodic features / non-verbal information
資料番号	SP97-4
発行日

研究会情報
研究会	SP
開催期間	1997/5/22(から1日開催)
開催地（和）
開催地（英）
テーマ（和）
テーマ（英）
委員長氏名（和）
委員長氏名（英）
副委員長氏名（和）
副委員長氏名（英）
幹事氏名（和）
幹事氏名（英）
幹事補佐氏名（和）
幹事補佐氏名（英）

講演論文情報詳細
申込み研究会	Speech (SP)
本文の言語	JPN
タイトル（和）	階層型合成音声制御記述言語MSCL : (Multi-layered Speech/Sound Synthesis Control Language)の構想
サブタイトル（和）
タイトル（英）	New Synthetic Speech/Sound Control Language : MSCL
サブタイトル（和）
キーワード(1)（和/英）	MSCL / MSCL
キーワード(2)（和/英）	階層構造 / multi-layer
キーワード(3)（和/英）	韻律特性 / prosodic features
キーワード(4)（和/英）	言語外情報 / non-verbal information
第 1 著者氏名（和/英）	水野理 / Osamu MIZUNO
第 1 著者所属（和/英）	NTTヒューマンインタフェース研究所 NTT Human Interface Labolatories
第 2 著者氏名（和/英）	中嶌信弥 / Shin'ya NAKAJIMA
第 2 著者所属（和/英）	NTTヒューマンインタフェース研究所 NTT Human Interface Labolatories
発表年月日	1997/5/22
資料番号	SP97-4
巻番号（vol）	vol.97
号番号（no）	64
ページ範囲	pp.-
ページ数	6
発行日