画像入力マイクロフォンの子音への拡張に関する一検討

長谷川 孝明; 田中 宏明

講演名	1995/6/9 画像入力マイクロフォンの子音への拡張に関する一検討長谷川孝明, 田中宏明,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	筆者らがすでに提案した口唇画像から音声信号へのメディア変換による意思伝達システム「画像入力マイクロフォン」は,音響的に劣悪な環境下でのマイクロフォンとしての使用,発声不能者の代替発声システム,発声不要な秘話通信への応用を目指しているが,子音への対応がほとんどできていなかった.本稿では,実際のコミュニケーションで不可欠となる子音への対応のためのいくつかの検討を行っている.子音の速く微妙な動きに追従するための画像処理法の改善,前後フレームの相関の利用による声道断面積関数の推定精度の向上を通し,有声破裂音に対応したシステムの構成,評価を行っている.本システムにより合成された子音を含む音声の識別率は,従来システムと比較して約2倍に向上していることから,本システムは有声破裂音および母音に対応可能であることが明らかとされ,これにより本手法の一般的な子音への拡張の可能性,さらにコミュニケーションシステムとしての実現の可能性が示唆されている.
抄録(英)	Previously we proposed a new speech communication system "The Image Input Microphone" to convert oral motion images into speech using estimation of a vocal tract transfer function from oral images. This system, which requires no actual utterance, has high security, robustness to acoustic noise and the use as a speaking-aid system for its object. However this system has not performed well for consonants, because lips move quickly in utterances of consonants. This paper describes an improved system that has an ability of more precise estimation using improved image processing and a correlation between frames. We examine speech sounds including voiced plosive consonants. The recognition rate of synthesized speech by the proposed system improves about twice as good as by the ordinary system. It is shown that the proposed system has potential to synthesize speech including consonants and to use as a communication system.
キーワード(和)	CCDカメラマイクロフォン / 画像入力マイクロフォン / 読唇 / 音声合成
キーワード(英)	CCDcamera Microphone / Image Input Microphone / Lipreading / speech synthesis
資料番号
発行日

研究会情報
研究会	HCS
開催期間	1995/6/9(から1日開催)
開催地（和）
開催地（英）
テーマ（和）
テーマ（英）
委員長氏名（和）
委員長氏名（英）
副委員長氏名（和）
副委員長氏名（英）
幹事氏名（和）
幹事氏名（英）
幹事補佐氏名（和）
幹事補佐氏名（英）

講演論文情報詳細
申込み研究会	Human Communication Science (HCS)
本文の言語	JPN
タイトル（和）	画像入力マイクロフォンの子音への拡張に関する一検討
サブタイトル（和）
タイトル（英）	A Study on The Image Input Microphone Applicable to Consonants
サブタイトル（和）
キーワード(1)（和/英）	CCDカメラマイクロフォン / CCDcamera Microphone
キーワード(2)（和/英）	画像入力マイクロフォン / Image Input Microphone
キーワード(3)（和/英）	読唇 / Lipreading
キーワード(4)（和/英）	音声合成 / speech synthesis
第 1 著者氏名（和/英）	長谷川孝明 / Takaaki Hasegawa
第 1 著者所属（和/英）	埼玉大学工学部電気電子システム工学科 Dept. of Electrical and Electronic System Eng., Saitama University
第 2 著者氏名（和/英）	田中宏明 / Hiroaki Tanaka
第 2 著者所属（和/英）	埼玉大学工学部電気電子システム工学科 Dept. of Electrical and Electronic System Eng., Saitama University
発表年月日	1995/6/9
資料番号
巻番号（vol）	vol.95
号番号（no）	88
ページ範囲	pp.-
ページ数	6
発行日