音声言語運用が要求する認知的能力と音声言語工学が構築した計算論的能力

峯松,信明

お知らせ 2023年度・2024年度学生員会費割引キャンペーン実施中です
お知らせ技術研究報告と和文論文誌Ｃの同時投稿施策(掲載料1割引き)について
お知らせ電子情報通信学会における研究会開催について
お知らせ NEW 参加費の返金について

電子情報通信学会研究会発表申込システム
講演論文詳細

技報閲覧サービス
[ログイン]
技報アーカイブ

トップに戻る

前のページに戻る

[Japanese] / [English]

講演抄録／キーワード
講演名		2008-12-09 13:10 ［招待講演］音声言語運用が要求する認知的能力と音声言語工学が構築した計算論的能力 ○峯松信明（東大） NLC2008-29 SP2008-84
抄録	（和）	波形素片やスペクトル素片をテンプレートとして保有し，音響照合・音声生成を行なう方法論から，HMMやGMMに代表される数理統計的な音響モデリング技術の台頭によって，音声認識・合成技術の性能・柔軟性は著しく向上した。しかし音声言語工学が構築して来た計算論的能力と，音声言語運用に関する人間の認知的能力には大きな差異があることは否めない。音声認識では，多様な話者性に対処するために数千・万の話者を使って音響モデルを構築するが（かつ，適応技術を用いたモデル補正が必要となる），幼児の音声言語獲得過程を考えると，聴取する声の多くは母親，父親，自身の声であり，非常に限られた話者性の音声である。音声合成に目を向ければ，合成器が生成するのは学習話者の声である。しかし父親の太い声を模倣する幼児はいない。親の声の物真似を通して音声言語を獲得した事例は存在しない。音声言語工学は音声言語に基づくマン・マシンインターフェイスの構築を目的としているが，上記した差異は技術的未熟さから来るのだろうか？それとも，技術を構築する研究者の人間理解の未熟さから来るのだろうか？本稿では，音声言語の獲得に難を示す重度自閉症者や，音声言語を獲得しない霊長類などの情報処理能力を参照しつつ，音声言語運用が要求する認知的能力と音声言語工学が構築した計算論的能力について考察する。
	（英）	The performance and flexibility of speech recognition and synthesis technologies have been remarkably enhanced by introducing statistical modeling of speech sounds, before which, waveform-based or spectrum-based templates were stored and used for acoustic matching and speech generation. However, a large gap cannot be denied yet between the computational competence realized by spoken language engineering and the cognitive and language competence of humans. Acoustic models used for speech recognition are often built after collecting utterances of thousands of speakers. Human infants, however, acquire spoken language by hearing a remarkably speaker-biased speech corpus; mother, father, and themselves. In speech synthesis, a synthesizer generates speech sounds of the speaker used to train the synthesizer. No infant, however, impersonates its parents to acquire spoken language. Although the aim of spoken language engineering is to realize the man-machine interface based on spoken language, a large gap still exists between humans and machines. The author wonders whether this gap is due to the technical immatureness or due to researchers' immatureness of understanding humans. Considering the information processing of severely damaged autistics, for whom spoken language is very difficult to acquire, and that of the other primates than humans, who cannot acquire spoken language, this paper discusses the cognitive competence required for spoken language performance and the computational competence realized by spoken language engineering.
キーワード	（和）	音声言語運用 / 音声言語工学 / 多様性と不変性 / 相対音感 / 自閉症 / 霊長類 / 進化 / 構造音韻論
	（英）	spoken language performance / spoken language engineering / variability and invariance / relative sense of tone / autistics / primates / evolution / structural phonology
文献情報		信学技報, vol. 108, no. 338, SP2008-84, pp. 31-36, 2008年12月.
資料番号		SP2008-84
発行日		2008-12-02 (NLC, SP)
ISSN		Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380
著作権について		技術研究報告に掲載された論文の著作権は電子情報通信学会に帰属します．(許諾番号：10GA0019/12GB0052/13GB0056/17GB0034/18GB0034)
PDFダウンロード		NLC2008-29 SP2008-84

研究会情報
研究会	SP NLC
開催期間	2008-12-09 - 2008-12-10
開催地（和）	早稲田大学（小野記念講堂）
開催地（英）	Waseda Univ.
テーマ（和）	第10回音声言語シンポジウム（SP/NLC/SLP共催）
テーマ（英）
講演論文情報の詳細
申込み研究会	SP
会議コード	2008-12-SP-NLC
本文の言語	日本語
タイトル（和）	音声言語運用が要求する認知的能力と音声言語工学が構築した計算論的能力
サブタイトル（和）
タイトル（英）	Cognitive competence required for spoken language performance and computational competence realized by spoken language engineering
サブタイトル（英）
キーワード(1)（和/英）	音声言語運用 / spoken language performance
キーワード(2)（和/英）	音声言語工学 / spoken language engineering
キーワード(3)（和/英）	多様性と不変性 / variability and invariance
キーワード(4)（和/英）	相対音感 / relative sense of tone
キーワード(5)（和/英）	自閉症 / autistics
キーワード(6)（和/英）	霊長類 / primates
キーワード(7)（和/英）	進化 / evolution
キーワード(8)（和/英）	構造音韻論 / structural phonology
第1著者氏名（和/英/ヨミ）	峯松信明 / Nobuaki Minematsu / ミネマツノブアキ
第1著者所属（和/英）	東京大学 (略称：東大) The University of Tokyo (略称： Univ. of Tokyo)
第2著者氏名（和/英/ヨミ）	/ /
第2著者所属（和/英）	(略称： ) (略称： )
第3著者氏名（和/英/ヨミ）	/ /
第3著者所属（和/英）	(略称： ) (略称： )
第4著者氏名（和/英/ヨミ）	/ /
第4著者所属（和/英）	(略称： ) (略称： )
第5著者氏名（和/英/ヨミ）	/ /
第5著者所属（和/英）	(略称： ) (略称： )
第6著者氏名（和/英/ヨミ）	/ /
第6著者所属（和/英）	(略称： ) (略称： )
第7著者氏名（和/英/ヨミ）	/ /
第7著者所属（和/英）	(略称： ) (略称： )
第8著者氏名（和/英/ヨミ）	/ /
第8著者所属（和/英）	(略称： ) (略称： )
第9著者氏名（和/英/ヨミ）	/ /
第9著者所属（和/英）	(略称： ) (略称： )
第10著者氏名（和/英/ヨミ）	/ /
第10著者所属（和/英）	(略称： ) (略称： )
第11著者氏名（和/英/ヨミ）	/ /
第11著者所属（和/英）	(略称： ) (略称： )
第12著者氏名（和/英/ヨミ）	/ /
第12著者所属（和/英）	(略称： ) (略称： )
第13著者氏名（和/英/ヨミ）	/ /
第13著者所属（和/英）	(略称： ) (略称： )
第14著者氏名（和/英/ヨミ）	/ /
第14著者所属（和/英）	(略称： ) (略称： )
第15著者氏名（和/英/ヨミ）	/ /
第15著者所属（和/英）	(略称： ) (略称： )
第16著者氏名（和/英/ヨミ）	/ /
第16著者所属（和/英）	(略称： ) (略称： )
第17著者氏名（和/英/ヨミ）	/ /
第17著者所属（和/英）	(略称： ) (略称： )
第18著者氏名（和/英/ヨミ）	/ /
第18著者所属（和/英）	(略称： ) (略称： )
第19著者氏名（和/英/ヨミ）	/ /
第19著者所属（和/英）	(略称： ) (略称： )
第20著者氏名（和/英/ヨミ）	/ /
第20著者所属（和/英）	(略称： ) (略称： )
講演者	第1著者
発表日時	2008-12-09 13:10:00
発表時間	50分
申込先研究会	SP
資料番号	NLC2008-29, SP2008-84
巻番号（vol）	vol.108
号番号（no）	no.337(NLC), no.338(SP)
ページ範囲	pp.31-36
ページ数	6
発行日	2008-12-02 (NLC, SP)

[研究会発表申込システムのトップページに戻る]

[電子情報通信学会ホームページ]

IEICE / 電子情報通信学会