タスク依存音響モデルのための発話レベルでの選択学習法(Session-8 ポスターセッション: 一般, 第7回音声言語シンポジウム)

ツィンツァレク トビアス; 戸田 智基; 猿渡 洋; 鹿野 清宏

講演名	2005/12/15 タスク依存音響モデルのための発話レベルでの選択学習法(Session-8 ポスターセッション: 一般, 第7回音声言語シンポジウム) ツィンツァレクトビアス, 戸田智基, 猿渡洋, 鹿野清宏,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	高性能音響モデルを構築するために、音声データが大量に必要である。音響モデルの認識性能が対象タスクによるので、タスク別に音響モデルを準備する必要がある。しかし、音声データの収集と書き起こしにおけるコストが膨大であり、任意タスクのために十分の音声データを用意するのが困難である。本稿では、コスト削減を目的にした発話単位の選択学習法を検討する。提案手法は、既存の音声データベースを利用し、対象タスク用の開発データに対する尤度が上昇するように、学習発話を選択する。十分統計量を用いることで、尤度計算は高速に可能である。評価実験において、小学生の音声データで幼児モデル、大人の音声データで高齢者モデルを構築する選択学習を適用した。選択学習は10発話程度の開発データの場合にも有効であった。又、選別した発話で再学習した音響モデルの認識性能は、開発データに基づくMAPとMLLR適応で得られたモデルより優位であった。
抄録(英)	Large amounts of speech data are necessary to construct high performance acoustic models. Since speech recognition performance is task-dependent and the effort and costs for speech data collection and transcription are very high, it is infeasable to prepare enough data for every new application which makes use of speech recognition technology. In this paper an algorithm for utterance-based selective training is proposed, which enables the automatic and cost-effective construction of task-dependent acoustic models. Training utterances are selected from existing speech data resources so that the likelihood of an independent development data set is maximized. Fast calculation of the likelihood is possible with sufficient statistics. The algorithm is evaluated for constructing an infant-dependent model with speech from elementary school children and an elderly-dependent model with adult speech data. Selective training is already effective with only ten development utterances. Furthermore, a higher word accuracy than with the standard adaptation methods MAP and MLLR was achieved.
キーワード(和)	音響モデル / タスク依存性 / コスト削減 / 選択学習 / 十分統計量
キーワード(英)	Acoustic Modeling / Task-Dependency / Cost Reduction / Selective Training / Sufficient Statistics
資料番号	NLC2005-102,SP2005-135
発行日

研究会情報
研究会	NLC
開催期間	2005/12/15(から1日開催)
開催地（和）
開催地（英）
テーマ（和）
テーマ（英）
委員長氏名（和）
委員長氏名（英）
副委員長氏名（和）
副委員長氏名（英）
幹事氏名（和）
幹事氏名（英）
幹事補佐氏名（和）
幹事補佐氏名（英）

講演論文情報詳細
申込み研究会	Natural Language Understanding and Models of Communication (NLC)
本文の言語	JPN
タイトル（和）	タスク依存音響モデルのための発話レベルでの選択学習法(Session-8 ポスターセッション: 一般, 第7回音声言語シンポジウム)
サブタイトル（和）
タイトル（英）	Utterance-based Selective Training for Task-Dependent Acoustic Modeling
サブタイトル（和）
キーワード(1)（和/英）	音響モデル / Acoustic Modeling
キーワード(2)（和/英）	タスク依存性 / Task-Dependency
キーワード(3)（和/英）	コスト削減 / Cost Reduction
キーワード(4)（和/英）	選択学習 / Selective Training
キーワード(5)（和/英）	十分統計量 / Sufficient Statistics
第 1 著者氏名（和/英）	ツィンツァレクトビアス / Tobias CINCAREK
第 1 著者所属（和/英）	奈良先端科学技術大学院大学情報科学研究科音情報処理学講座 Nara Institute of Science and Technology
第 2 著者氏名（和/英）	戸田智基 / Tomoki TODA
第 2 著者所属（和/英）	奈良先端科学技術大学院大学情報科学研究科音情報処理学講座 Nara Institute of Science and Technology
第 3 著者氏名（和/英）	猿渡洋 / Hiroshi SARUWATARI
第 3 著者所属（和/英）	奈良先端科学技術大学院大学情報科学研究科音情報処理学講座 Nara Institute of Science and Technology
第 4 著者氏名（和/英）	鹿野清宏 / Kiyohiro SHIKANO
第 4 著者所属（和/英）	奈良先端科学技術大学院大学情報科学研究科音情報処理学講座 Nara Institute of Science and Technology
発表年月日	2005/12/15
資料番号	NLC2005-102,SP2005-135
巻番号（vol）	vol.105
号番号（no）	494
ページ範囲	pp.-
ページ数	6
発行日