精度の異なる分布の混合による頑健な音響モデル

高野 優

講演名	1998/12/11 精度の異なる分布の混合による頑健な音響モデル高野優,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	本稿では、自由発話の発声変形によって生ずる精細な音素カテゴリ間の混同を防ぎ、なおかつ周辺音素環境による発声変形に関する音素環境依存モデルの利点を保持するモデルとして、精度の低い分布を混入した音響モデルの使用を提案する。本モデルではHMMの各状態を表現する分布として、通常の音素環境依存モデルに使用する精細なモデルから得られる分布に音素環境非依存モデル等の粗いモデルから得られる分布を加えた混合分布を使用する。粗いモデルを併用することで、自由発話の発生変形によって生ずる、精細なモデルに適合しない音声の吸収を図る。本モデルを用いた、ホテル予約タスク自由発話認識実験では、同分布数の音素環境依存モデルに比べて一割程度少ない誤認識率を示すことを確認した。
抄録(英)	In this report, new acoustic models made by mixing probabilistic distributions with various precision are proposed. The proposed models can prevent phonetic confusion among precise phonetic models caused by acoustic variations in spontaneous speech, while keeping the advantage of precise model that can deal with acoustic variations caused by phonetic environment. Distributions in the proposed models are taken from both precise cotext-dependent(CD)model and another rough model, such as context-independent model. By using distributions from rough model, a model can match acoustic features which don't fit the precise model because of variations due to spontaneity. Experiments on spontaneous speech for hotel reservation task indicated that some of the proposed models can reduce error rate with CD model by around 1/10.
キーワード(和)	自由発話音声 / 音素環境依存モデル / 音素環境非依存モデル / 頑健性 / 混合分布 / 精度 / 音声認識
キーワード(英)	spontaneous speech / context-dependent model / context-independent model / robustness / continuous mixture density / precision / speech recognition
資料番号	NLC98-51,SP98-115
発行日

研究会情報
研究会	NLC
開催期間	1998/12/11(から1日開催)
開催地（和）
開催地（英）
テーマ（和）
テーマ（英）
委員長氏名（和）
委員長氏名（英）
副委員長氏名（和）
副委員長氏名（英）
幹事氏名（和）
幹事氏名（英）
幹事補佐氏名（和）
幹事補佐氏名（英）

講演論文情報詳細
申込み研究会	Natural Language Understanding and Models of Communication (NLC)
本文の言語	JPN
タイトル（和）	精度の異なる分布の混合による頑健な音響モデル
サブタイトル（和）
タイトル（英）	Robust acoustic models by mixing distributions with various precision
サブタイトル（和）
キーワード(1)（和/英）	自由発話音声 / spontaneous speech
キーワード(2)（和/英）	音素環境依存モデル / context-dependent model
キーワード(3)（和/英）	音素環境非依存モデル / context-independent model
キーワード(4)（和/英）	頑健性 / robustness
キーワード(5)（和/英）	混合分布 / continuous mixture density
キーワード(6)（和/英）	精度 / precision
キーワード(7)（和/英）	音声認識 / speech recognition
第 1 著者氏名（和/英）	高野優 / Masaru Takano
第 1 著者所属（和/英）	ATR音声翻訳通信研究所 ATR Interpreting Telecommunications Research Laboratories
発表年月日	1998/12/11
資料番号	NLC98-51,SP98-115
巻番号（vol）	vol.98
号番号（no）	461
ページ範囲	pp.-
ページ数	8
発行日