連続単語認識における認識結果の逐次早期確定アルゴリズムの評価(認識アルゴリズム,第12回音声言語シンポジウム:情報アクセス,音声・言語処理一般)

大野 博之; 小島 弘; 南角 吉彦; 李 晃伸; 徳田 恵一

講演名	2010-12-21 連続単語認識における認識結果の逐次早期確定アルゴリズムの評価(認識アルゴリズム,第12回音声言語シンポジウム:情報アクセス,音声・言語処理一般) 大野博之, 小島弘, 南角吉彦, 李晃伸, 徳田恵一,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	音声認識システムにおいて,ユーザの入力発話に対する応答の遅延は入力インターフェースとして重要な課題である.遅延を軽減しユーザに早期のフィードバックを行う方法として,これまでに,仮説を部分的に確定していくことで逐次的に結果を出力する仮説早期確定手法などが提案されてきた.我々は音声システムにおけるさらに高速,低遅延な応答速度の実現を目指し,これまでに,孤立単語認識を対象として仮説ネットワーク(木構造化辞書)の構造および認識処理中のフレームごとの状態尤度を用いて,入力の途中で探索を打ち切り発話終了よりも前に仮説を確定する手法を提案してきた.本稿では,この手法を連続単語認識へと拡張した手法を提案する.評価実験では,14単語の小規模な連続発声タスクにおいて,各単語の発話終了よりも平均約0.053秒前に,認識精度を劣化させることなく各仮説の確定ができた.8738単語の駅名の連続発声タスクにおいては,各単語の発話終了から平均約0.48秒の遅延で,各仮説の確定ができた.また,音響モデルの規模による比較を行った結果も報告する.
抄録(英)	Minimizing response delay of speech recognition system and giving rapid feed backs are important properties for an intuitive, easy-to-use speech interfaces. Many studies has been conducted to improve the response delay, such as making progressive outputs while recognition process "after" the words are half-determined in the context. In order to achieve higher speed input responses, we have proposed an algorithm to determine the most likely hypothesis "before" the utterance ends. The method has been examined for isolated word recognition, and this paper extends it for continuous word recognition. Experimental evaluations were performed for tasks of various vocabulary size. The result at a small vocabulary task with 14 words has shown that our proposed algorithm can determine each word for about 0.053 second prior to the actual end of speech on average, without any degradation of recognition accuracy. Another result on a station names recognition task with vocabulary size of 8738 has shown that our proposed algorithm can determine each word for about 0.48 second on average after the actual end of speech. The comparison results on various acoustic models are also reported.
キーワード(和)	音声認識 / 探索アルゴリズム / 早期確定 / 木構造化辞書 / 信頼度
キーワード(英)	Speech recognition / search algorithm / progressive output / tree lexicon / confidence measure
資料番号	NLC2010-21,SP2010-94
発行日

研究会情報
研究会	SP
開催期間	2010/12/13(から1日開催)
開催地（和）
開催地（英）
テーマ（和）
テーマ（英）
委員長氏名（和）
委員長氏名（英）
副委員長氏名（和）
副委員長氏名（英）
幹事氏名（和）
幹事氏名（英）
幹事補佐氏名（和）
幹事補佐氏名（英）

講演論文情報詳細
申込み研究会	Speech (SP)
本文の言語	JPN
タイトル（和）	連続単語認識における認識結果の逐次早期確定アルゴリズムの評価(認識アルゴリズム,第12回音声言語シンポジウム:情報アクセス,音声・言語処理一般)
サブタイトル（和）
タイトル（英）	Evaluation of Successive Rapid Hypothesis Determination Algorithm for Continuous Word Recognition
サブタイトル（和）
キーワード(1)（和/英）	音声認識 / Speech recognition
キーワード(2)（和/英）	探索アルゴリズム / search algorithm
キーワード(3)（和/英）	早期確定 / progressive output
キーワード(4)（和/英）	木構造化辞書 / tree lexicon
キーワード(5)（和/英）	信頼度 / confidence measure
第 1 著者氏名（和/英）	大野博之 / Hiroyuki OHNO
第 1 著者所属（和/英）	名古屋工業大学大学院工学研究科創成シミュレーション工学専攻 Department of Computer Science and Engineering, Nagoya Institute of Technology
第 2 著者氏名（和/英）	小島弘 / Hiroshi KOJIMA
第 2 著者所属（和/英）	名古屋工業大学大学院工学研究科創成シミュレーション工学専攻:(現)株式会社日立ソリューションズ Department of Computer Science and Engineering, Nagoya Institute of Technology:(Present Office)Hitachi Solutions, Ltd.
第 3 著者氏名（和/英）	南角吉彦 / Yoshihiko NANKAKU
第 3 著者所属（和/英）	名古屋工業大学大学院工学研究科創成シミュレーション工学専攻 Department of Computer Science and Engineering, Nagoya Institute of Technology
第 4 著者氏名（和/英）	李晃伸 / Akinobu LEE
第 4 著者所属（和/英）	名古屋工業大学大学院工学研究科創成シミュレーション工学専攻 Department of Computer Science and Engineering, Nagoya Institute of Technology
第 5 著者氏名（和/英）	徳田恵一 / Keiichi TOKUDA
第 5 著者所属（和/英）	名古屋工業大学大学院工学研究科創成シミュレーション工学専攻 Department of Computer Science and Engineering, Nagoya Institute of Technology
発表年月日	2010-12-21
資料番号	NLC2010-21,SP2010-94
巻番号（vol）	vol.110
号番号（no）	357
ページ範囲	pp.-
ページ数	6
発行日