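Estimation of a User's Internal State before the First Input Utterance Using HMM with Non-verbal Information (Theme Session, Time-Series Pattern Recognition)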

Yuya Chiba (千葉 祐弥); Masashi Ito (伊藤 仁); Akinori Ito (伊藤 彰則)

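Presentation title	2012-02-09 Estimation of a User's Internal State before the First Input Utterance Using HMM with Non-verbal Information (Theme Session, Time-Series Pattern Recognition) Yuya Chiba, Masashi Ito, Akinori Ito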
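Abstract (translated from Japanese)	This paper describes a method for estimating the internal state of the user of a spoken dialogue system before the user's utterance. In real environments, the prompts of system-initiative dialogue systems often confuse users. A typical dialogue system assists users who are taking a long time to respond, for example by presenting more detailed information, but such assistance is a nuisance for users who are simply thinking about how to answer the prompt. To respond appropriately, the system must be able to take into account the user's internal state before the utterance. Conventional user-modeling research has focused on the linguistic information of utterances. One problem with this approach is that the user's internal state cannot be estimated until the input utterance has ended. This study therefore focuses on the user's non-verbal information before the utterance occurs, such as fillers, silent intervals, and head movements. In this paper, the fixed-length features examined in our previous work are reconstructed as time-series features, and the user model is estimated with a hidden Markov model. A subject-open discrimination experiment yielded a discrimination accuracy of 79.6%.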
Abstract (English)	This paper describes a method for estimating the internal state of the user of a spoken dialogue system before the user's input utterance. In practical use of a dialogue-based system, the user is often perplexed by the prompt. An ordinary system provides more detailed information to a user who is taking time to respond, but such help is intrusive for a user who is considering the answer to the prompt. To respond appropriately, the spoken dialogue system has to be able to consider the user's internal state before the user's input. Conventional research on user modeling has focused on the linguistic information of the utterance. One problem with these approaches is that they cannot estimate the user's state until the end of the user's first utterance. Therefore, our study focuses on the user's non-verbal output, such as fillers, silence, or head movement, up to the occurrence of the user's input utterance. This paper describes a method of user modeling by HMM. We conducted a discrimination experiment and obtained an accuracy of 79.6%.
Keywords	spoken dialogue / user modeling / speech processing / image processing
Report number	PRMU2011-187, SP2011-102

Technical committee information
Technical committee	PRMU
Dates	2012/2/2 (1 day)

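Paper details
Submitting committee	Pattern Recognition and Media Understanding (PRMU)
Language of text	JPN
Title (translated from Japanese)	Estimation of a User's Internal State before the First Input Utterance Using HMM with Non-verbal Information (Theme Session, Time-Series Pattern Recognition)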
Title (English)	Estimation of a User's Internal State before the First Input Utterance Using HMM with Non-verbal Information
Keyword (1)	spoken dialogue
Keyword (2)	user modeling
Keyword (3)	speech processing
Keyword (4)	image processing
First author (Japanese/English)	千葉祐弥 / Yuya CHIBA
First author affiliation	Graduate School of Engineering, Tohoku University
Second author (Japanese/English)	伊藤仁 / Masashi ITO
Second author affiliation	Department of Electronics and Intelligent System, Tohoku Institute of Technology
Third author (Japanese/English)	伊藤彰則 / Akinori ITO
Third author affiliation	Graduate School of Engineering, Tohoku University
Presentation date	2012-02-09
Report number	PRMU2011-187, SP2011-102
Volume	vol.111
Issue number	430
Page range	pp.-
Number of pages	6