適応的ガウス混合モデルを用いた画像アノテーション(一般セッション,実世界文字認識と理解)

坪下 幸寛; 加藤 典司; 岡田 真人

講演名	2011-12-16 適応的ガウス混合モデルを用いた画像アノテーション(一般セッション,実世界文字認識と理解) 坪下幸寛, 加藤典司, 岡田真人,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	レポートでは,混合ガウス分布モデル(GMM: Gaussian Mixture Model)を確率モデルとして用いた学習ベースの自動アノテーション技術に着目する.GMMを用いた手法の一つである教師あり多クラスラベリングでは,ラベルごとに,そのラベルの付与された学習サンプルを収集し,学習を個別に行うので,学習サンプル数がラベルごとに大きく異なる.そのため,学習サンプルが少ないラベルでは,過学習の問題が生じてしまい,性能が低くなるという問題点があった.我々は,ラベルを規定しない学習サンプル集合を用いたペナルティ項を付与して個々のラベル確率モデルの学習を行うことを提案する.提案手法によれば,独立に個々のラベルの確率モデルの学習を行いながらも,システム全体の最適化が可能となり,低出現ラベルに対する過学習を抑制する.標準的なテストコーパスを用いた評価の結果,提案手法は性能指標,特に再現率,N+を大きく向上させた.
抄録(英)	In the present study, we focus on learning based automatic image annotation method using Gaussian mixture model (GMM) as a probabilistic model. In Supervised Multiclass Labeling (SML), which is a conventional image annotation method to use GMM, training samples assigned to each semantic label are collected separately, and each probabilistic model of semantic labels is trained independently. The number of training samples therefore varies a great deal according to the labels. Consequently, there is a problem of low performances of semantic labels that have a few training samples because of over fitting. In the present study, we propose to introduce a penalty term using training samples that is not confined by a particular semantic label when training each semantic label model. According to the proposed method, while each probabilistic model to semantic labels is trained independently, optimization of whole annotation system is achieved, and the over fitting of models of labels that have a few samples is suppressed. As the result of evaluation tests using a standard test collection for image annotation, we found the proposed method exhibited higher performance than the conventional SML, especially recall and N+.
キーワード(和)	自動画像アノテーション / 機械学習 / 混合ガウスモデル / ペナルティ項
キーワード(英)	Automatic Image Annotation / Machine Learning / Gaussian Mixture Model / Penalty Term
資料番号	PRMU2011-144
発行日

研究会情報
研究会	PRMU
開催期間	2011/12/8(から1日開催)
開催地（和）
開催地（英）
テーマ（和）
テーマ（英）
委員長氏名（和）
委員長氏名（英）
副委員長氏名（和）
副委員長氏名（英）
幹事氏名（和）
幹事氏名（英）
幹事補佐氏名（和）
幹事補佐氏名（英）

講演論文情報詳細
申込み研究会	Pattern Recognition and Media Understanding (PRMU)
本文の言語	JPN
タイトル（和）	適応的ガウス混合モデルを用いた画像アノテーション(一般セッション,実世界文字認識と理解)
サブタイトル（和）
タイトル（英）	Image Annotation Using Adapted Gaussian Mixture Model
サブタイトル（和）
キーワード(1)（和/英）	自動画像アノテーション / Automatic Image Annotation
キーワード(2)（和/英）	機械学習 / Machine Learning
キーワード(3)（和/英）	混合ガウスモデル / Gaussian Mixture Model
キーワード(4)（和/英）	ペナルティ項 / Penalty Term
第 1 著者氏名（和/英）	坪下幸寛 / Yukihiro TSUBOSHITA
第 1 著者所属（和/英）	富士ゼロックス(株)研究技術開発本部 Research & Technology Group, Fuji Xerox Co., Ltd.
第 2 著者氏名（和/英）	加藤典司 / Noriji KATO
第 2 著者所属（和/英）	富士ゼロックス(株)研究技術開発本部 Research & Technology Group, Fuji Xerox Co., Ltd.
第 3 著者氏名（和/英）	岡田真人 / Masato OKADA
第 3 著者所属（和/英）	東京大学新領域創成科学研究科 Graduate School of Frontier Sciences, The University of Tokyo
発表年月日	2011-12-16
資料番号	PRMU2011-144
巻番号（vol）	vol.111
号番号（no）	353
ページ範囲	pp.-
ページ数	6
発行日