YOLOv5の転移学習を活用した指先のジェスチャによる物体認識

工藤 司

講演名	2022-08-26 YOLOv5の転移学習を活用した指先のジェスチャによる物体認識工藤司(静岡理工科大),
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	動画や画像に対する物体認識では，対象の領域が小さい場合には，まず物体検出により対象領域を抽出し，その領域に対して物体認識が行われる．この物体検出と認識を同時に行うために，深層学習を活用したさまざまな方法が提案されている．しかし，多様な物体を対象とする場合には，全ての物体に対する訓練データを準備する必要があるという課題が発生する．この課題に対して，先行研究において，YOLOv5を使用した手の検出モデルを活用して，手のジェスチャの軌跡を認識し，この軌跡から対象領域を抽出できることを示した．本研究では，対象領域の抽出精度をさらに向上するための方式を提案する．まず，より小さな指先によりジェスチャを行う場合，転移学習により効率的にモデルを訓練できることを示す．次に，ジェスチャを停止するという動作を交えることで，一連のジェスチャから対象領域を指示している区間や，対象の特徴点を抽出できることを示す．さらに，書籍を対象とした実験を通じて，提案方式により，書籍の対象領域を傾きを補正した矩形として抽出できることを示す．
抄録(英)	In object recognition for videos and images, when the target area is small, the area is firstly extracted by object detection, and then object recognition is performed for that area. Various methods utilizing deep learning have been proposed to simultaneously perform object detection and recognition. However, when targeting various objects, there is a problem that it is necessary to prepare training data for all objects. To address this issue, in my previous study, I showed that a hand gesture trajectory can be recognized and then the target area can be extracted by utilizing a hand detection model using YOLOv5. In this study, I propose a method to improve the accuracy of the target area extraction. First, I show that the model can be trained efficiently by transfer learning to perform the gesture with a smaller fingertip. Next, I show that by utilizing the action of stopping the gesture, the gesture's interval indicating the target area and the feature points of the target can be extracted. Furthermore, experiments using books show that these methods can extract the books' target area as a tilt-corrected rectangle.
キーワード(和)	YOLO / 物体検出 / 物体認識 / ジェスチャ認識 / 転移学習
キーワード(英)	YOLO / object detection / object recognition / gesture recognition / transfer learning
資料番号	SWIM2022-12,SC2022-18
発行日	2022-08-19 (SWIM, SC)

研究会情報
研究会	SWIM / SC
開催期間	2022/8/26(から1日開催)
開催地（和）	オンライン開催
開催地（英）	Held online
テーマ（和）	ソサエティ5.0に向けたサービスコンピューティングとインタプライズモデル化技術,一般
テーマ（英）	Service Computing and Interprise Technology Toward Sciety 5.0 and etc
委員長氏名（和）	五月女健治(法政大) / 木村功作(富士通研)
委員長氏名（英）	Kenji Saotome(Hosei Univ.) / Kosaku Kimura(Fujitsu Lab.)
副委員長氏名（和）	林章浩(静岡理工科大) / 中口孝雄(京都情報大学院大)
副委員長氏名（英）	Akihiro Hayashi(Shizuoka Inst. of Science and Tech.) / Takao Nakaguchi(KCGI)
幹事氏名（和）	能上慎也(東京理科大) / 山田耕嗣(阪産大) / 中村匡秀(神戸大) / 干川尚人(小山工専) / 北島信哉(富士通)
幹事氏名（英）	Shinya Nogami(Tokyo Univ. of Science) / Koji Yamada(Osaka Sangyo Univ.) / Masahide Nakamura(Kobe Univ.) / Naoto Hoshikawa(National Institute of Technology, Oyama College) / Shinya Kitajima(Fujitsu)
幹事補佐氏名（和）	工藤司(静岡理工科大) / 辻孝吉(愛知県立大) / 細野繁(東京工科大)
幹事補佐氏名（英）	Tsukasa Kudo(Shizuoka Inst. of Science and Tech.) / Kokichi Tsuji(Aichi Pref. Univ.) / Shigeru Hosono(Tokyo Univ. of Tech.)

講演論文情報詳細
申込み研究会	Technical Committee on Software Interprise Modeling / Technical Committee on Service Computing
本文の言語	JPN
タイトル（和）	YOLOv5の転移学習を活用した指先のジェスチャによる物体認識
サブタイトル（和）
タイトル（英）	Object Recognition by Fingertip Gestures Utilizing Transfer Learning of YOLOv5
サブタイトル（和）
キーワード(1)（和/英）	YOLO / YOLO
キーワード(2)（和/英）	物体検出 / object detection
キーワード(3)（和/英）	物体認識 / object recognition
キーワード(4)（和/英）	ジェスチャ認識 / gesture recognition
キーワード(5)（和/英）	転移学習 / transfer learning
第 1 著者氏名（和/英）	工藤司 / Tsukasa Kudo
第 1 著者所属（和/英）	静岡理工科大学(略称：静岡理工科大) Shizuoka Institute of Science and Technology(略称：SIST)
発表年月日	2022-08-26
資料番号	SWIM2022-12,SC2022-18
巻番号（vol）	vol.122
号番号（no）	SWIM-167,SC-168
ページ範囲	pp.9-15(SWIM), pp.9-15(SC),
ページ数	7
発行日	2022-08-19 (SWIM, SC)