A Study on generating First-Person Video in moving daily space using variational auto encoders

じょ しんひ; 岡留 有哉; 石黒 浩; 中村 泰

講演名	2023-07-01 A Study on generating First-Person Video in moving daily space using variational auto encoders じょしんひ(阪大), 岡留有哉(東京理科大), 石黒浩(阪大), 中村泰(理研),
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)	近年、ナビゲーション技術の発展により実空間で頑健に移動するロボットが実現してきたが、人間と共存する環境で周囲の人間の動きに合わせて移動するメカニズムは確立されていない。そこで我々は、人間の動作を含む移動中の状態変化を将来のビデオフレームの予測としてモデル化を行うことを考え、人間が歩行中に集めた一人称視点ビデオデータに対するビデオフレームの生成モデルについて実現可能性を調査している。本報告では、ネットワーク構造やロス関数の違いによる生成データの性質について報告する。
抄録(英)	Current researches on robot navigation mainly emphasize on responsive behaviors, but it usually presents insu?cient intelligence if we want robots be better integrated into human society. To cope with this issue, we are tackling to model the change of the state including other person’s motion by using a deep generative model. In terms of Variational Autoencoders, di?erent factors like hyperparameters, the dimensions of latent space and loss function can directly in?uence the generation quality. However, the concrete e?ect of these factors still needs to be studied. Thus, this paper works out an investigation which goes deep in this problem. In particular, we built a VAE based generative model with 3D ConvNet for video reconstruction task. Then we have summarized and compared models’ performances with various combinations of in?uencing factors and discovered their disentangled representation. A Variational Autoencoder with Arbitrary Conditioning(VAEAC) based prediction model was also evaluated. Our results demonstrate how the performance of generative models was changed to o?er a good instance for video prediction problem.
キーワード(和)
キーワード(英)	Variational AutoencoderVariational Autoencoder with Arbitrary Conditioningdisentangled representationvideo generation
資料番号	NC2023-24,IBISML2023-24
発行日	2023-06-22 (NC, IBISML)

研究会情報
研究会	NC / IBISML / IPSJ-BIO / IPSJ-MPS
開催期間	2023/6/29(から3日開催)
開催地（和）	沖縄科学技術大学院大学カンファレンス・センター
開催地（英）	OIST Conference Center
テーマ（和）	機械学習によるバイオデータマイニング、一般
テーマ（英）
委員長氏名（和）	田中宏和(東京都市大) / 杉山将(東大)
委員長氏名（英）	Hirokazu Tanaka(Tokyo City Univ.) / Masashi Sugiyama(Univ. of Tokyo)
副委員長氏名（和）	井澤淳(筑波大) / 神嶌敏弘(産総研) / 津田宏治(東大)
副委員長氏名（英）	Jun Izawa(Univ. of Tsukub) / Toshihiro Kamishima(AIST) / Koji Tsuda(Univ. of Tokyo)
幹事氏名（和）	寺島裕貴(NTT) / 田中沙織(奈良先端大) / 岩田具治(NTT) / 中村篤祥(北大)
幹事氏名（英）	Hiroki Terashima(NTT) / Saori Tanaka(NAIST) / Tomoharu Iwata(NTT) / Atsuyoshi Nakamura(Hokkaido Univ.)
幹事補佐氏名（和）	田和辻可昌(早大) / 堀井隆斗(阪大) / 河原吉伸(阪大) / 鈴木大慈(東工大)
幹事補佐氏名（英）	Yoshimasa Tawatsuji(Waseda Univ.) / Takato Horii(Osaka Univ.) / Yoshinobu Kawahara(Osaka Univ.) / Taiji Suzuki(Tokyo Inst. of Tech.)

講演論文情報詳細
申込み研究会	Technical Committee on Neurocomputing / Technical Committee on Information-Based Induction Sciences and Machine Learning / Special Interest Group on Bioinformatics and Genomics / Special Interest Group on Mathematical Modeling and Problem Solving
本文の言語	ENG
タイトル（和）
サブタイトル（和）
タイトル（英）	A Study on generating First-Person Video in moving daily space using variational auto encoders
サブタイトル（和）
キーワード(1)（和/英）	/ Variational AutoencoderVariational Autoencoder with Arbitrary Conditioningdisentangled representationvideo generation
第 1 著者氏名（和/英）	じょしんひ / Xu Chenfei
第 1 著者所属（和/英）	大阪大学(略称：阪大) Osaka Univeristy(略称：Osaka Univ.)
第 2 著者氏名（和/英）	岡留有哉 / Okadome Yuya
第 2 著者所属（和/英）	東京理科大学(略称：東京理科大) Tokyo University of Science(略称：TUS)
第 3 著者氏名（和/英）	石黒浩 / Ishiguro Hiroshi
第 3 著者所属（和/英）	大阪大学(略称：阪大) Osaka Univeristy(略称：Osaka Univ.)
第 4 著者氏名（和/英）	中村泰 / Nakamura Yutaka
第 4 著者所属（和/英）	理化学研究所(略称：理研) RIKEN(略称：RIKEN)
発表年月日	2023-07-01
資料番号	NC2023-24,IBISML2023-24
巻番号（vol）	vol.123
号番号（no）	NC-90,IBISML-91
ページ範囲	pp.153-158(NC), pp.153-158(IBISML),
ページ数	6
発行日	2023-06-22 (NC, IBISML)