時空間周波数領域における部分空間表現に基づく手話認識

Presentation	2024-03-03 Sign language recognition based on subspace representations in the spatio-temporal frequency domain Ryota Sato, Suzana Rita Alves Beleza, Erica Kido Shimomoto, Matheus Silva de Lima, Nobuko Kato, Kazuhiro Fukui,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	This paper proposes a subspace-based method for sign language recognition in videos. The proposed method represents a sign video as a 3D amplitude spectrum tensor on the frequency-domains, which is invariant to the shifts in the spatial and temporal directions of target objects. Such a 3D tensor is generated by applying the three-dimensional fast Fourier transform (3D-FFT) to a sign video. A 3D amplitude spectral tensor is regarded as one point on the Product Grassmann Manifold (PGM). The classification of videos is conducted based on the distance between two points corresponding to two videos on the PGM. The extensive experiments on private and public sign language recognition datasets demonstrated the effectiveness of the proposed method, showing a significant performance improvement over conventional subspace-based methods.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Sign Language Recognition / 3D Fast Fourier Transform / Product Grassmann Manifold / Subspace-based Methods
Paper #	PRMU2023-54
Date of Issue	2024-02-25 (PRMU)

Conference Information
Committee	PRMU / IBISML / IPSJ-CVIM
Conference Date	2024/3/3(2days)
Place (in Japanese)	(See Japanese page)
Place (in English)	Hiroshima Univ. Higashi-Hiroshima campus
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair	Kunio Kashio(NTT) / Masashi Sugiyama(Univ. of Tokyo) / 日浦慎作(兵庫県立大)
Vice Chair	Takuya Funatomi(NAIST) / Go Irie(Tokyo Univ. of Science) / Toshihiro Kamishima(AIST) / Koji Tsuda(Univ. of Tokyo)
Secretary	Takuya Funatomi(Tokyo Inst. of Tech.) / Go Irie(Riken) / Toshihiro Kamishima(NTT) / Koji Tsuda(Hokkaido Univ.) / (名大)
Assistant	Kei Shimonishi(Kyoto Univ.) / Kensho Hara(AIST) / Yoshinobu Kawahara(Osaka Univ.) / Taiji Suzuki(Univ.of Tokyo)

Paper Information
Registration To	Technical Committee on Pattern Recognition and Media Understanding / Technical Committee on Information-Based Induction Sciences and Machine Learning / Special Interest Group on Computer Vision and Image Media
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Sign language recognition based on subspace representations in the spatio-temporal frequency domain
Sub Title (in English)
Keyword(1)	Sign Language Recognition
Keyword(2)	3D Fast Fourier Transform
Keyword(3)	Product Grassmann Manifold
Keyword(4)	Subspace-based Methods
1st Author's Name	Ryota Sato
1st Author's Affiliation	University of Tsukuba(Univ. of Tsukuba)
2nd Author's Name	Suzana Rita Alves Beleza
2nd Author's Affiliation	University of Tsukuba(Univ. of Tsukuba)
3rd Author's Name	Erica Kido Shimomoto
3rd Author's Affiliation	National Institute of Advanced Industrial Science and Technology(AIST)
4th Author's Name	Matheus Silva de Lima
4th Author's Affiliation	University of Tsukuba(Univ. of Tsukuba)
5th Author's Name	Nobuko Kato
5th Author's Affiliation	Tsukuba University of Technology(Tsukuba Univ. of Technology)
6th Author's Name	Kazuhiro Fukui
6th Author's Affiliation	University of Tsukuba(Univ. of Tsukuba)
Date	2024-03-03
Paper #	PRMU2023-54
Volume (vol)	vol.123
Number (no)	PRMU-409
Page	pp.pp.19-24(PRMU),
#Pages	6
Date of Issue	2024-02-25 (PRMU)