Presentation 2024-03-03
Sign language recognition based on subspace representations in the spatio-temporal frequency domain
Ryota Sato, Suzana Rita Alves Beleza, Erica Kido Shimomoto, Matheus Silva de Lima, Nobuko Kato, Kazuhiro Fukui
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper proposes a subspace-based method for sign language recognition in videos. The proposed method represents a sign video as a 3D amplitude spectrum tensor in the spatio-temporal frequency domain, which is invariant to shifts of the target objects in the spatial and temporal directions. This 3D tensor is generated by applying the three-dimensional fast Fourier transform (3D-FFT) to a sign video. Each 3D amplitude spectrum tensor is regarded as one point on the Product Grassmann Manifold (PGM), and videos are classified based on the distance between the two points on the PGM that correspond to two videos. Extensive experiments on private and public sign language recognition datasets demonstrate the effectiveness of the proposed method, showing a significant performance improvement over conventional subspace-based methods.
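The shift-invariance property mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy clip size, the random data, and the helper name `amplitude_spectrum` are assumptions made for illustration, and the invariance is exact only for circular shifts (for real videos it holds approximately). The Product Grassmann Manifold step is not covered here.

```python
import numpy as np

def amplitude_spectrum(video):
    """Return the 3D amplitude spectrum |FFT3(video)| of a (T, H, W) array.

    Hypothetical helper for illustration; not taken from the paper.
    """
    return np.abs(np.fft.fftn(video, axes=(0, 1, 2)))

# Toy grayscale clip (assumed size): 8 frames of 16x16 pixels.
rng = np.random.default_rng(0)
video = rng.random((8, 16, 16))

# Circularly shift the clip by 2 frames, 3 rows and 5 columns.
shifted = np.roll(video, shift=(2, 3, 5), axis=(0, 1, 2))

spec = amplitude_spectrum(video)
spec_shifted = amplitude_spectrum(shifted)

# A circular shift only changes the phase of the Fourier coefficients,
# so the amplitude spectra agree even though the pixel values differ.
same_amplitude = bool(np.allclose(spec, spec_shifted))
same_pixels = bool(np.allclose(video, shifted))
print(same_amplitude, same_pixels)
```

Discarding the phase is what removes the dependence on where and when the motion occurs in the clip; the amplitude tensor keeps the same shape as the input video.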
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Sign Language Recognition / 3D Fast Fourier Transform / Product Grassmann Manifold / Subspace-based Methods
Paper # PRMU2023-54
Date of Issue 2024-02-25 (PRMU)

Conference Information
Committee PRMU / IBISML / IPSJ-CVIM
Conference Date 2024/3/3(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Hiroshima Univ. Higashi-Hiroshima campus
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Kunio Kashio(NTT) / Masashi Sugiyama(Univ. of Tokyo) / Shinsaku Hiura(Univ. of Hyogo)
Vice Chair Takuya Funatomi(NAIST) / Go Irie(Tokyo Univ. of Science) / Toshihiro Kamishima(AIST) / Koji Tsuda(Univ. of Tokyo)
Secretary Takuya Funatomi(Tokyo Inst. of Tech.) / Go Irie(Riken) / Toshihiro Kamishima(NTT) / Koji Tsuda(Hokkaido Univ.) / (Nagoya Univ.)
Assistant Kei Shimonishi(Kyoto Univ.) / Kensho Hara(AIST) / Yoshinobu Kawahara(Osaka Univ.) / Taiji Suzuki(Univ.of Tokyo)

Paper Information
Registration To Technical Committee on Pattern Recognition and Media Understanding / Technical Committee on Information-Based Induction Sciences and Machine Learning / Special Interest Group on Computer Vision and Image Media
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Sign language recognition based on subspace representations in the spatio-temporal frequency domain
Sub Title (in English)
Keyword(1) Sign Language Recognition
Keyword(2) 3D Fast Fourier Transform
Keyword(3) Product Grassmann Manifold
Keyword(4) Subspace-based Methods
1st Author's Name Ryota Sato
1st Author's Affiliation University of Tsukuba(Univ. of Tsukuba)
2nd Author's Name Suzana Rita Alves Beleza
2nd Author's Affiliation University of Tsukuba(Univ. of Tsukuba)
3rd Author's Name Erica Kido Shimomoto
3rd Author's Affiliation National Institute of Advanced Industrial Science and Technology(AIST)
4th Author's Name Matheus Silva de Lima
4th Author's Affiliation University of Tsukuba(Univ. of Tsukuba)
5th Author's Name Nobuko Kato
5th Author's Affiliation Tsukuba University of Technology(Tsukuba Univ. of Technology)
6th Author's Name Kazuhiro Fukui
6th Author's Affiliation University of Tsukuba(Univ. of Tsukuba)
Date 2024-03-03
Paper # PRMU2023-54
Volume (vol) vol.123
Number (no) PRMU-409
Page pp.19-24(PRMU)
#Pages 6
Date of Issue 2024-02-25 (PRMU)