Presentation | 2024-03-03 Sign language recognition based on subspace representations in the spatio-temporal frequency domain Ryota Sato, Suzana Rita Alves Beleza, Erica Kido Shimomoto, Matheus Silva de Lima, Nobuko Kato, Kazuhiro Fukui, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | This paper proposes a subspace-based method for sign language recognition in videos. The proposed method represents a sign video as a 3D amplitude spectrum tensor on the frequency-domains, which is invariant to the shifts in the spatial and temporal directions of target objects. Such a 3D tensor is generated by applying the three-dimensional fast Fourier transform (3D-FFT) to a sign video. A 3D amplitude spectral tensor is regarded as one point on the Product Grassmann Manifold (PGM). The classification of videos is conducted based on the distance between two points corresponding to two videos on the PGM. The extensive experiments on private and public sign language recognition datasets demonstrated the effectiveness of the proposed method, showing a significant performance improvement over conventional subspace-based methods. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Sign Language Recognition / 3D Fast Fourier Transform / Product Grassmann Manifold / Subspace-based Methods |
Paper # | PRMU2023-54 |
Date of Issue | 2024-02-25 (PRMU) |
Conference Information | |
Committee | PRMU / IBISML / IPSJ-CVIM |
---|---|
Conference Date | 2024/3/3(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Hiroshima Univ. Higashi-Hiroshima campus |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | Kunio Kashio(NTT) / Masashi Sugiyama(Univ. of Tokyo) / 日浦 慎作(兵庫県立大) |
Vice Chair | Takuya Funatomi(NAIST) / Go Irie(Tokyo Univ. of Science) / Toshihiro Kamishima(AIST) / Koji Tsuda(Univ. of Tokyo) |
Secretary | Takuya Funatomi(Tokyo Inst. of Tech.) / Go Irie(Riken) / Toshihiro Kamishima(NTT) / Koji Tsuda(Hokkaido Univ.) / (名大) |
Assistant | Kei Shimonishi(Kyoto Univ.) / Kensho Hara(AIST) / Yoshinobu Kawahara(Osaka Univ.) / Taiji Suzuki(Univ.of Tokyo) |
Paper Information | |
Registration To | Technical Committee on Pattern Recognition and Media Understanding / Technical Committee on Information-Based Induction Sciences and Machine Learning / Special Interest Group on Computer Vision and Image Media |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Sign language recognition based on subspace representations in the spatio-temporal frequency domain |
Sub Title (in English) | |
Keyword(1) | Sign Language Recognition |
Keyword(2) | 3D Fast Fourier Transform |
Keyword(3) | Product Grassmann Manifold |
Keyword(4) | Subspace-based Methods |
1st Author's Name | Ryota Sato |
1st Author's Affiliation | University of Tsukuba(Univ. of Tsukuba) |
2nd Author's Name | Suzana Rita Alves Beleza |
2nd Author's Affiliation | University of Tsukuba(Univ. of Tsukuba) |
3rd Author's Name | Erica Kido Shimomoto |
3rd Author's Affiliation | National Institute of Advanced Industrial Science and Technology(AIST) |
4th Author's Name | Matheus Silva de Lima |
4th Author's Affiliation | University of Tsukuba(Univ. of Tsukuba) |
5th Author's Name | Nobuko Kato |
5th Author's Affiliation | Tsukuba University of Technology(Tsukuba Univ. of Technology) |
6th Author's Name | Kazuhiro Fukui |
6th Author's Affiliation | University of Tsukuba(Univ. of Tsukuba) |
Date | 2024-03-03 |
Paper # | PRMU2023-54 |
Volume (vol) | vol.123 |
Number (no) | PRMU-409 |
Page | pp.pp.19-24(PRMU), |
#Pages | 6 |
Date of Issue | 2024-02-25 (PRMU) |