Presentation | 2022-03-07 [Poster Presentation] Head location estimation using CSRNet for understanding Highly Congested Scenes Takuya Nagatoshi, Michiharu Niimi, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Neural network for congested scene recognition called CSRNet has been proposed. The purpose of CSRNet is to count the number of heads based on the output of CSRNet. We plane to apply CSRNet to the attendance management system which is able to recognize who are attending in a classroom, in order to do that, we need to estimate head location. In CSRNet, the input is an RGB image that is taken for a scene, and the output is the density map of people. The number of heads is given by the integral of density map. The learning process of CSRNet is performed by decreasing the mean square error between the output and the correct density map. Note that the size of density map is 1/8 of input image. Because the density map can be regarded as 2-dimentinal signal, we can estimate head locations by determining its maximal value. In this report, we put an up-sampling layer inside of CSRNet to try to make a detailed density map. The size of density map become 1/4 of input image. We apply gaussian filters to density map to reduce the influence of noise. In the experiments, we used ShanghaiTech data set and real pictures with is taken at a real classroom. As a result of experiments, we confirmed that the effectiveness of adding up-sampling layer, but it is difficult to adjust head location error and multiple extraction. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | congested scene recognition / head location estimation / CSRNet |
Paper # | EMM2021-95 |
Date of Issue | 2022-02-28 (EMM) |
Conference Information | |
Committee | EMM |
---|---|
Conference Date | 2022/3/7(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | (Primary: Online, Secondary: On-site) |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Image and Sound Quality, Metrics for Perception and Recognition, Human Auditory and Visual System, etc. |
Chair | Ryoichi Nishimura(NICT) |
Vice Chair | Masaaki Fujiyoshi(Tokyo Metropolitan Univ.) / Masatsugu Ichino(Univ. of Electro-Comm.) |
Secretary | Masaaki Fujiyoshi(Utsunomiya Univ.) / Masatsugu Ichino(NICT) |
Assistant | Shoko Imaizumi(Chiba Univ.) / Youichi Takashima(Kaishi Professional Univ.) |
Paper Information | |
Registration To | Technical Committee on Enriched MultiMedia |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | [Poster Presentation] Head location estimation using CSRNet for understanding Highly Congested Scenes |
Sub Title (in English) | |
Keyword(1) | congested scene recognition |
Keyword(2) | head location estimation |
Keyword(3) | CSRNet |
1st Author's Name | Takuya Nagatoshi |
1st Author's Affiliation | Kyushu Institute of Technology(KIT) |
2nd Author's Name | Michiharu Niimi |
2nd Author's Affiliation | Kyushu Institute of Technology(KIT) |
Date | 2022-03-07 |
Paper # | EMM2021-95 |
Volume (vol) | vol.121 |
Number (no) | EMM-417 |
Page | pp.pp.17-22(EMM), |
#Pages | 6 |
Date of Issue | 2022-02-28 (EMM) |