Presentation | 2021-03-03 [Poster Presentation] Issues on automatic soundscape generation based on image object detection Yoshifumi Chisaki, Toshiharu Horiuchi, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | This study describes automatic soundscape generation process for non-audio movie and photo. The processes consists of image analysis using machine learning, estimation of direction and relative distance for objects from a image, orbit estimation of objects and mixing to take into account of harmony between audio and image. In this paper, a design of overall system are mentioned ,and requirements and issues for image processing using machine learning, spatial audio processing process, harmonic mixing are discussed. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Soundscape / Machine learning / Automatic mixing / Object's orbit estimation / Harmony between audio and image |
Paper # | EA2020-66,SIP2020-97,SP2020-31 |
Date of Issue | 2021-02-24 (EA, SIP, SP) |
Conference Information | |
Committee | EA / US / SP / SIP / IPSJ-SLP |
---|---|
Conference Date | 2021/3/3(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Online |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Speech, Engineering/Electro Acoustics, Signal Processing, Ultrasonics, and Related Topics |
Chair | Kenichi Furuya(Oita Univ.) / Hikaru Miura(Nihon Univ.) / Hisashi Kawai(NICT) / Kazunori Hayashi(Kyoto Univ.) / 北岡 教英(豊橋技科大) |
Vice Chair | Yoshinobu Kajikawa(Kansai Univ.) / Kentaro Matsui(NHK) / Jun Kondo(Shizuoka Univ.) / Yoshikazu Koike(Shibaura Inst. of Tech.) / / Yukihiro Bandou(NTT) / Toshihisa Tanaka(Tokyo Univ. Agri.&Tech.) |
Secretary | Yoshinobu Kajikawa(Univ. of Tokyo) / Kentaro Matsui(NTT) / Jun Kondo(Doshisha Univ.) / Yoshikazu Koike(Tohoku Univ.) / (Univ. of Tokyo) / Yukihiro Bandou(Waseda Univ.) / Toshihisa Tanaka(Hosei Univ.) / (Waseda Univ.) |
Assistant | Yukou Wakabayashi(Tokyo Metropolitan Univ.) / Tatsuya Komatsu(LINE) / Shinnosuke Hirata(Tokyo Inst. of Tech.) / Yusuke Ijima(NTT) / Yuichi Tanaka(Tokyo Univ. Agri.&Tech.) |
Paper Information | |
Registration To | Technical Committee on Engineering Acoustics / Technical Committee on Ultrasonics / Technical Committee on Speech / Technical Committee on Signal Processing / Special Interest Group on Spoken Language Processing |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | [Poster Presentation] Issues on automatic soundscape generation based on image object detection |
Sub Title (in English) | |
Keyword(1) | Soundscape |
Keyword(2) | Machine learning |
Keyword(3) | Automatic mixing |
Keyword(4) | Object's orbit estimation |
Keyword(5) | Harmony between audio and image |
1st Author's Name | Yoshifumi Chisaki |
1st Author's Affiliation | Chiba Institute of Technology(CIT) |
2nd Author's Name | Toshiharu Horiuchi |
2nd Author's Affiliation | KDDI Research, Inc.(KDDI Research, Inc.) |
Date | 2021-03-03 |
Paper # | EA2020-66,SIP2020-97,SP2020-31 |
Volume (vol) | vol.120 |
Number (no) | EA-397,SIP-398,SP-399 |
Page | pp.pp.41-44(EA), pp.41-44(SIP), pp.41-44(SP), |
#Pages | 4 |
Date of Issue | 2021-02-24 (EA, SIP, SP) |