Presentation 2020-03-03
[Poster Presentation] Design of automatic soundscape generation based on image object detection
Yoshifumi Chisaki, Toshiharu Horiuchi,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This study describes automatic soundscape generation process for non-audio movie/photo. The processes consists of image analysis using deep learning, estimation of direction and relative distance for objects from a image, and mixing to take into account of harmony between audio and image. In this paper, a design of overall system are mentioned and requirements for image processing using deep learning, spatial audio processing process, harmonic mixing are discussed.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Soundscape / Deep learning / Mixing / Harmony between audio and image
Paper # EA2019-144,SIP2019-146,SP2019-93
Date of Issue 2020-02-24 (EA, SIP, SP)

Conference Information
Committee SP / EA / SIP
Conference Date 2020/3/2(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Okinawa Industry Support Center
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Hisashi Kawai(NICT) / Kenichi Furuya(Oita Univ.) / Naoyuki Aikawa(TUS)
Vice Chair Akinobu Ri(Nagoya Inst. of Tech.) / Suehiro Shimauchi(Kanazawa Inst. of Tech.) / Shigeto Takeoka(Shizuoka Inst. of Science and Tech.) / Kazunori Hayashi(Osaka City Univ) / Yukihiro Bandou(NTT)
Secretary Akinobu Ri(Kyoto Univ.) / Suehiro Shimauchi(Waseda Univ.) / Shigeto Takeoka(NHK) / Kazunori Hayashi(Univ. of Tokyo) / Yukihiro Bandou(Hiroshima Univ.)
Assistant Tomoki Koriyama(Univ. of Tokyo) / Yusuke Ijima(NTT) / Keisuke Imoto(Ritsumeikan Univ.) / Daisuke Morikawa(Toyama Pref Univ.) / Kenjiro Sugimoto(Waseda Univ.)

Paper Information
Registration To Technical Committee on Speech / Technical Committee on Engineering Acoustics / Technical Committee on Signal Processing
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) [Poster Presentation] Design of automatic soundscape generation based on image object detection
Sub Title (in English)
Keyword(1) Soundscape
Keyword(2) Deep learning
Keyword(3) Mixing
Keyword(4) Harmony between audio and image
1st Author's Name Yoshifumi Chisaki
1st Author's Affiliation Chiba Institute of Technology(CIT)
2nd Author's Name Toshiharu Horiuchi
2nd Author's Affiliation KDDI Research, Inc.(KDDI Research, Inc.)
Date 2020-03-03
Paper # EA2019-144,SIP2019-146,SP2019-93
Volume (vol) vol.119
Number (no) EA-439,SIP-440,SP-441
Page pp.pp.251-254(EA), pp.251-254(SIP), pp.251-254(SP),
#Pages 4
Date of Issue 2020-02-24 (EA, SIP, SP)