Presentation 2017-03-01
[Poster Presentation] Estimation of playing position from music and speech sources based on music database
Satoshi Inui, Toru Takahashi,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) We describe about music information retrieval (MIR) system in real environment. In the environment, music and non-music signals are simultaneously occurred by different points. We investigate that MIR system in the simplest real environment, where music and voice signals are simultaneously occurred by two different points. The system must identify music and estimate the position in the music from received signal that is mixture signal of music signal with voice signal. The proposed system uses sound source localization and separation techniques to capture the target music signal. Then, the most likely music and its position are estimated based on a similarity between the captured signal and entry in the predefined database. The similarity is defined by acoustic feature named binary chroma spectrum. We examine MIR performance of typical single channel based system and microphone-array based system. F-measure is calculated to evaluate performance of the systems. It is confirmed that microphone-array based system is outperform typical single channel based system. We conclude that binary chroma spectrum is suitable for sound source separation.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Binary Chroma Spectrum / Music Playing Position / Sound Source Localization / Sound Source Separation / Music Information Retrieval
Paper # EA2016-105,SIP2016-160,SP2016-100
Date of Issue 2017-02-22 (EA, SIP, SP)

Conference Information
Committee SP / SIP / EA
Conference Date 2017/3/1(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Okinawa Industry Support Center
Topics (in Japanese) (See Japanese page)
Topics (in English) Speech, Engineering/Electro Acoustics, Signal Processing, and Related Topics
Chair Kazunori Mano(Shibaura Inst. of Tech.) / Makoto Nakashizuka(Chiba Inst. of Tech.) / Mitsunori Mizumachi(Kyushu Inst. of Tech.)
Vice Chair Hiroki Mori(Utsunomiya Univ.) / Masahiro Okuda(Univ. of Kitakyushu) / Shogo Muramatsu(Niigata Univ.) / Yoichi Haneda(Univ. of Electro-Comm.) / Suehiro Shimauchi(NTT)
Secretary Hiroki Mori(Kobe Univ.) / Masahiro Okuda(Shizuoka Univ.) / Shogo Muramatsu(Ritsumeikan Univ.) / Yoichi Haneda(Chiba Inst. of Tech.) / Suehiro Shimauchi(KDDI R&D Labs.)
Assistant Taichi Asami(NTT) / Kei Hashimoto(Nagoya Inst. of Tech.) / Osamu Watanabe(Takushoku Univ.) / Shigeto Takeoka(Shizuoka Inst. of Science and Tech.) / TREVINO Jorge(Tohoku Univ.)

Paper Information
Registration To Technical Committee on Speech / Technical Committee on Signal Processing / Technical Committee on Engineering Acoustics
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) [Poster Presentation] Estimation of playing position from music and speech sources based on music database
Sub Title (in English)
Keyword(1) Binary Chroma Spectrum
Keyword(2) Music Playing Position
Keyword(3) Sound Source Localization
Keyword(4) Sound Source Separation
Keyword(5) Music Information Retrieval
1st Author's Name Satoshi Inui
1st Author's Affiliation Osaka Sangyo University(OSU)
2nd Author's Name Toru Takahashi
2nd Author's Affiliation Osaka Sangyo University(OSU)
Date 2017-03-01
Paper # EA2016-105,SIP2016-160,SP2016-100
Volume (vol) vol.116
Number (no) EA-475,SIP-476,SP-477
Page pp.pp.129-134(EA), pp.129-134(SIP), pp.129-134(SP),
#Pages 6
Date of Issue 2017-02-22 (EA, SIP, SP)