Presentation | 2017-03-01 [Poster Presentation] Estimation of playing position from music and speech sources based on music database Satoshi Inui, Toru Takahashi, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | We describe about music information retrieval (MIR) system in real environment. In the environment, music and non-music signals are simultaneously occurred by different points. We investigate that MIR system in the simplest real environment, where music and voice signals are simultaneously occurred by two different points. The system must identify music and estimate the position in the music from received signal that is mixture signal of music signal with voice signal. The proposed system uses sound source localization and separation techniques to capture the target music signal. Then, the most likely music and its position are estimated based on a similarity between the captured signal and entry in the predefined database. The similarity is defined by acoustic feature named binary chroma spectrum. We examine MIR performance of typical single channel based system and microphone-array based system. F-measure is calculated to evaluate performance of the systems. It is confirmed that microphone-array based system is outperform typical single channel based system. We conclude that binary chroma spectrum is suitable for sound source separation. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Binary Chroma Spectrum / Music Playing Position / Sound Source Localization / Sound Source Separation / Music Information Retrieval |
Paper # | EA2016-105,SIP2016-160,SP2016-100 |
Date of Issue | 2017-02-22 (EA, SIP, SP) |
Conference Information | |
Committee | SP / SIP / EA |
---|---|
Conference Date | 2017/3/1(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Okinawa Industry Support Center |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Speech, Engineering/Electro Acoustics, Signal Processing, and Related Topics |
Chair | Kazunori Mano(Shibaura Inst. of Tech.) / Makoto Nakashizuka(Chiba Inst. of Tech.) / Mitsunori Mizumachi(Kyushu Inst. of Tech.) |
Vice Chair | Hiroki Mori(Utsunomiya Univ.) / Masahiro Okuda(Univ. of Kitakyushu) / Shogo Muramatsu(Niigata Univ.) / Yoichi Haneda(Univ. of Electro-Comm.) / Suehiro Shimauchi(NTT) |
Secretary | Hiroki Mori(Kobe Univ.) / Masahiro Okuda(Shizuoka Univ.) / Shogo Muramatsu(Ritsumeikan Univ.) / Yoichi Haneda(Chiba Inst. of Tech.) / Suehiro Shimauchi(KDDI R&D Labs.) |
Assistant | Taichi Asami(NTT) / Kei Hashimoto(Nagoya Inst. of Tech.) / Osamu Watanabe(Takushoku Univ.) / Shigeto Takeoka(Shizuoka Inst. of Science and Tech.) / TREVINO Jorge(Tohoku Univ.) |
Paper Information | |
Registration To | Technical Committee on Speech / Technical Committee on Signal Processing / Technical Committee on Engineering Acoustics |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | [Poster Presentation] Estimation of playing position from music and speech sources based on music database |
Sub Title (in English) | |
Keyword(1) | Binary Chroma Spectrum |
Keyword(2) | Music Playing Position |
Keyword(3) | Sound Source Localization |
Keyword(4) | Sound Source Separation |
Keyword(5) | Music Information Retrieval |
1st Author's Name | Satoshi Inui |
1st Author's Affiliation | Osaka Sangyo University(OSU) |
2nd Author's Name | Toru Takahashi |
2nd Author's Affiliation | Osaka Sangyo University(OSU) |
Date | 2017-03-01 |
Paper # | EA2016-105,SIP2016-160,SP2016-100 |
Volume (vol) | vol.116 |
Number (no) | EA-475,SIP-476,SP-477 |
Page | pp.pp.129-134(EA), pp.129-134(SIP), pp.129-134(SP), |
#Pages | 6 |
Date of Issue | 2017-02-22 (EA, SIP, SP) |