Presentation 2023-11-16
A consideration of a music detection method based on object detection for separating music events that overlap with each other
Masaki Kitayama, Kazuhiro Onishi
Abstract(in Japanese) (See Japanese page)
Abstract(in English) We consider music event detection based on the object detection framework, with the aim of detecting individual music events that often overlap with each other in audio content. Conventional music detection methods, which perform frame-level classification, have difficulty detecting overlapping music events individually. Because the object detection framework from computer vision directly regresses event intervals, overlapping events can be detected individually at the event level. We propose a music detection model based on Faster R-CNN, an object detection method, and evaluate it on a dataset that simulates DJ mixing techniques, assuming commercial audio content. This study is intended to form the basis of technology that contributes to more advanced analysis and content production for audio content currently in commercial use.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Audio event detection / Music detection / Object detection / Faster R-CNN / Event-level detection
Paper # PRMU2023-22
Date of Issue 2023-11-09 (PRMU)

Conference Information
Committee PRMU / IPSJ-CVIM / IPSJ-DCC / IPSJ-CGVI
Conference Date 2023/11/16 (2 days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Kunio Kashino(NTT)
Vice Chair Takuya Funatomi(NAIST) / Go Irie(Tokyo Univ. of Science)
Secretary Takuya Funatomi(Tokyo Inst. of Tech.) / Go Irie(Riken)
Assistant Kei Shimonishi(Kyoto Univ.) / Kensho Hara(AIST)

Paper Information
Registration To Technical Committee on Pattern Recognition and Media Understanding / Special Interest Group on Computer Vision and Image Media / Special Interest Group on Digital Contents Creation / Special Interest Group on Computer Graphics and Visual Informatics
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A consideration of a music detection method based on object detection for separating music events that overlap with each other
Sub Title (in English)
Keyword(1) Audio event detection
Keyword(2) Music detection
Keyword(3) Object detection
Keyword(4) Faster R-CNN
Keyword(5) Event-level detection
1st Author's Name Masaki Kitayama
1st Author's Affiliation Hakuhodo Technologies Inc.(Hakuhodo Technologies)
2nd Author's Name Kazuhiro Onishi
2nd Author's Affiliation Hakuhodo Technologies Inc.(Hakuhodo Technologies)
Date 2023-11-16
Paper # PRMU2023-22
Volume (vol) vol.123
Number (no) PRMU-266
Page pp.37-42(PRMU)
#Pages 6
Date of Issue 2023-11-09 (PRMU)