Presentation 2014-06-16
Design and Implementation of Multidirectional Sound Annotation Tool with HARK
Osamu SUGIYAMA, Katsutoshi ITOYAMA, Kazuhiro NAKADAI, Hiroshi G. OKUNO,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In this study we designed and developed the multidirectional sound source annotation tool with the robot audition software, HARK. With the rise of inexpensive microphone array products and the robot audition software called HARK, we can record and analyze multidirectional sound sources easily. The combination of microphone array and the software enables us to separate, localize, and track multidirectional sound sources. Most of the solutions for accessing these separated sound source information provide clients for interpreting simplified information about the separated sources, but not to directly execute the semantic annotations. Our proposed sound annotation tool provides drag & drop operation of annotation with a 3D sound source view and also provides annotation autocompletion with a SVM trained with the user ' s annotation history. The proposed features enable users to do the annotation task intuitively and confirm its result. We also conducted an evaluation demonstrating the efficiency of annotation done using the tool.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) User interface / sound source separation / sound source localization / annotation / autocompletion / HARK
Paper # CNR2014-5
Date of Issue

Conference Information
Committee CNR
Conference Date 2014/6/9(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Cloud Network Robotics (CNR)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Design and Implementation of Multidirectional Sound Annotation Tool with HARK
Sub Title (in English)
Keyword(1) User interface
Keyword(2) sound source separation
Keyword(3) sound source localization
Keyword(4) annotation
Keyword(5) autocompletion
Keyword(6) HARK
1st Author's Name Osamu SUGIYAMA
1st Author's Affiliation Graduate School of Information Science and Engineering, Tokyo Institute of Technology()
2nd Author's Name Katsutoshi ITOYAMA
2nd Author's Affiliation Graduate School of Informatics, Kyoto University
3rd Author's Name Kazuhiro NAKADAI
3rd Author's Affiliation Graduate School of Information Science and Engineering, Tokyo Institute of Technology:Honda Research Institute Japan
4th Author's Name Hiroshi G. OKUNO
4th Author's Affiliation Graduate Program for Embodiment Informatics, Waseda University
Date 2014-06-16
Paper # CNR2014-5
Volume (vol) vol.114
Number (no) 85
Page pp.pp.-
#Pages 4
Date of Issue