Presentation 2007-02-22
A Proposal for Speech/Music/Noise Discrimination Using Image Features of Sonagram
Nao TAKAYANAGI, Takahiro HAYASHI, Rikio ONAI,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In this paper, we propose a speech/music/noise discrimination technique using image features of sonagram (a 2-dimension image, the horizontal axis of which represents time, the vertical axis of which represents frequency, and the intensity of each point in which represents power of a particular frequency at a particular time). In the proposal technique, we focus on a difference in the sonagram between each audio section. In case of speech, gentle curves are drawn on the sonagram because the peak frequency of speech changes gently. In case of music, horizontal lines are drawn on the sonagram because the peak frequency of music is stable. And in case of noise, random points are drawn on the sonagram because the peak frequency of noise is unstable. Therefore, we calculate the 2-dimension frequency of the sonagram and discriminate three kinds of audio sections using discriminant analysis. We evaluated the proposal technique by precision and error rate of discrimination.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Speech/Music/Noise Discrimination / Sonagram
Paper # PRMU2006-209,HIP2006-102
Date of Issue

Conference Information
Committee HIP
Conference Date 2007/2/15(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Human Information Processing (HIP)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A Proposal for Speech/Music/Noise Discrimination Using Image Features of Sonagram
Sub Title (in English)
Keyword(1) Speech/Music/Noise Discrimination
Keyword(2) Sonagram
1st Author's Name Nao TAKAYANAGI
1st Author's Affiliation Department of Computer Science, Graduate School of Electro-Communications, The University of Electro-Communications()
2nd Author's Name Takahiro HAYASHI
2nd Author's Affiliation Department of Computer Science, The University of Electro-Communications
3rd Author's Name Rikio ONAI
3rd Author's Affiliation Department of Computer Science, Graduate School of Electro-Communications, The University of Electro-Communications:Department of Computer Science, The University of Electro-Communications
Date 2007-02-22
Paper # PRMU2006-209,HIP2006-102
Volume (vol) vol.106
Number (no) 540
Page pp.pp.-
#Pages 6
Date of Issue