Presentation | 2007-02-22 A Proposal for Speech/Music/Noise Discrimination Using Image Features of Sonagram Nao TAKAYANAGI, Takahiro HAYASHI, Rikio ONAI, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In this paper, we propose a speech/music/noise discrimination technique using image features of sonagram (a 2-dimension image, the horizontal axis of which represents time, the vertical axis of which represents frequency, and the intensity of each point in which represents power of a particular frequency at a particular time). In the proposal technique, we focus on a difference in the sonagram between each audio section. In case of speech, gentle curves are drawn on the sonagram because the peak frequency of speech changes gently. In case of music, horizontal lines are drawn on the sonagram because the peak frequency of music is stable. And in case of noise, random points are drawn on the sonagram because the peak frequency of noise is unstable. Therefore, we calculate the 2-dimension frequency of the sonagram and discriminate three kinds of audio sections using discriminant analysis. We evaluated the proposal technique by precision and error rate of discrimination. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Speech/Music/Noise Discrimination / Sonagram |
Paper # | PRMU2006-209,HIP2006-102 |
Date of Issue |
Conference Information | |
Committee | PRMU |
---|---|
Conference Date | 2007/2/15(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Pattern Recognition and Media Understanding (PRMU) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | A Proposal for Speech/Music/Noise Discrimination Using Image Features of Sonagram |
Sub Title (in English) | |
Keyword(1) | Speech/Music/Noise Discrimination |
Keyword(2) | Sonagram |
1st Author's Name | Nao TAKAYANAGI |
1st Author's Affiliation | Department of Computer Science, Graduate School of Electro-Communications, The University of Electro-Communications() |
2nd Author's Name | Takahiro HAYASHI |
2nd Author's Affiliation | Department of Computer Science, The University of Electro-Communications |
3rd Author's Name | Rikio ONAI |
3rd Author's Affiliation | Department of Computer Science, Graduate School of Electro-Communications, The University of Electro-Communications:Department of Computer Science, The University of Electro-Communications |
Date | 2007-02-22 |
Paper # | PRMU2006-209,HIP2006-102 |
Volume (vol) | vol.106 |
Number (no) | 538 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |