Presentation 2023-03-01
Anomalous sound detection with complex-valued hybrid neural networks considering phase variations
Shota Nishiyama, Akira Tamamori,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Anomalous sound detection is the task of identifying whether an incoming mechanical sound is normal or anomalous. Since anomalous sounds occur infrequently and are highly diverse, it is treated as a problem of detecting anomalous sounds from normal sounds only. The acoustic features used as input to most anomalous sound detection models are mel-spectrogram. However, the phase variation is lost when the complex-spectrogram obtained by Fourier transforming the sound waveform is converted to the mel-spectrogram. In this study, we compare anomalous sound detection methods using complex-valued neural networks and real-valued neural networks to demonstrate the usefulness of phase variation. As a result of the comparison, there existed machine sounds for which phase variation was valuable and machine sounds for which it was not valuable. In this study, we propose a complex-valued hybrid neural network that combines a complex-valued module that preserves the structure of complex values and a real-valued module that takes mel-spectrogram as input for all feature extraction operations in which complex-spectrogram can be input in order to take phase variation into account. We propose a complex-valued hybrid neural network that combines a complex-valued structure-preserving module and a real-valued module that takes the mel-spectrogram as input for all feature extraction operations. Experiments verified the effectiveness of the proposed method on anomalous sound detection for multi-channel sound in the ToyADMOS dataset. Experimental results showed that the proposed method improved the average AUC of all machine sounds by around 3% compared to both complex-valued and real-valued neural networks.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Anomalous sound detection / complex-valued neural networks / phase variations
Paper # EA2022-106,SIP2022-150,SP2022-70
Date of Issue 2023-02-21 (EA, SIP, SP)

Conference Information
Committee SP / IPSJ-SLP / EA / SIP
Conference Date 2023/2/28(2days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Tomoki Toda(Nagoya Univ.) / Tomoki Toda(Nagoya Univ.) / Kenichi Furuya(Oita Univ.) / Toshihisa Tanaka(Tokyo Univ. Agri.&Tech.)
Vice Chair / / Tatsuya Kako(NTT) / Junki Ono(Tokyo Metropolitan Univ.) / Koichi Ichige(Yokohama National Univ.) / Takayuki Nakachi(Ryukyu Univ.)
Secretary (NTT) / (Univ. of Electro-Comm.) / Tatsuya Kako(NTT) / Junki Ono(Univ. of Electro-Comm.) / Koichi Ichige(NTT) / Takayuki Nakachi(RitsumeikanUniv.)
Assistant Ryo Aihara(Mitsubishi Electric) / Daisuke Saito(Univ. of Tokyo) / Ryo Aihara(Mitsubishi Electric) / Daisuke Saito(Univ. of Tokyo) / Masato Nakayama(Osaka Sangyo Univ.) / Kouhei Yatabe(Tuat) / Taichi Yoshida(UEC) / Shoko Imaizumi(Chiba Univ.)

Paper Information
Registration To Technical Committee on Speech / Special Interest Group on Spoken Language Processing / Technical Committee on Engineering Acoustics / Technical Committee on Signal Processing
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Anomalous sound detection with complex-valued hybrid neural networks considering phase variations
Sub Title (in English)
Keyword(1) Anomalous sound detection
Keyword(2) complex-valued neural networks
Keyword(3) phase variations
1st Author's Name Shota Nishiyama
1st Author's Affiliation Aichi Institute of Technology(AIT)
2nd Author's Name Akira Tamamori
2nd Author's Affiliation Aichi Institute of Technology(AIT)
Date 2023-03-01
Paper # EA2022-106,SIP2022-150,SP2022-70
Volume (vol) vol.122
Number (no) EA-387,SIP-388,SP-389
Page pp.pp.185-190(EA), pp.185-190(SIP), pp.185-190(SP),
#Pages 6
Date of Issue 2023-02-21 (EA, SIP, SP)