Presentation 2017-12-21
[ポスター講演]LSTMを用いた音響信号からの擬音語生成
Shota Ikawa, Kunio Kashino,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Representing various acoustic events by natural language seems to play an important role in natural man-machine communication, retrieval of multimedia database, detection of abnormal sound. In this paper, we propose a new method to generate onomatopoeia from acoustic signals. In the conventional onomatopoeia generation method, a method of classifying an input signal into classes presumed in advance or subdividing it into segments corresponding to phonemes was studied, but it was difficult to deal with unknown sounds and to segment the signal in units of phonemes. The proposed method is based on the Sequence-to-Sequence framework, It automatically generates onomatopoeia in End-to-End. Experiment showed that the mean phoneme error rate (MPER) was 2.8% and the word error rate (WER) was 7.2%, indicating that it is possible to realize a lower error rate than the conventional method.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Acoustic signal processing / Environmental sound / Onomatopoeia / Sequence-to-Sequence
Paper # SP2017-58
Date of Issue 2017-12-14 (SP)

Conference Information
Committee NLC / IPSJ-NL / SP / IPSJ-SLP
Conference Date 2017/12/20(3days)
Place (in Japanese) (See Japanese page)
Place (in English) Waseda Univ. Green Computing Systems Research Organization
Topics (in Japanese) (See Japanese page)
Topics (in English) The 4th Natural Language Processing Symposium & The 19th Spoken Language Symposium
Chair Hiroshi Kanayama(IBM) / Kentaro Inui(Tohoku Univ.) / Yoichi Yamashita(Ritsumeikan Univ.) / Nobuaki Minematsu(Univ. Tokyo)
Vice Chair Takeshi Sakaki(Hottolink) / Kazutaka Shimada(Kyushu Inst. of Tech.) / / Hiroki Mori(Utsunomiya Univ.)
Secretary Takeshi Sakaki(Ryukoku Univ.) / Kazutaka Shimada(NTT) / (Osaka Univ.) / Hiroki Mori(Tokyo Inst. of Tech.) / (Mixi Co. Ltd.)
Assistant Mitsuo Yoshida(Toyohashi Univ. of Tech.) / Takeshi Kobayakawa(NICT) / / Kei Hashimoto(Nagoya Inst. of Tech.) / Satoshi Kobashikawa(NTT)

Paper Information
Registration To Technical Committee on Natural Language Understanding and Models of Communication / Special Interest Group on Natural Language / Technical Committee on Speech / Special Interest Group on Spoken Language Processing
Language JPN-ONLY
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English)
Sub Title (in English)
Keyword(1) Acoustic signal processing
Keyword(2) Environmental sound
Keyword(3) Onomatopoeia
Keyword(4) Sequence-to-Sequence
1st Author's Name Shota Ikawa
1st Author's Affiliation The University of Tokyo(Univ. Tokyo)
2nd Author's Name Kunio Kashino
2nd Author's Affiliation The University of Tokyo(Univ. Tokyo/NTT)
Date 2017-12-21
Paper # SP2017-58
Volume (vol) vol.117
Number (no) SP-368
Page pp.pp.17-20(SP),
#Pages 4
Date of Issue 2017-12-14 (SP)