Presentation | 2019-03-15 [Poster Presentation] A Design of Reduced Phoneme Set Based on a Language Model Shuji Komeiji, Toshihisa Tanaka, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | A design of reduced phoneme set based on a language model is proposed. The reduction of the phoneme set improves discriminability of phonemes under the condition where the amount of training data is too small to train each phoneme model. On the other hand, it increases homophones that yield degradation of speech recognition. In the proposed approach, it is possible to reduce phonemes preventing degradation, regarding pronunciation/word sequence confusion rate calculated from N-grams in a language model. In an experiment, the phoneme set designed with proposed approach was applied to Japanese large vocabulary speech recognition system. The word error rate with the 10 phonemes set was 14.6%, while the error rate with full 39 phonemes set was 17.7%. The degradation was able to be suppressed in about 3%. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Automatic speech recognition / Brain machine interface / Phoneme set / Language model / N-gram |
Paper # | EA2018-134,SIP2018-140,SP2018-96 |
Date of Issue | 2019-03-07 (EA, SIP, SP) |
Conference Information | |
Committee | EA / SIP / SP |
---|---|
Conference Date | 2019/3/14(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | i+Land nagasaki (Nagasaki-shi) |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | Engineering/Electro Acoustics, Signal Processing, Speech, and Related Topics |
Chair | Suehiro Shimauchi(Kanazawa Inst. of Tech.) / Shogo Muramatsu(Niigata Univ.) / Yoichi Yamashita(Ritsumeikan Univ.) |
Vice Chair | Kenichi Furuya(Oita Univ.) / Kanji Watanabe(Akita Pref. Univ.) / Naoyuki Aikawa(TUS) / Kazunori Hayashi(Osaka City Univ) / Akinobu Ri(Nagoya Inst. of Tech.) |
Secretary | Kenichi Furuya(Shizuoka Inst. of Science and Tech.) / Kanji Watanabe(NHK) / Naoyuki Aikawa(Takushoku Univ.) / Kazunori Hayashi(Hiroshima Univ.) / Akinobu Ri(Kyoto Univ.) |
Assistant | Keisuke Imoto(Ritsumeikan Univ.) / Daisuke Morikawa(Toyama Pref Univ.) / Katsumi Konishi(Hosei Univ.) / hyihsin(Takushoku Univ.) / Tomoki Koriyama(Tokyo Inst. of Tech.) / Satoshi Kobashikawa(NTT) |
Paper Information | |
Registration To | Technical Committee on Engineering Acoustics / Technical Committee on Signal Processing / Technical Committee on Speech |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | [Poster Presentation] A Design of Reduced Phoneme Set Based on a Language Model |
Sub Title (in English) | |
Keyword(1) | Automatic speech recognition |
Keyword(2) | Brain machine interface |
Keyword(3) | Phoneme set |
Keyword(4) | Language model |
Keyword(5) | N-gram |
1st Author's Name | Shuji Komeiji |
1st Author's Affiliation | Tokyo University of Agriculture and Technology(Tokyo Univ. of Agriculture and Tech.) |
2nd Author's Name | Toshihisa Tanaka |
2nd Author's Affiliation | Tokyo University of Agriculture and Technology(Tokyo Univ. of Agriculture and Tech.) |
Date | 2019-03-15 |
Paper # | EA2018-134,SIP2018-140,SP2018-96 |
Volume (vol) | vol.118 |
Number (no) | EA-495,SIP-496,SP-497 |
Page | pp.pp.205-210(EA), pp.205-210(SIP), pp.205-210(SP), |
#Pages | 6 |
Date of Issue | 2019-03-07 (EA, SIP, SP) |