Presentation | 2022-11-29 Density Ratio Approach-based multiple Encoder-Decoder ASR model integration Keigo Hojo, Daiki Mori, Yukoh Wakabayashi, Atsunori Ogawa, Norihide Kitaoka |
---|---|
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | One method for improving the performance of Encoder-Decoder speech recognition is to integrate an ASR model with a language model. Based on the Density Ratio Approach, we propose a method for building an ASR system that integrates multiple ASR models and combines them with an external language model. The proposed method enables speech recognition that uses a variety of acoustic information together with linguistic information that the ASR models have not learned. Experimental results show that the proposed method is more accurate than conventional integration methods. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Multiple ASR model / Integration of acoustic information / Language model replacement / Density Ratio Approach |
Paper # | NLC2022-10,SP2022-30 |
Date of Issue | 2022-11-22 (NLC, SP) |
Conference Information | |
Committee | NLC / IPSJ-NL / SP / IPSJ-SLP |
---|---|
Conference Date | 2022/11/29(3days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | Mitsuo Yoshida(Univ. of Tsukuba) / Katsuhito Sudoh(Nara Institute of Science and Technology) / Tomoki Toda(Nagoya Univ.) / Tomoki Toda(Nagoya Univ.) |
Vice Chair | Hiroki Sakaji(Univ. of Tokyo) / Takeshi Kobayakawa(NHK) |
Secretary | Hiroki Sakaji(NTT) / Takeshi Kobayakawa(Hiroshima Univ. of Economics) / (Denso IT Laboratory, Inc.) / (Hokkai-Gakuen Univ.) / (Tokyo Univ. of Agriculture and Technology) |
Assistant | Kanjin Takahashi(Sansan) / Yasuhiro Ogawa(Nagoya Univ.) / / Ryo Aihara(Mitsubishi Electric) / Daisuke Saito(Univ. of Tokyo) |
Paper Information | |
Registration To | Technical Committee on Natural Language Understanding and Models of Communication / Special Interest Group on Natural Language / Technical Committee on Speech / Special Interest Group on Spoken Language Processing |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Density Ratio Approach-based multiple Encoder-Decoder ASR model integration |
Sub Title (in English) | |
Keyword(1) | Multiple ASR model |
Keyword(2) | Integration of acoustic information |
Keyword(3) | Language model replacement |
Keyword(4) | Density Ratio Approach |
1st Author's Name | Keigo Hojo |
1st Author's Affiliation | Toyohashi University of Technology(TUT) |
2nd Author's Name | Daiki Mori |
2nd Author's Affiliation | Toyohashi University of Technology(TUT) |
3rd Author's Name | Yukoh Wakabayashi |
3rd Author's Affiliation | Toyohashi University of Technology(TUT) |
4th Author's Name | Atsunori Ogawa |
4th Author's Affiliation | NIPPON TELEGRAPH AND TELEPHONE CORPORATION(NTT) |
5th Author's Name | Norihide Kitaoka |
5th Author's Affiliation | Toyohashi University of Technology(TUT) |
Date | 2022-11-29 |
Paper # | NLC2022-10,SP2022-30 |
Volume (vol) | vol.122 |
Number (no) | NLC-287,SP-288 |
Page | pp.5-9(NLC), pp.5-9(SP) |
#Pages | 5 |
Date of Issue | 2022-11-22 (NLC, SP) |