Density Ratio Approachに基づく複数Encoder-Decoder音声認識モデル統合手法

Presentation	2022-11-29 Density Ratio Approach-based multiple Encoder-Decoder ASR model integration Keigo Hojo, Daiki Mori, Yukoh Wakabayashi, Atsunori Ogawa, Norihide Kitaoka,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	One of the methods to improve the performance of Encoder--Decoder speech recognition is the integration of an ASR models and a language model. Based on the Density Ratio Approach, we propose a method to build an ASR system by integrating multiple ASR models and combining them with an external language models. The proposed method enables speech recognition use a variety of acoustic information and linguistic information that has not been learned by the ASR models. Experimental results show that the proposed method is more accurate than conventional integration methods.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Multiple ASR model / Integration of acoustic information / Language model replacement / Density Ratio Approach
Paper #	NLC2022-10,SP2022-30
Date of Issue	2022-11-22 (NLC, SP)

Conference Information
Committee	NLC / IPSJ-NL / SP / IPSJ-SLP
Conference Date	2022/11/29(3days)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair	Mitsuo Yoshida(Univ. of Tsukuba) / 須藤克仁(奈良先端科学技術大学院大学) / Tomoki Toda(Nagoya Univ.) / 戸田智基(名古屋大学)
Vice Chair	Hiroki Sakaji(Univ. of Tokyo) / Takeshi Kobayakawa(NHK)
Secretary	Hiroki Sakaji(NTT) / Takeshi Kobayakawa(Hiroshima Univ. of Economics) / (株式会社デンソーアイティーラボラトリ) / (北海学園大学) / (東京農工大学)
Assistant	Kanjin Takahashi(Sansan) / Yasuhiro Ogawa(Nagoya Univ.) / / Ryo Aihara(Mitsubishi Electric) / Daisuke Saito(Univ. of Tokyo)

Paper Information
Registration To	Technical Committee on Natural Language Understanding and Models of Communication / Special Interest Group on Natural Language / Technical Committee on Speech / Special Interest Group on Spoken Language Processing
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Density Ratio Approach-based multiple Encoder-Decoder ASR model integration
Sub Title (in English)
Keyword(1)	Multiple ASR model
Keyword(2)	Integration of acoustic information
Keyword(3)	Language model replacement
Keyword(4)	Density Ratio Approach
1st Author's Name	Keigo Hojo
1st Author's Affiliation	Toyohashi University of Technology(TUT)
2nd Author's Name	Daiki Mori
2nd Author's Affiliation	Toyohashi University of Technology(TUT)
3rd Author's Name	Yukoh Wakabayashi
3rd Author's Affiliation	Toyohashi University of Technology(TUT)
4th Author's Name	Atsunori Ogawa
4th Author's Affiliation	NIPPON TELEGRAPH AND TELEPHONE CORPORATION(NTT)
5th Author's Name	Norihide Kitaoka
5th Author's Affiliation	Toyohashi University of Technology(TUT)
Date	2022-11-29
Paper #	NLC2022-10,SP2022-30
Volume (vol)	vol.122
Number (no)	NLC-287,SP-288
Page	pp.pp.5-9(NLC), pp.5-9(SP),
#Pages	5
Date of Issue	2022-11-22 (NLC, SP)