アクセント句境界情報を利用したN-gram言語モデルの高精度化

寺尾 真; 峯松 信明; 広瀬 啓吉

Presentation	2001/12/13 Improvement of N-gram Language Models Using Accent Phrase Boundaries Makoto TERAO, Nobuaki MINEMATSU, Keikichi HIROSE,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	Current continuous speech recognition systems make much use of segmental features but little use of prosodic features. This paper proposes a novel method to integrate prosodic boundary information into N-gram-based language modeling. In this method, two types of language sub-models are built. One characterizes word transitions crossing accent phrase boundaries and the other not crossing the boundaries. To realize these two sub-models directly from a speech corpus, its size should be comparable to a text corpus used for N-gram model training. However, the preparation of such a large speech corpus is not realistic. To solve this problem, we focus upon transition of words in terms of their part-of-speech (POS), and differences in POS transition crossing and not crossing the boundaries are used to generate the two sub-models. Through experiments, the proposed model showed 11% perplexity reduction given the correct boundary position, and 8% reduction with the automatically extracted boundaries. Even when test speech samples were spoken by another speaker than the speaker used in characterizing the POS transitions, 6% reduction was observed.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	language model / prosody / accent phrase boundary / transition of part-of-speech / continuous speech recognition
Paper #	NLC2001-66,SP2001-101
Date of Issue

Conference Information
Committee	NLC
Conference Date	2001/12/13(1days)
Place (in Japanese)	(See Japanese page)
Place (in English)
Topics (in Japanese)	(See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Improvement of N-gram Language Models Using Accent Phrase Boundaries
Sub Title (in English)
Keyword(1)	language model
Keyword(2)	prosody
Keyword(3)	accent phrase boundary
Keyword(4)	transition of part-of-speech
Keyword(5)	continuous speech recognition
1st Author's Name	Makoto TERAO
1st Author's Affiliation	Graduate School of Engineering, University of Tokyo()
2nd Author's Name	Nobuaki MINEMATSU
2nd Author's Affiliation	Graduate School of Information Science and Technology, University of Tokyo
3rd Author's Name	Keikichi HIROSE
3rd Author's Affiliation	Graduate School of Frontier Sciences, University of Tokyo
Date	2001/12/13
Paper #	NLC2001-66,SP2001-101
Volume (vol)	vol.101
Number (no)	520
Page	pp.pp.-
#Pages	6
Date of Issue