Presentation 2001/12/13
Improvement of N-gram Language Models Using Accent Phrase Boundaries
Makoto TERAO, Nobuaki MINEMATSU, Keikichi HIROSE
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Current continuous speech recognition systems rely heavily on segmental features but make little use of prosodic features. This paper proposes a method for integrating prosodic boundary information into N-gram-based language modeling. Two language sub-models are built: one characterizes word transitions that cross accent phrase boundaries, and the other characterizes transitions that do not. Estimating these two sub-models directly from a speech corpus would require a corpus comparable in size to the text corpora used for N-gram model training, and preparing such a large speech corpus is not realistic. To solve this problem, we focus on word transitions in terms of part-of-speech (POS), and use the differences between POS transitions that cross the boundaries and those that do not to generate the two sub-models. In experiments, the proposed model reduced perplexity by 11% when given the correct boundary positions and by 8% with automatically extracted boundaries. Even when the test speech came from a speaker other than the one used to characterize the POS transitions, a 6% reduction was observed.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) language model / prosody / accent phrase boundary / transition of part-of-speech / continuous speech recognition
Paper # NLC2001-66,SP2001-101
Date of Issue
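
The two-sub-model idea in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the exact formulation used in the paper, so everything below is an assumption made for illustration: a word bigram P(w2 | w1) is reweighted by a POS-pair boundary probability P(boundary | pos1, pos2) and renormalized, and the names boundary_submodels, perplexity, pos_of, and p_boundary_given_pos are hypothetical.

```python
import math


def boundary_submodels(bigram, pos_of, p_boundary_given_pos, default=0.5):
    """Split a word bigram into crossing / non-crossing sub-models.

    bigram: {w1: {w2: P(w2 | w1)}}
    pos_of: {word: POS tag}
    p_boundary_given_pos: {(pos1, pos2): P(boundary | pos1, pos2)}
    ASSUMPTION (not from the paper): P(boundary | w1, w2) is approximated
    by the POS-pair value, and each sub-model is renormalized per history.
    """
    cross, no_cross = {}, {}
    for w1, nexts in bigram.items():
        c, n = {}, {}
        for w2, p in nexts.items():
            pb = p_boundary_given_pos.get((pos_of[w1], pos_of[w2]), default)
            c[w2] = p * pb            # weight toward boundary-crossing pairs
            n[w2] = p * (1.0 - pb)    # weight toward non-crossing pairs
        zc, zn = sum(c.values()), sum(n.values())
        # Fall back to the plain bigram if a sub-model has no mass.
        cross[w1] = {w: v / zc for w, v in c.items()} if zc > 0 else dict(nexts)
        no_cross[w1] = {w: v / zn for w, v in n.items()} if zn > 0 else dict(nexts)
    return cross, no_cross


def perplexity(words, boundaries, cross, no_cross):
    """Perplexity of a word sequence under the boundary-switched model.

    boundaries[i] is True when an accent phrase boundary lies between
    words[i] and words[i + 1].
    """
    log_sum = 0.0
    for i in range(len(words) - 1):
        model = cross if boundaries[i] else no_cross
        log_sum += math.log2(model[words[i]][words[i + 1]])
    return 2.0 ** (-log_sum / (len(words) - 1))


# Toy data, invented purely for illustration.
bigram = {"a": {"b": 0.5, "c": 0.5}}
pos_of = {"a": "N", "b": "P", "c": "N"}
p_boundary_given_pos = {("N", "P"): 0.1, ("N", "N"): 0.9}

cross, no_cross = boundary_submodels(bigram, pos_of, p_boundary_given_pos)
ppl = perplexity(["a", "b"], [False], cross, no_cross)
```

In this toy setup, a noun-to-particle transition rarely crosses a boundary, so the non-crossing sub-model assigns it more probability than the plain bigram does. That is the kind of effect that can lower perplexity when boundary positions are known.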

Conference Information
Committee NLC
Conference Date 2001/12/13 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Improvement of N-gram Language Models Using Accent Phrase Boundaries
Sub Title (in English)
Keyword(1) language model
Keyword(2) prosody
Keyword(3) accent phrase boundary
Keyword(4) transition of part-of-speech
Keyword(5) continuous speech recognition
1st Author's Name Makoto TERAO
1st Author's Affiliation Graduate School of Engineering, University of Tokyo
2nd Author's Name Nobuaki MINEMATSU
2nd Author's Affiliation Graduate School of Information Science and Technology, University of Tokyo
3rd Author's Name Keikichi HIROSE
3rd Author's Affiliation Graduate School of Frontier Sciences, University of Tokyo
Date 2001/12/13
Paper # NLC2001-66,SP2001-101
Volume (vol) vol.101
Number (no) 520
Page pp.-
#Pages 6
Date of Issue