Presentation 2007/12/13
Automatic Speech Recognition of Congressional Speech Based on Statistical Style Transformation of Language Model and Pronunciation Model
Yuya AKITA, Tatsuya KAWAHARA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) For automatic speech recognition (ASR) of spontaneous speech such as congressional meetings, we have been proposing statistical transformation methods of language model and pronunciation model. In these methods, differences between faithful transcripts and orthographical transcripts are statistically extracted. Then, transformation models which consist of probabilistic transformation patterns are derived from the statistics for language model and pronunciation model. For language model, the transformation model predicts spoken-style N-gram entries with estimated occurrence counts. For pronunciation model, pronunciation variants and their probabilities are predicted by the transformation model. The language model and pronunciation model generated by the proposed methods were evaluated on ASR of committee meetings of Japanese National Congress (Diet),and realized absolute reduction of word error rates by 0.6-0.7% and 1.0%, respectively, compared with models produced by conventional methods. Finally, total reduction of 1.7% was obtained by combining both models.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Spontaneous speech / Speech recognition / Language model / Lexicon / Speaking style
Paper # NLC2007-43,SP2007-106
Date of Issue

Conference Information
Committee NLC
Conference Date 2007/12/13(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Automatic Speech Recognition of Congressional Speech Based on Statistical Style Transformation of Language Model and Pronunciation Model
Sub Title (in English)
Keyword(1) Spontaneous speech
Keyword(2) Speech recognition
Keyword(3) Language model
Keyword(4) Lexicon
Keyword(5) Speaking style
1st Author's Name Yuya AKITA
1st Author's Affiliation Academic Center for Computing and Media Studies, Kyoto University()
2nd Author's Name Tatsuya KAWAHARA
2nd Author's Affiliation Academic Center for Computing and Media Studies, Kyoto University
Date 2007/12/13
Paper # NLC2007-43,SP2007-106
Volume (vol) vol.107
Number (no) 405
Page pp.pp.-
#Pages 6
Date of Issue