Presentation | 2002/12/13. Recognition of Spontaneous Speech by Using a General-Purpose LVCSR with 0-gram and Distinctive Phonetic Features. Shingo ISEJI, Takashi FUKUDA, Kouichi KATSURADA, Tsuneo NITTA |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | This paper describes an attempt to recognize spontaneously spoken dialogue using general-purpose LVCSR software. In the proposed method, a phoneme string output by the LVCSR is converted into a sequence of distinctive phonetic feature vectors, and keywords assigned by a dialogue manager are then detected in this vector sequence. The method takes advantage of two potential abilities: (1) precise phoneme discrimination, achieved by relaxing the linguistic constraint in the LVCSR, and (2) coping with substitution, deletion, and insertion errors by combining the conversion of each phoneme into a distinctive phonetic feature vector with a keyword spotting process. In an experiment on a spoken dialogue corpus of a map guidance task, the proposed method shows significant improvements over the LVCSR software. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Spoken Dialogue / LVCSR / Keyword Spotting / Language Model / Sub-word Model / Distinctive Phonetic Feature / Confusion Matrix |
Paper # | NLC2002-79 |
Date of Issue |
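
The pipeline outlined in the abstract (phoneme string, then distinctive phonetic feature vectors, then keyword spotting that tolerates substitution, deletion, and insertion errors) can be illustrated with a minimal sketch. This is not the authors' implementation: the feature table `DPF`, the helper names `to_vectors`, `sub_cost`, and `spot`, and the insertion/deletion cost are all assumptions chosen for illustration. The sketch uses a standard approximate-substring dynamic program in which the substitution cost is the fraction of features that differ, so that confusing /k/ with /g/ costs much less than confusing /k/ with a vowel.

```python
# Toy distinctive-phonetic-feature (DPF) table. The feature set and the values
# are illustrative assumptions, not the inventory used in the paper.
# Feature order: (vocalic, consonantal, nasal, voiced, labial, high, back)
DPF = {
    "a": (1, 0, 0, 1, 0, 0, 1),
    "i": (1, 0, 0, 1, 0, 1, 0),
    "e": (1, 0, 0, 1, 0, 0, 0),
    "o": (1, 0, 0, 1, 1, 0, 1),
    "k": (0, 1, 0, 0, 0, 1, 1),
    "g": (0, 1, 0, 1, 0, 1, 1),
    "m": (0, 1, 1, 1, 1, 0, 0),
    "n": (0, 1, 1, 1, 0, 0, 0),
}


def to_vectors(phonemes):
    """Convert a phoneme string (list of symbols) into DPF vectors."""
    return [DPF[p] for p in phonemes]


def sub_cost(u, v):
    """Fraction of features that differ (0.0 means identical vectors)."""
    return sum(a != b for a, b in zip(u, v)) / len(u)


def spot(keyword, utterance, indel=1.0):
    """Find the best approximate occurrence of keyword inside utterance.

    Returns (best_cost, end_index), where end_index counts utterance
    phonemes consumed when the best match ends. `indel` is a hypothetical
    flat cost for one inserted or deleted phoneme.
    """
    kw, utt = to_vectors(keyword), to_vectors(utterance)
    # prev[i]: cost of aligning the first i keyword phonemes so that the
    # match ends at the previous utterance position.
    prev = [i * indel for i in range(len(kw) + 1)]
    best = (prev[-1], 0)
    for j, x in enumerate(utt, start=1):
        cur = [0.0]  # the keyword may start at any utterance position
        for i, k in enumerate(kw, start=1):
            cur.append(min(
                prev[i - 1] + sub_cost(k, x),  # match or substitution
                prev[i] + indel,               # extra phoneme in utterance
                cur[i - 1] + indel,            # keyword phoneme missing
            ))
        if cur[-1] < best[0]:
            best = (cur[-1], j)
        prev = cur
    return best


if __name__ == "__main__":
    # /k/ misrecognized as /g/: the keyword is still found at low cost.
    print(spot(["k", "a", "m", "i"], ["o", "g", "a", "m", "i", "e"]))
```

A dialogue manager could, under these assumptions, accept a keyword when `best_cost` falls below a threshold tuned on held-out data. The paper's keywords also list a confusion matrix, which suggests that a cost derived from observed recognition errors may be a better fit than the plain feature-mismatch count used here.
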
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2002/12/13 (1 day) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Recognition of Spontaneous Speech by Using a General-Purpose LVCSR with 0-gram and Distinctive Phonetic Features |
Sub Title (in English) | |
Keyword(1) | Spoken Dialogue |
Keyword(2) | LVCSR |
Keyword(3) | Keyword Spotting |
Keyword(4) | Language Model |
Keyword(5) | Sub-word Model |
Keyword(6) | Distinctive Phonetic Feature |
Keyword(7) | Confusion Matrix |
1st Author's Name | Shingo ISEJI |
1st Author's Affiliation | Graduate School of Engineering, Toyohashi University of Technology |
2nd Author's Name | Takashi FUKUDA |
2nd Author's Affiliation | Graduate School of Engineering, Toyohashi University of Technology |
3rd Author's Name | Kouichi KATSURADA |
3rd Author's Affiliation | Graduate School of Engineering, Toyohashi University of Technology |
4th Author's Name | Tsuneo NITTA |
4th Author's Affiliation | Graduate School of Engineering, Toyohashi University of Technology |
Date | 2002/12/13 |
Paper # | NLC2002-79 |
Volume (vol) | vol.102 |
Number (no) | 528 |
Page | pp.- |
#Pages | 6 |
Date of Issue |