Presentation | 2008-12-10 Isolated word recognition based on speech structures and discriminant analysis Satoshi ASAKAWA, Yu QIAO, Nobuaki MINEMATSU, Keikichi HIROSE |
---|---|
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Non-linguistic factors such as vocal tract size and recording device easily change the acoustic features of speech. Recently, a new representation of speech that completely cancels these changes has been proposed. This representation discards the absolute properties of speech events and captures only the contrasts among them. Because the full set of contrasts among the events defines a unique geometrical structure, the proposal can be regarded as a structural representation. In this paper, the new representation is examined on two isolated word recognition tasks: a five-vowel-sequence word set and a phonetically balanced word set. Two problems, excessively strong invariance and excessively high dimensionality, are addressed by multiple stream structuralization and linear discriminant analysis. To compare the conventional method with the proposed one, frequency-warped utterances are also used for testing. The experimental results show the high robustness of the proposed method. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | structural representation of speech / non-linguistic features / Bhattacharyya distance / linear discriminant analysis / isolated word recognition |
Paper # | NLC2008-58,SP2008-113 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2008/12/2 (1 day) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Isolated word recognition based on speech structures and discriminant analysis |
Sub Title (in English) | |
Keyword(1) | structural representation of speech |
Keyword(2) | non-linguistic features |
Keyword(3) | Bhattacharyya distance |
Keyword(4) | linear discriminant analysis |
Keyword(5) | isolated word recognition |
1st Author's Name | Satoshi ASAKAWA |
1st Author's Affiliation | Grad. School of Frontier Sciences, Univ. of Tokyo (present office: Sony Corp.) |
2nd Author's Name | Yu QIAO |
2nd Author's Affiliation | Grad. School of Eng., Univ. of Tokyo |
3rd Author's Name | Nobuaki MINEMATSU |
3rd Author's Affiliation | Grad. School of Eng., Univ. of Tokyo |
4th Author's Name | Keikichi HIROSE |
4th Author's Affiliation | Grad. School of Info. Sci. and Tech., Univ. of Tokyo |
Date | 2008-12-10 |
Paper # | NLC2008-58,SP2008-113 |
Volume (vol) | vol.108 |
Number (no) | 337 |
Page | pp.- |
#Pages | 6 |
Date of Issue |
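
The abstract's idea of keeping only the contrasts among speech events can be illustrated with a small sketch. It is not the authors' implementation: it assumes that each speech event is modeled as a Gaussian distribution, that the contrast between two events is their Bhattacharyya distance (the keyword listed above), and that a non-linguistic change acts as an invertible affine map on the feature space. The names `bhattacharyya`, `structure`, and `transform`, and the random test data, are hypothetical. Under these assumptions the matrix of pairwise distances stays the same after the transformation, which is the invariance the abstract describes.

```python
import numpy as np


def bhattacharyya(mu1, cov1, mu2, cov2):
    """Bhattacharyya distance between two Gaussian distributions."""
    cov = (cov1 + cov2) / 2.0
    diff = mu1 - mu2
    # Mean term: (1/8) * diff^T * cov^{-1} * diff
    term_mean = diff @ np.linalg.solve(cov, diff) / 8.0
    # Covariance term: (1/2) * ln(det(cov) / sqrt(det(cov1) * det(cov2)))
    _, logdet = np.linalg.slogdet(cov)
    _, logdet1 = np.linalg.slogdet(cov1)
    _, logdet2 = np.linalg.slogdet(cov2)
    term_cov = 0.5 * (logdet - 0.5 * (logdet1 + logdet2))
    return term_mean + term_cov


def structure(events):
    """Matrix of pairwise distances: the 'structure' of an utterance."""
    n = len(events)
    dist = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            dist[i, j] = dist[j, i] = bhattacharyya(*events[i], *events[j])
    return dist


def transform(events, a, b):
    """Apply the affine map x -> a @ x + b to every Gaussian event."""
    return [(a @ mu + b, a @ cov @ a.T) for mu, cov in events]


# Hypothetical data: five Gaussian events in a 3-dimensional feature space.
rng = np.random.default_rng(0)
dim = 3
events = []
for _ in range(5):
    m = rng.normal(size=(dim, dim))
    events.append((rng.normal(size=dim), m @ m.T + np.eye(dim)))

# Assumed non-linguistic change: a random invertible affine map.
a = rng.normal(size=(dim, dim)) + 3.0 * np.eye(dim)
b = rng.normal(size=dim)

original = structure(events)
warped = structure(transform(events, a, b))
max_diff = float(np.max(np.abs(original - warped)))
print("largest change in any pairwise distance:", max_diff)
```

The absolute means and covariances of the events change under the map, but the distance matrix does not, up to floating-point error. The same property also explains the "too strong invariance" problem named in the abstract: any two utterances related by such a map become indistinguishable, which the paper addresses with multiple stream structuralization.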