Presentation 2008-12-10
Isolated word recognition based on speech structures and discriminant analysis
Satoshi ASAKAWA, Yu QIAO, Nobuaki MINEMATSU, Keikichi HIROSE
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Non-linguistic factors of speech, such as vocal tract size and recording devices, easily change the acoustic features of speech. Recently, a new representation of speech that completely cancels these changes has been proposed. This representation discards the absolute properties of speech events and captures only the contrasts among them. Because a full set of contrasts among the events defines a unique geometrical structure, the proposal can be regarded as a structural representation. In this paper, the new representation is examined on two isolated word recognition tasks: a five-vowel-sequence word set and a phonetically balanced word set. Two problems, excessively strong invariance and excessively high dimensionality, are addressed by multiple stream structuralization and linear discriminant analysis. To compare the conventional method with the proposed one, frequency-warped utterances are also used for testing. The experimental results show the high robustness of the proposed method.
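The idea of keeping only contrasts between speech events can be illustrated with a small sketch. It is not the authors' implementation: it assumes each event is a one-dimensional Gaussian (mean, variance), uses the Bhattacharyya distance named in the keywords as the contrast measure, and stands in for a speaker or channel change with a hypothetical affine map x -> a*x + b instead of the frequency warping used in the paper. Under these assumptions the full matrix of pairwise distances, the "structure", is unchanged by the map.

```python
import math

def bhattacharyya_1d(mu1, var1, mu2, var2):
    """Bhattacharyya distance between two 1-D Gaussian distributions."""
    avg_var = 0.5 * (var1 + var2)
    mean_term = 0.125 * (mu1 - mu2) ** 2 / avg_var
    var_term = 0.5 * math.log(avg_var / math.sqrt(var1 * var2))
    return mean_term + var_term

def structure(events):
    """Matrix of pairwise distances between events given as (mean, variance)."""
    return [[bhattacharyya_1d(m1, v1, m2, v2) for (m2, v2) in events]
            for (m1, v1) in events]

def affine(events, a, b):
    """Apply x -> a*x + b to every event (illustrative stand-in for a
    non-linguistic change; an assumption, not the paper's warping)."""
    return [(a * m + b, a * a * v) for (m, v) in events]

# Three made-up events; the values carry no acoustic meaning.
events = [(0.0, 1.0), (2.0, 0.5), (-1.5, 2.0)]
original = structure(events)
warped = structure(affine(events, a=1.7, b=-3.0))

# Largest element-wise difference between the two distance matrices.
max_diff = max(abs(x - y)
               for row_o, row_w in zip(original, warped)
               for x, y in zip(row_o, row_w))
print(max_diff)  # expected to be zero up to floating-point rounding
```

The absolute means and variances change under the map, but every pairwise distance is preserved, which is the sense in which the structure discards absolute properties.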
Keyword(in Japanese) (See Japanese page)
Keyword(in English) structural representation of speech / non-linguistic features / Bhattacharyya distance / linear discriminant analysis / isolated word recognition
Paper # NLC2008-58,SP2008-113
Date of Issue

Conference Information
Committee NLC
Conference Date 2008/12/2 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Isolated word recognition based on speech structures and discriminant analysis
Sub Title (in English)
Keyword(1) structural representation of speech
Keyword(2) non-linguistic features
Keyword(3) Bhattacharyya distance
Keyword(4) linear discriminant analysis
Keyword(5) isolated word recognition
1st Author's Name Satoshi ASAKAWA
1st Author's Affiliation Grad. School of Frontier Sciences, Univ. of Tokyo (present office: Sony Corp.)
2nd Author's Name Yu QIAO
2nd Author's Affiliation Grad. School of Eng., Univ. of Tokyo
3rd Author's Name Nobuaki MINEMATSU
3rd Author's Affiliation Grad. School of Eng., Univ. of Tokyo
4th Author's Name Keikichi HIROSE
4th Author's Affiliation Grad. School of Info. Sci. and Tech., Univ. of Tokyo
Date 2008-12-10
Paper # NLC2008-58,SP2008-113
Volume (vol) vol.108
Number (no) 337
Page pp.-
#Pages 6
Date of Issue