Presentation 2006/12/14
Making Use of Wavelet Transform in Template Matching for Phoneme Analyses of Japanese Voice
Shinji KARASAWA, Hiroshi SAKURABA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Speaker dependent voice recognition performance was achieved with template matching (TM). In order to give a margin to TM, sums of absolute value of wavelet transform coefficients in each scale (SWLC's) are used for vector quantization. Japanese moras are recognized under the condition of 204.8msec (1024 pieces of data those are sampled every 0.2msec) as a unit of processing. As for a segmentation, the ratio of SWLC (the 0.8msec band)/SWLC (the 1.6msec band) and SWLC (the 1.6msec band)/SWLC (the 3.2msec band) become a node in t he transition region of vowel [a,i,u,e,o]. Vowels uttered the same speaker were recognized by TM with 15 piece of WLC's in low resolution (scale is over 0.8msec) where the segmentation of processing is shorter then the pitch in order to make adaptable to the valid speech sound. Here, the data were sampled at each 0,1mces and 64 pieces of data were picked up from each peak of voice and the set of data are transferred to Haar's discrete wavelet coefficients (WLC's).
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Haar's discrete wavelet transform / Template matching / Speaker dependent phoneme recognition / Japanese mora
Paper # NLC2006-42,SP2006-98
Date of Issue

Conference Information
Committee NLC
Conference Date 2006/12/14(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Making Use of Wavelet Transform in Template Matching for Phoneme Analyses of Japanese Voice
Sub Title (in English)
Keyword(1) Haar's discrete wavelet transform
Keyword(2) Template matching
Keyword(3) Speaker dependent phoneme recognition
Keyword(4) Japanese mora
1st Author's Name Shinji KARASAWA
1st Author's Affiliation Miyagi National College of Technology()
2nd Author's Name Hiroshi SAKURABA
2nd Author's Affiliation Miyagi National College of Technology
Date 2006/12/14
Paper # NLC2006-42,SP2006-98
Volume (vol) vol.106
Number (no) 441
Page pp.pp.-
#Pages 6
Date of Issue