Presentation 2011-07-22
Estimation of vocal tract length ratio using auditory filterbank
Erika OKAMOTO, Toshio IRINO, Ryuichi NISIMURA, Hideki KAWAHARA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Vocal tract length normalization (VTLN) is an important issue in speech applications, such as automatic speech recognition and high-quality voice morphing. Individual spectral differences are primarily dependent on vocal tract length differences.They are also dependent on glottal source signal and the shape of pyriform fossa. This paper propose a new method for vocal tract length (VTL) estimation and normalization based on a gammachirp auditory filterbank (GCFB). VTLratios were estimated based on spectral distances between the same sentence spoken by 2 speakers. The calculation was carried out for all permutations of 28 speakers (_<28>P_<27> =756). Then the estimated error was calculated by the regression analysis. VTL estimation using the mel-frequency filterbank (MFFB), which is a preprocessor for calculating MFCCs commonly used in ASR, the gammatone fileterbank(GCFB) and the gammachirp filterbank(GCFB). The results indicated that the proposed GCFB-based VTL estimation outperforms the MFCC-based and the GTFB-based methods in the objective evaluations.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) auditory filterbank / vocal tract length / frequency band / spectral distance
Paper # SP2011-43
Date of Issue

Conference Information
Committee SP
Conference Date 2011/7/14(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Speech (SP)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Estimation of vocal tract length ratio using auditory filterbank
Sub Title (in English)
Keyword(1) auditory filterbank
Keyword(2) vocal tract length
Keyword(3) frequency band
Keyword(4) spectral distance
1st Author's Name Erika OKAMOTO
1st Author's Affiliation Wakayama University()
2nd Author's Name Toshio IRINO
2nd Author's Affiliation Wakayama University
3rd Author's Name Ryuichi NISIMURA
3rd Author's Affiliation Wakayama University
4th Author's Name Hideki KAWAHARA
4th Author's Affiliation Wakayama University
Date 2011-07-22
Paper # SP2011-43
Volume (vol) vol.111
Number (no) 153
Page pp.pp.-
#Pages 6
Date of Issue