Presentation | 2011-07-22 Estimation of vocal tract length ratio using auditory filterbank Erika OKAMOTO, Toshio IRINO, Ryuichi NISIMURA, Hideki KAWAHARA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Vocal tract length normalization (VTLN) is an important issue in speech applications, such as automatic speech recognition and high-quality voice morphing. Individual spectral differences are primarily dependent on vocal tract length differences.They are also dependent on glottal source signal and the shape of pyriform fossa. This paper propose a new method for vocal tract length (VTL) estimation and normalization based on a gammachirp auditory filterbank (GCFB). VTLratios were estimated based on spectral distances between the same sentence spoken by 2 speakers. The calculation was carried out for all permutations of 28 speakers (_<28>P_<27> =756). Then the estimated error was calculated by the regression analysis. VTL estimation using the mel-frequency filterbank (MFFB), which is a preprocessor for calculating MFCCs commonly used in ASR, the gammatone fileterbank(GCFB) and the gammachirp filterbank(GCFB). The results indicated that the proposed GCFB-based VTL estimation outperforms the MFCC-based and the GTFB-based methods in the objective evaluations. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | auditory filterbank / vocal tract length / frequency band / spectral distance |
Paper # | SP2011-43 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2011/7/14(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Estimation of vocal tract length ratio using auditory filterbank |
Sub Title (in English) | |
Keyword(1) | auditory filterbank |
Keyword(2) | vocal tract length |
Keyword(3) | frequency band |
Keyword(4) | spectral distance |
1st Author's Name | Erika OKAMOTO |
1st Author's Affiliation | Wakayama University() |
2nd Author's Name | Toshio IRINO |
2nd Author's Affiliation | Wakayama University |
3rd Author's Name | Ryuichi NISIMURA |
3rd Author's Affiliation | Wakayama University |
4th Author's Name | Hideki KAWAHARA |
4th Author's Affiliation | Wakayama University |
Date | 2011-07-22 |
Paper # | SP2011-43 |
Volume (vol) | vol.111 |
Number (no) | 153 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |