Presentation 2015-06-18
Noise-robust Prediction of Pronunciation Distances Aiming at Clustering of World Englishes Using a Learner's Self-centered Viewpoint
Yuichi Sato, Yosuke Kashiwagi, Shun Kasahara, Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In recent years,we have more and more international tourists and in 2020, we have Tokyo Olympic Games. For communicating with those tourists, the default language is English but they speak English with various accents. To realize smooth communication with these tourists, we are developing a technical infrastructure to accustom Japanese people to variously accented Englishes (World Englishes). The infrastructure aims at clustering a large diversity of English pronunciations on an individual basis and visualizing the diversity in an educationally effective way. For clustering, a technique is needed that can predict the accent gap between any speaker pair and we developed it by integrating pronunciation structure analysis and support vector regression. In this paper, the prediction performance is evaluated when the prediction technique is applied for visualization using a user's self-centered viewpoint and when it is applied with a noise suppression technique. Results show that the performance is comparable to that observed when we use phonemic, not phonetic, transcripts and that 10 [dB] is enough as SNR to guarantee the prediction performance realized in a clean condition.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) World Englishes / pronunciation clustering / structural representation / support vector regression / self-centered visualization / noise suppression / DNN
Paper # PRMU2015-45,SP2015-14,WIT2015-14
Date of Issue 2015-06-11 (PRMU, SP, WIT)

Conference Information
Committee WIT / SP / ASJ-H / PRMU
Conference Date 2015/6/18(2days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Kiyohiko Nunokawa(Tokyo International Univ.) / Kazunori Mano(Shibaura Inst. of Tech.) / Masato Akagi(北陸先端大) / Eisaku Maeda(NTT)
Vice Chair Chikamune Wada(Kyushu Inst. of Tech.) / Norihide Kitaoka(Tokushima Univ.) / Shigeto Furukawa(NTT) / Shuji Senda(NEC) / Seiichi Uchida(Kyushu Univ.)
Secretary Chikamune Wada(Nagoya Inst. of Tech.) / Norihide Kitaoka(AIST) / Shigeto Furukawa(Tsukuba Univ. of Tech.) / Shuji Senda(Tokyo City Univ.) / Seiichi Uchida(Kobe Univ.)
Assistant Tomohiro Amemiya(NTT) / Takeaki Shionome(Tsukuba Univ. of Tech.) / Manabi Miyagi(Tsukuba Univ. of Tech.) / Takashi Nose(Tohoku Univ.) / Taichi Asami(NTT) / / Kazuaki Kondo(Kyoto Univ.) / Akisato Kimura(NTT)

Paper Information
Registration To Technical Committee on Well-being Information Technology / Technical Committee on Speech / * / Technical Committee on Pattern Recognition and Media Understanding
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Noise-robust Prediction of Pronunciation Distances Aiming at Clustering of World Englishes Using a Learner's Self-centered Viewpoint
Sub Title (in English)
Keyword(1) World Englishes
Keyword(2) pronunciation clustering
Keyword(3) structural representation
Keyword(4) support vector regression
Keyword(5) self-centered visualization
Keyword(6) noise suppression
Keyword(7) DNN
1st Author's Name Yuichi Sato
1st Author's Affiliation The University of Tokyo(UT)
2nd Author's Name Yosuke Kashiwagi
2nd Author's Affiliation The University of Tokyo(UT)
3rd Author's Name Shun Kasahara
3rd Author's Affiliation The University of Tokyo(UT)
4th Author's Name Nobuaki Minematsu
4th Author's Affiliation The University of Tokyo(UT)
5th Author's Name Daisuke Saito
5th Author's Affiliation The University of Tokyo(UT)
6th Author's Name Keikichi Hirose
6th Author's Affiliation The University of Tokyo(UT)
Date 2015-06-18
Paper # PRMU2015-45,SP2015-14,WIT2015-14
Volume (vol) vol.115
Number (no) PRMU-98,SP-99,WIT-100
Page pp.pp.77-82(PRMU), pp.77-82(SP), pp.77-82(WIT),
#Pages 6
Date of Issue 2015-06-11 (PRMU, SP, WIT)