Presentation | 2015-07-18 On the intelligibility of Japanese speech reduced into and resynthesized from principal-component spaces Takuya Kishida, Yoshitaka Nakajima, Kazuo Ueda, Gerard B. Remijn, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Power fluctuations in a set of critical-band filters convey sufficient information for linguistic communication, and the present study examined whether the number of fluctuations could be reduced by principal component analysis keeping high enough intelligibility of speech signals resynthesized from the obtained subspace. Spoken Japanese sentences were converted into power fluctuations in 20 critical-band filters. Principal component analyses were performed to reduce these 20-dimensional fluctuations into subspaces of 1-9 dimension(s). Speech signals were resynthesized from these subspaces as noise-vocoded speech, and their intelligibility was measured as percentages of Japanese morae identified correctly. A big leap of mora identification appeared, from 5.4% to 79.7%, as the number of subspace dimensions increased from 2 to 3. Since very similar 3-dimensional subspaces had appeared in spoken sentences of different languages in previous studies, it is very likely that essential linguistic information can be conveyed by 3-dimensional fluctuations extracted from the 20 power fluctuations. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | speech perception / principal component analysis / noise-vocoded speech / critical band |
Paper # | HIP2015-47 |
Date of Issue | 2015-07-11 (HIP) |
Conference Information | |
Committee | HIP |
---|---|
Conference Date | 2015/7/18(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Kyushu Sangyo University |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | Hideyuki Ando(Osaka Univ.) |
Vice Chair | Masahiro Ishii(Sapporo City Univ.) / Miyuki Kamachi(Kogakuin Univ.) |
Secretary | Masahiro Ishii(NICT) / Miyuki Kamachi(KDDI R&D Labs.) |
Assistant | Sinobu Kuroki(NTT) / Mutsumi Suganuma(Waseda Univ.) |
Paper Information | |
Registration To | Technical Committee on Human Information Processing |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | On the intelligibility of Japanese speech reduced into and resynthesized from principal-component spaces |
Sub Title (in English) | |
Keyword(1) | speech perception |
Keyword(2) | principal component analysis |
Keyword(3) | noise-vocoded speech |
Keyword(4) | critical band |
1st Author's Name | Takuya Kishida |
1st Author's Affiliation | Kyushu University(Kyushu Univ.) |
2nd Author's Name | Yoshitaka Nakajima |
2nd Author's Affiliation | Kyushu University(Kyushu Univ.) |
3rd Author's Name | Kazuo Ueda |
3rd Author's Affiliation | Kyushu University(Kyushu Univ.) |
4th Author's Name | Gerard B. Remijn |
4th Author's Affiliation | Kyushu University(Kyushu Univ.) |
Date | 2015-07-18 |
Paper # | HIP2015-47 |
Volume (vol) | vol.115 |
Number (no) | HIP-149 |
Page | pp.pp.31-35(HIP), |
#Pages | 5 |
Date of Issue | 2015-07-11 (HIP) |