Presentation 2015-07-18
On the intelligibility of Japanese speech reduced into and resynthesized from principal-component spaces
Takuya Kishida, Yoshitaka Nakajima, Kazuo Ueda, Gerard B. Remijn,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Power fluctuations in a set of critical-band filters convey sufficient information for linguistic communication, and the present study examined whether the number of fluctuations could be reduced by principal component analysis keeping high enough intelligibility of speech signals resynthesized from the obtained subspace. Spoken Japanese sentences were converted into power fluctuations in 20 critical-band filters. Principal component analyses were performed to reduce these 20-dimensional fluctuations into subspaces of 1-9 dimension(s). Speech signals were resynthesized from these subspaces as noise-vocoded speech, and their intelligibility was measured as percentages of Japanese morae identified correctly. A big leap of mora identification appeared, from 5.4% to 79.7%, as the number of subspace dimensions increased from 2 to 3. Since very similar 3-dimensional subspaces had appeared in spoken sentences of different languages in previous studies, it is very likely that essential linguistic information can be conveyed by 3-dimensional fluctuations extracted from the 20 power fluctuations.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) speech perception / principal component analysis / noise-vocoded speech / critical band
Paper # HIP2015-47
Date of Issue 2015-07-11 (HIP)

Conference Information
Committee HIP
Conference Date 2015/7/18(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Kyushu Sangyo University
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair Hideyuki Ando(Osaka Univ.)
Vice Chair Masahiro Ishii(Sapporo City Univ.) / Miyuki Kamachi(Kogakuin Univ.)
Secretary Masahiro Ishii(NICT) / Miyuki Kamachi(KDDI R&D Labs.)
Assistant Sinobu Kuroki(NTT) / Mutsumi Suganuma(Waseda Univ.)

Paper Information
Registration To Technical Committee on Human Information Processing
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) On the intelligibility of Japanese speech reduced into and resynthesized from principal-component spaces
Sub Title (in English)
Keyword(1) speech perception
Keyword(2) principal component analysis
Keyword(3) noise-vocoded speech
Keyword(4) critical band
1st Author's Name Takuya Kishida
1st Author's Affiliation Kyushu University(Kyushu Univ.)
2nd Author's Name Yoshitaka Nakajima
2nd Author's Affiliation Kyushu University(Kyushu Univ.)
3rd Author's Name Kazuo Ueda
3rd Author's Affiliation Kyushu University(Kyushu Univ.)
4th Author's Name Gerard B. Remijn
4th Author's Affiliation Kyushu University(Kyushu Univ.)
Date 2015-07-18
Paper # HIP2015-47
Volume (vol) vol.115
Number (no) HIP-149
Page pp.pp.31-35(HIP),
#Pages 5
Date of Issue 2015-07-11 (HIP)