Presentation 2021-06-19
Simulation of Body-conducted Speech and Synthesis of One's Own Voice with a Sound-proof Earmuff and Bone-conduction Microphones
Chen Ruiyan, Nishimura Tazuko, Minematsu Nobuaki, Saito Daisuke
Abstract(in Japanese) (See Japanese page)
Abstract(in English) When one hears his/her recorded voices for the first time, s/he is probably surprised and not rarely disappointed at the differences of voice quality between the recorded voices and his/her own voices. In psychology, this phenomenon is called voice confrontation. Conversion from recorded voices of a speaker to his/her own voices was technically investigated in previous studies, and in the current study, we propose a novel framework for conversion. Here, four new ideas are introduced and tested technically: a) multiple pathways of in-body voice transmission from the oral cavity to the inner ear are taken into account for recording, b) body-conducted speech, not bone-conducted speech, is defined and simulated, c) a special device is prepared to avoid habituation effects in listening tests, and d) a network-based voice conversion technique is applied using a parallel corpus prepared by the above three steps. Experiments show that the proposed framework can generate one's own voices with higher quality, compared to a conventional method, even in cross-language contexts.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) one's own speech / body-conducted speech / sound-proof earmuff / bone-conducted microphones / statistical voice conversion
Paper # SP2021-15
Date of Issue 2021-06-11 (SP)

Conference Information
Committee SP / IPSJ-SLP / IPSJ-MUS
Conference Date 2021/6/18(2days)
Place (in Japanese) (See Japanese page)
Place (in English) Online
Topics (in Japanese) (See Japanese page)
Topics (in English) OTOGAKU Symposium 2021
Chair Hisashi Kawai(NICT) / Norihide Kitaoka(Toyohashi Univ. of Tech.) / Yoshinari Takegawa(Future Univ. Hakodate)
Vice Chair
Secretary (Univ. of Tokyo) / (Waseda Univ.) / (Kyoto Univ.)
Assistant Yusuke Ijima(NTT)

Paper Information
Registration To Technical Committee on Speech / Special Interest Group on Spoken Language Processing / Special Interest Group on Music and Computer
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Simulation of Body-conducted Speech and Synthesis of One's Own Voice with a Sound-proof Earmuff and Bone-conduction Microphones
Sub Title (in English)
Keyword(1) one's own speech
Keyword(2) body-conducted speech
Keyword(3) sound-proof earmuff
Keyword(4) bone-conducted microphones
Keyword(5) statistical voice conversion
1st Author's Name Chen Ruiyan
1st Author's Affiliation the University of Tokyo(UTokyo)
2nd Author's Name Nishimura Tazuko
2nd Author's Affiliation the University of Tokyo(UTokyo)
3rd Author's Name Minematsu Nobuaki
3rd Author's Affiliation the University of Tokyo(UTokyo)
4th Author's Name Saito Daisuke
4th Author's Affiliation the University of Tokyo(UTokyo)
Date 2021-06-19
Paper # SP2021-15
Volume (vol) vol.121
Number (no) SP-66
Page pp.63-68(SP)
#Pages 6
Date of Issue 2021-06-11 (SP)