Presentation | 2021-06-19 / Simulation of Body-conducted Speech and Synthesis of One's Own Voice with a Sound-proof Earmuff and Bone-conduction Microphones / Chen Ruiyan, Nishimura Tazuko, Minematsu Nobuaki, Saito Daisuke |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | People who hear a recording of their own voice for the first time are often surprised, and not infrequently disappointed, by how different it sounds from the voice they hear when speaking. In psychology, this phenomenon is called voice confrontation. Previous studies have technically investigated converting a speaker's recorded voice into his/her own voice, and in the current study we propose a novel framework for this conversion. Four new ideas are introduced and tested: a) multiple pathways of in-body voice transmission from the oral cavity to the inner ear are taken into account for recording, b) body-conducted speech, not bone-conducted speech, is defined and simulated, c) a special device is prepared to avoid habituation effects in listening tests, and d) a network-based voice conversion technique is applied using a parallel corpus prepared by the above three steps. Experiments show that the proposed framework can generate one's own voice with higher quality than a conventional method, even in cross-language contexts. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | one's own speech / body-conducted speech / sound-proof earmuff / bone-conducted microphones / statistical voice conversion |
Paper # | SP2021-15 |
Date of Issue | 2021-06-11 (SP) |
Conference Information | |
Committee | SP / IPSJ-SLP / IPSJ-MUS |
---|---|
Conference Date | 2021/6/18(2days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Online |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | OTOGAKU Symposium 2021 |
Chair | Hisashi Kawai(NICT) / Norihide Kitaoka(Toyohashi Univ. of Tech.) / Yoshinari Takegawa(Future Univ. Hakodate) |
Vice Chair | |
Secretary | (Univ. of Tokyo) / (Waseda Univ.) / (Kyoto Univ.) |
Assistant | Yusuke Ijima(NTT) |
Paper Information | |
Registration To | Technical Committee on Speech / Special Interest Group on Spoken Language Processing / Special Interest Group on Music and Computer |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Simulation of Body-conducted Speech and Synthesis of One's Own Voice with a Sound-proof Earmuff and Bone-conduction Microphones |
Sub Title (in English) | |
Keyword(1) | one's own speech |
Keyword(2) | body-conducted speech |
Keyword(3) | sound-proof earmuff |
Keyword(4) | bone-conducted microphones |
Keyword(5) | statistical voice conversion |
1st Author's Name | Chen Ruiyan |
1st Author's Affiliation | the University of Tokyo(UTokyo) |
2nd Author's Name | Nishimura Tazuko |
2nd Author's Affiliation | the University of Tokyo(UTokyo) |
3rd Author's Name | Minematsu Nobuaki |
3rd Author's Affiliation | the University of Tokyo(UTokyo) |
4th Author's Name | Saito Daisuke |
4th Author's Affiliation | the University of Tokyo(UTokyo) |
Date | 2021-06-19 |
Paper # | SP2021-15 |
Volume (vol) | vol.121 |
Number (no) | SP-66 |
Page | pp.63-68(SP) |
#Pages | 6 |
Date of Issue | 2021-06-11 (SP) |