Presentation | 2013-06-13 A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Spectral Compensation and Statistical Source Excitation Generation Kou TANAKA, Tomoki TODA, Graham NEUBIG, Sakriani SAKTI, Satoshi NAKAMURA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | We present a hybrid approach to improving naturalness of electrolaryngeal (EL) speech while minimizing degradation in listenability. An electrolarynx is a device that artificially generates excitation sounds to enable laryngectomees to produce EL speech. Although proficient laryngectomees can produce quite intelligible EL speech, it sounds very unnatural due to the mechanical excitation produced by the device. Moreover, the excitation sounds produced by the device often leak outside, adding noise to EL speech. To address these issues, we propose a hybrid method using the noise reduction method for enhancing spectral parameters and voice conversion method for predicting excitation parameters. The experimental results demonstrate the proposed method yields significant improvements in naturalness compared with EL speech while keeping listenability high enough. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | speaking-aid / electrolaryngeal speech / spectral subtraction / voice conversion / hybrid approach |
Paper # | SP2013-37,WIT2013-7 |
Date of Issue |
Conference Information | |
Committee | WIT |
---|---|
Conference Date | 2013/6/6(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Well-being Information Technology(WIT) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | A Hybrid Approach to Electrolaryngeal Speech Enhancement Based on Spectral Compensation and Statistical Source Excitation Generation |
Sub Title (in English) | |
Keyword(1) | speaking-aid |
Keyword(2) | electrolaryngeal speech |
Keyword(3) | spectral subtraction |
Keyword(4) | voice conversion |
Keyword(5) | hybrid approach |
1st Author's Name | Kou TANAKA |
1st Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology() |
2nd Author's Name | Tomoki TODA |
2nd Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology |
3rd Author's Name | Graham NEUBIG |
3rd Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology |
4th Author's Name | Sakriani SAKTI |
4th Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology |
5th Author's Name | Satoshi NAKAMURA |
5th Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology |
Date | 2013-06-13 |
Paper # | SP2013-37,WIT2013-7 |
Volume (vol) | vol.113 |
Number (no) | 77 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |