Presentation | 2014-12-16 Voice Conversion Using Speaker Adaptive Restricted Boltzmann Machine Toru NAKASHIKA, Tetsuya TAKIGUCHI, Yasuo ARIKI, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Voice conversion (VC) is a technique where only speaker-specific information in source speech is converted while keeping phonological information. The technique can be applied to various tasks such as speaker-identity conversion, emotion conversion and aid to speaking for people with articulation disorders. Most of the existing VC methods rely on parallel data-pairs of speech data from source and target speakers uttering the same articles. However, this approach involves several problems; firstly, the data used for the training is limited to the pre-defined articles. Secondly, the use of the trained model is limited only to the speaker pair used in the training. In this paper, we propose a novel probabilistic model called an adaptive restricted Boltzmann machine (ARBM) for VC between arbitrary speakers without use of parallel data. This model consists of a visible-unit and a hidden-unit layer with the speaker-dependent connection. In this paper, we report our experimental results of arbitrary-speaker VC using our model, an ARBM. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Voice conversion / restricted Boltzmann machine / speaker adaptation / non-parallel training |
Paper # | SP2014-126 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2014/12/8(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Voice Conversion Using Speaker Adaptive Restricted Boltzmann Machine |
Sub Title (in English) | |
Keyword(1) | Voice conversion |
Keyword(2) | restricted Boltzmann machine |
Keyword(3) | speaker adaptation |
Keyword(4) | non-parallel training |
1st Author's Name | Toru NAKASHIKA |
1st Author's Affiliation | Graduate School of System Informatics, Kobe University() |
2nd Author's Name | Tetsuya TAKIGUCHI |
2nd Author's Affiliation | Organization of Advanced Science and Technology, Kobe University |
3rd Author's Name | Yasuo ARIKI |
3rd Author's Affiliation | Organization of Advanced Science and Technology, Kobe University |
Date | 2014-12-16 |
Paper # | SP2014-126 |
Volume (vol) | vol.114 |
Number (no) | 365 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |