Presentation | 2012-12-21 Sparse Coding-Based Voice Conversion from Lip Information Ryo AIHARA, Ryoichi TAKASHIMA, Tetsuya TAKIGUCHI, Yasuo ARIKI |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | The technology for recognizing speech content from lip motion is called visual speech recognition (VSR). VSR is an important means of communication for people with hearing or speech impairments. In this paper, we propose a sparse-coding-based voice conversion method that uses lip motion without text information. Lip information and voice are extracted from videos and used to construct a lip dictionary and a voice dictionary. Input lip information is represented by a linear combination of a small number of bases in the lip dictionary. These bases are then replaced with the corresponding bases in the voice dictionary and recombined into voice information. In this paper, we conducted vowel conversion because vowels can be recognized from lip information. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Sparse Coding / Voice Conversion / Lipreading / Lip information |
Paper # | SP2012-95 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2012/12/13 (1 day) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Sparse Coding-Based Voice Conversion from Lip Information |
Sub Title (in English) | |
Keyword(1) | Sparse Coding |
Keyword(2) | Voice Conversion |
Keyword(3) | Lipreading |
Keyword(4) | Lip information |
1st Author's Name | Ryo AIHARA |
1st Author's Affiliation | Graduate School of System Informatics Kobe University |
2nd Author's Name | Ryoichi TAKASHIMA |
2nd Author's Affiliation | Graduate School of System Informatics Kobe University |
3rd Author's Name | Tetsuya TAKIGUCHI |
3rd Author's Affiliation | Organization of Advanced Science and Technology Kobe University |
4th Author's Name | Yasuo ARIKI |
4th Author's Affiliation | Organization of Advanced Science and Technology Kobe University |
Date | 2012-12-21 |
Paper # | SP2012-95 |
Volume (vol) | vol.112 |
Number (no) | 369 |
Page | pp.- |
#Pages | 6 |
Date of Issue |
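
The conversion step described in the abstract (sparse weights estimated on the lip dictionary, then reused on the paired voice dictionary) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the function names `sparse_code` and `convert_lip_to_voice`, the toy dictionaries, and the use of greedy orthogonal matching pursuit as the sparse solver. The abstract does not say which solver or features the authors used, so this is not their implementation.

```python
import numpy as np


def sparse_code(dictionary, x, n_nonzero):
    """Greedy OMP: approximate x using at most n_nonzero dictionary columns."""
    x = np.asarray(x, dtype=float)
    norms = np.linalg.norm(dictionary, axis=0)
    norms[norms == 0] = 1.0  # avoid dividing by zero for all-zero bases
    residual = x.copy()
    support = []
    coef = np.zeros(0)
    for _ in range(min(n_nonzero, dictionary.shape[1])):
        # Score each basis by its normalized correlation with the residual.
        scores = np.abs(dictionary.T @ residual) / norms
        if support:
            scores[support] = -np.inf  # never pick the same basis twice
        support.append(int(np.argmax(scores)))
        # Refit the weights of all selected bases by least squares.
        coef, *_ = np.linalg.lstsq(dictionary[:, support], x, rcond=None)
        residual = x - dictionary[:, support] @ coef
        if np.linalg.norm(residual) < 1e-10:
            break
    weights = np.zeros(dictionary.shape[1])
    weights[support] = coef
    return weights


def convert_lip_to_voice(lip_dict, voice_dict, lip_feature, n_nonzero=2):
    """Estimate sparse weights on the lip dictionary, reuse them on the voice dictionary."""
    weights = sparse_code(lip_dict, lip_feature, n_nonzero)
    # Column k of voice_dict is assumed to be paired with column k of lip_dict.
    return voice_dict @ weights, weights


# Toy, hand-made dictionaries (hypothetical values, not data from the paper).
lip_dict = np.array([[1.0, 0.0, 1.0],
                     [0.0, 1.0, 1.0],
                     [0.0, 0.0, 1.0]])    # 3 lip features x 3 bases
voice_dict = np.array([[10.0, 20.0, 30.0],
                       [1.0, 2.0, 3.0]])  # 2 voice features x 3 bases
lip_feature = 2.0 * lip_dict[:, 1]
voice, weights = convert_lip_to_voice(lip_dict, voice_dict, lip_feature)
```

The sketch relies on the two dictionaries sharing a column index, so that a weight found for a lip basis can be applied directly to its voice counterpart. A solver with non-negative weights could be substituted for the greedy one without changing that structure.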