Paper Abstract and Keywords |
Presentation |
2012-12-21 15:25
Sparse Coding-Based Voice Conversion from Lip Information Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki (Kobe Univ.) SP2012-95 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
Visual speech recognition (VSR) is a technology that recognizes speech content from lip motion. VSR is an important means of communication for people with hearing or speech impairments. In this paper, we propose a sparse-coding-based voice conversion method that uses lip motion without text information. Lip information and voice are extracted from videos and used to construct a lip dictionary and a voice dictionary. Input lip information is represented as a linear combination of a small number of bases in the lip dictionary. These bases are then replaced with the corresponding bases in the voice dictionary and recombined into voice information. In this paper, we perform vowel conversion, because vowels can be recognized from lip information. |
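The conversion steps described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not name the sparse solver or the feature types, so greedy orthogonal matching pursuit is swapped in as a simple stand-in, and the function names (`omp`, `convert_lip_to_voice`) and array shapes are hypothetical. The sketch also assumes the two dictionaries are column-aligned, so that column k of the lip dictionary and column k of the voice dictionary come from the same video frame.

```python
import numpy as np


def omp(dictionary, signal, n_nonzero):
    """Approximate `signal` with at most `n_nonzero` columns of `dictionary`.

    Greedy orthogonal matching pursuit, used here only as a stand-in
    sparse solver (assumption: the abstract does not specify one).
    """
    residual = signal.astype(float).copy()
    support = []
    coef = np.zeros(0)
    for _ in range(n_nonzero):
        # Pick the basis most correlated with what is still unexplained.
        scores = np.abs(dictionary.T @ residual)
        if support:
            scores[support] = -np.inf  # never pick the same basis twice
        support.append(int(np.argmax(scores)))
        # Refit all selected bases jointly by least squares.
        coef, *_ = np.linalg.lstsq(dictionary[:, support], signal, rcond=None)
        residual = signal - dictionary[:, support] @ coef
    weights = np.zeros(dictionary.shape[1])
    weights[support] = coef
    return weights


def convert_lip_to_voice(lip_dict, voice_dict, lip_frame, n_nonzero=2):
    """Map one lip feature vector to a voice feature vector.

    lip_dict:   (lip_dim, n_bases)   hypothetical lip dictionary
    voice_dict: (voice_dim, n_bases) hypothetical voice dictionary, column-aligned
    """
    # Step 1: represent the lip input with a few lip bases.
    weights = omp(lip_dict, lip_frame, n_nonzero)
    # Step 2: reuse the same weights on the corresponding voice bases.
    return voice_dict @ weights, weights


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy data only: orthonormal lip bases and random voice bases.
    lip_dict, _ = np.linalg.qr(rng.normal(size=(8, 5)))
    voice_dict = rng.normal(size=(6, 5))
    lip_frame = lip_dict @ np.array([0.0, 1.5, 0.0, 0.5, 0.0])
    voice_frame, weights = convert_lip_to_voice(lip_dict, voice_dict, lip_frame)
    print("active bases:", np.flatnonzero(weights))
    print("voice feature shape:", voice_frame.shape)
```

The key design point is that the weights are estimated only from lip features and then applied unchanged to the voice dictionary, so no text or phoneme labels are needed at conversion time.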
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Sparse Coding / Voice Conversion / Lipreading / Lip information |
Reference Info. |
IEICE Tech. Rep., vol. 112, no. 369, SP2012-95, pp. 119-124, Dec. 2012. |
Paper # |
SP2012-95 |
Date of Issue |
2012-12-13 (SP) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Conference Information |
Committee |
SP IPSJ-SLP |
Conference Date |
2012-12-20 - 2012-12-21 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
TITECH(Ookayama) |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
14th Symposium on Spoken Language |
Paper Information |
Registration To |
SP |
Conference Code |
2012-12-SP-SLP |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Sparse Coding-Based Voice Conversion from Lip Information |
Keyword(1) |
Sparse Coding |
Keyword(2) |
Voice Conversion |
Keyword(3) |
Lipreading |
Keyword(4) |
Lip information |
1st Author's Name |
Ryo Aihara |
1st Author's Affiliation |
Kobe University (Kobe Univ.) |
2nd Author's Name |
Ryoichi Takashima |
2nd Author's Affiliation |
Kobe University (Kobe Univ.) |
3rd Author's Name |
Tetsuya Takiguchi |
3rd Author's Affiliation |
Kobe University (Kobe Univ.) |
4th Author's Name |
Yasuo Ariki |
4th Author's Affiliation |
Kobe University (Kobe Univ.) |
Speaker |
Author-1 |
Date Time |
2012-12-21 15:25:00 |
Presentation Time |
90 minutes |
Registration for |
SP |
Paper # |
SP2012-95 |
Volume (vol) |
vol.112 |
Number (no) |
no.369 |
Page |
pp.119-124 |
#Pages |
6 |
Date of Issue |
2012-12-13 (SP) |
|