Paper Abstract and Keywords |
Presentation |
2009-09-11 09:30
Contribution of Visual Information to Speech Cognition Kazuhiro Ito (Chiba Inst. of Tech), Kaname Mochizuki (Teikyo Univ.), Hitoshi Ohnishi (The Open Univ. of Japan), Naoto Nakamura (Chiba Inst. of Tech) CQ2009-34 |
Abstract |
(in Japanese) |
(See Japanese page) |
(in English) |
When we talk with somebody, not only the voice of speaker but also his or her facial expressions play an important role in mutual understanding. This report examined the contribution of visual information of speaker’s face to speech cognition. In our experiment, participants were presented two types of materials: (1) voice only, (2) motion picture of speaker with voice and face under four different levels of background noise conditions and were asked to repeat precisely what speaker said. The results showed that accuracy of repeat was same or higher in the voice only material than motion picture when materials were presented without noise. But when the noise added, accuracy was higher in motion picture. The analysis of gaze tracking data showed that participants tended to look at speaker’s eyes when the motion picture presented without noise, but they looked at speaker’s mouth when noise added. These results clearly show the contribution of the visual information of speaker’s face to speech cognition. |
Keyword |
(in Japanese) |
(See Japanese page) |
(in English) |
Audio-visual integration / Speech recognition / Gaze tracking / / / / / |
Reference Info. |
IEICE Tech. Rep., vol. 109, no. 191, CQ2009-34, pp. 49-52, Sept. 2009. |
Paper # |
CQ2009-34 |
Date of Issue |
2009-09-03 (CQ) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Download PDF |
CQ2009-34 |
Conference Information |
Committee |
CQ |
Conference Date |
2009-09-10 - 2009-09-11 |
Place (in Japanese) |
(See Japanese page) |
Place (in English) |
Takayama Culture Center |
Topics (in Japanese) |
(See Japanese page) |
Topics (in English) |
Quality of multi-sensory media, Audio and video quality, Quality control of networks, Architecture of Next/New generation networks, Communications in virtual environments, general fields |
Paper Information |
Registration To |
CQ |
Conference Code |
2009-09-CQ |
Language |
Japanese |
Title (in Japanese) |
(See Japanese page) |
Sub Title (in Japanese) |
(See Japanese page) |
Title (in English) |
Contribution of Visual Information to Speech Cognition |
Sub Title (in English) |
|
Keyword(1) |
Audio-visual integration |
Keyword(2) |
Speech recognition |
Keyword(3) |
Gaze tracking |
Keyword(4) |
|
Keyword(5) |
|
Keyword(6) |
|
Keyword(7) |
|
Keyword(8) |
|
1st Author's Name |
Kazuhiro Ito |
1st Author's Affiliation |
Chiba Institude of Technology (Chiba Inst. of Tech) |
2nd Author's Name |
Kaname Mochizuki |
2nd Author's Affiliation |
Teikyo University (Teikyo Univ.) |
3rd Author's Name |
Hitoshi Ohnishi |
3rd Author's Affiliation |
The Open University of Japan (The Open Univ. of Japan) |
4th Author's Name |
Naoto Nakamura |
4th Author's Affiliation |
Chiba Institude of Technology (Chiba Inst. of Tech) |
5th Author's Name |
|
5th Author's Affiliation |
() |
6th Author's Name |
|
6th Author's Affiliation |
() |
7th Author's Name |
|
7th Author's Affiliation |
() |
8th Author's Name |
|
8th Author's Affiliation |
() |
9th Author's Name |
|
9th Author's Affiliation |
() |
10th Author's Name |
|
10th Author's Affiliation |
() |
11th Author's Name |
|
11th Author's Affiliation |
() |
12th Author's Name |
|
12th Author's Affiliation |
() |
13th Author's Name |
|
13th Author's Affiliation |
() |
14th Author's Name |
|
14th Author's Affiliation |
() |
15th Author's Name |
|
15th Author's Affiliation |
() |
16th Author's Name |
|
16th Author's Affiliation |
() |
17th Author's Name |
|
17th Author's Affiliation |
() |
18th Author's Name |
|
18th Author's Affiliation |
() |
19th Author's Name |
|
19th Author's Affiliation |
() |
20th Author's Name |
|
20th Author's Affiliation |
() |
Speaker |
Author-1 |
Date Time |
2009-09-11 09:30:00 |
Presentation Time |
30 minutes |
Registration for |
CQ |
Paper # |
CQ2009-34 |
Volume (vol) |
vol.109 |
Number (no) |
no.191 |
Page |
pp.49-52 |
#Pages |
4 |
Date of Issue |
2009-09-03 (CQ) |
|