Paper Abstract and Keywords |
Presentation |
2013-02-02 15:25
Eye motion input based speech synthesis interface for communication aids Fuming Fang, Takahiro Shinozaki, Yasuo Horiuchi, Shingo Kuroiwa (Chiba Univ), Sadaoki Furui (Tokyo Tech), Toshimitsu Musha (BFL) WIT2012-38 |
Abstract |
(in English) |
To provide an efficient means of communication for people who, because of amyotrophic lateral sclerosis (ALS), cannot move any muscles other than those of the eyes, we are studying a speech synthesis interface based on electrooculogram (EOG) input. The system consists of an EOG input module, an eye motion recognizer, and a speech synthesizer. In this paper, we improve the EOG-based eye motion recognizer by applying speech recognition techniques. Our previous system used a hidden Markov model (HMM) based bi eye-motion model, which could not adequately model the context effects of eye motions. In this study, we investigate a tied-state tri eye-motion model and also integrate an N-gram model into the recognition system. Experiments show that the tri eye-motion model achieves a character recognition accuracy of 96.2%, compared with 84.3% and 89.1% for the mono and bi eye-motion models, respectively. Combining a character 3-gram model with the tri eye-motion model yields the highest character accuracy, 97.3%. |
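The difference between the mono, bi, and tri eye-motion models named in the abstract can be illustrated with a minimal sketch of context-dependent label expansion, by analogy with triphone labels in speech recognition. The motion symbols (`up`, `left`, `down`), the `left-center+right` label notation, the function name `expand_context`, and the choice of left context for the bi model are all assumptions made for illustration; the abstract does not specify them.

```python
from typing import List


def expand_context(motions: List[str], order: str = "tri") -> List[str]:
    """Turn a sequence of eye-motion symbols into context-dependent labels.

    order="mono": each motion is modeled on its own.
    order="bi":   each motion is conditioned on the preceding motion
                  (assumed here; the source does not say which side is used).
    order="tri":  each motion is conditioned on both neighbors.
    """
    if order not in ("mono", "bi", "tri"):
        raise ValueError(f"unknown order: {order}")

    labels = []
    for i, current in enumerate(motions):
        left = motions[i - 1] if i > 0 else None
        right = motions[i + 1] if i + 1 < len(motions) else None

        label = current
        if order in ("bi", "tri") and left is not None:
            label = f"{left}-{label}"       # add left context
        if order == "tri" and right is not None:
            label = f"{label}+{right}"      # add right context
        labels.append(label)
    return labels


# Hypothetical motion sequence, used only to show the three labelings.
sequence = ["up", "left", "down"]
for order in ("mono", "bi", "tri"):
    print(order, expand_context(sequence, order))
```

Because the number of distinct tri labels grows with the cube of the motion inventory, many of them would have little training data, which is the usual motivation for tying states across similar contexts as in the tied-state model the abstract describes.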
Keyword |
(in English) |
Electrooculogram / Hidden Markov model / N-gram / Speech synthesis / Communication aids |
Reference Info. |
IEICE Tech. Rep., vol. 112, no. 426, WIT2012-38, pp. 29-34, Feb. 2013. |
Paper # |
WIT2012-38 |
Date of Issue |
2013-01-26 (WIT) |
ISSN |
Print edition: ISSN 0913-5685 Online edition: ISSN 2432-6380 |
Copyright and reproduction |
All rights are reserved and no part of this publication may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopy, recording, or any information storage and retrieval system, without permission in writing from the publisher. Notwithstanding, instructors are permitted to photocopy isolated articles for noncommercial classroom use without fee. (License No.: 10GA0019/12GB0052/13GB0056/17GB0034/18GB0034) |
Conference Information |
Committee |
WIT |
Conference Date |
2013-02-02 - 2013-02-02 |
Place (in English) |
Nagoya Institute of Technology |
Topics (in English) |
Well-being Information Technology, Local Community and Well‐being |
Paper Information |
Registration To |
WIT |
Conference Code |
2013-02-WIT |
Language |
Japanese |
Title (in English) |
Eye motion input based speech synthesis interface for communication aids |
Keyword(1) |
Electrooculogram |
Keyword(2) |
Hidden Markov model |
Keyword(3) |
N-gram |
Keyword(4) |
Speech synthesis |
Keyword(5) |
Communication aids |
1st Author's Name |
Fuming Fang |
1st Author's Affiliation |
Chiba University (Chiba Univ) |
2nd Author's Name |
Takahiro Shinozaki |
2nd Author's Affiliation |
Chiba University (Chiba Univ) |
3rd Author's Name |
Yasuo Horiuchi |
3rd Author's Affiliation |
Chiba University (Chiba Univ) |
4th Author's Name |
Shingo Kuroiwa |
4th Author's Affiliation |
Chiba University (Chiba Univ) |
5th Author's Name |
Sadaoki Furui |
5th Author's Affiliation |
Tokyo Institute of Technology (Tokyo Tech) |
6th Author's Name |
Toshimitsu Musha |
6th Author's Affiliation |
Brain Functions Laboratory, Inc (BFL) |
Speaker |
Author-1 |
Date Time |
2013-02-02 15:25:00 |
Presentation Time |
25 minutes |
Registration for |
WIT |
Volume (vol) |
vol.112 |
Number (no) |
no.426 |
Page |
pp.29-34 |
#Pages |
6 |
|