Presentation | 2018-10-25 [Tutorial Lecture] Development and Application of Voice User Interface for Device Operation with High Usability Noboru Hayasaka |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | The speech recognition frameworks used in smart speakers and smartphones require a network connection, which burdens the user with increased data traffic and the need for networking knowledge. In addition, achieving high recognition accuracy requires properly constructed acoustic models and language models, but building them is very costly and hampers new entrants. To deal with these problems, I have developed a voice user interface for device operation based on a standalone word recognition method. Because it is implemented as a standalone system, this interface removes the dependence on a network connection, and it greatly improves usability through frontal face detection. Furthermore, in consideration of the settings in which this interface will be used, feature normalization and multi-condition training were introduced to achieve high recognition performance in noisy environments. In this paper, I describe the flow of the developed interface in detail, present experimental results under noisy environments, and finally introduce an adjustable bed equipped with this voice user interface. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Isolated word recognition / Voice user interface for device operation |
Paper # | SIS2018-15 |
Date of Issue | 2018-10-18 (SIS) |
Conference Information | |
Committee | SIS / ITE-BCT |
---|---|
Conference Date | 2018/10/25 (2 days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | Kyoto University Clock Tower Centennial Hall |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | System Implementation Technology, Short Range Wireless Systems, Smart Multimedia Systems, Broadcasting Technology, etc. |
Chair | Takayuki Nakachi(NTT) / Tomoaki Otsuki(Keio Univ) |
Vice Chair | Noriaki Suetake(Yamaguchi Univ.) / Tomoaki Kimura(Kanagawa Inst. of Tech.) / Kyoichi Saito(NHK) / Yasushi Kasuga(TV Asahi) |
Secretary | Noriaki Suetake(Kyushu Inst. of Tech.) / Tomoaki Kimura(Tokyo Metropolitan Univ.) / Kyoichi Saito(B-SAT) / Yasushi Kasuga(NHK) |
Assistant | Takanori Koga(National Inst. of Tech. Tokuyama College) / Hideaki Misawa(National Inst. of Tech., Ube College) / Shigeki Shiokawa(Kanagawa Inst. of Tech.) / Toshiharu Morizumi(NTT) / Iwao Namikawa(Kansai Telecasting Corporation) |
Paper Information | |
Registration To | Technical Committee on Smart Info-Media Systems / Technical Group on Broadcasting Technology |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | [Tutorial Lecture] Development and Application of Voice User Interface for Device Operation with High Usability |
Sub Title (in English) | |
Keyword(1) | Isolated word recognition |
Keyword(2) | Voice user interface for device operation |
1st Author's Name | Noboru Hayasaka |
1st Author's Affiliation | Osaka Electro-Communication University(OECU) |
Date | 2018-10-25 |
Paper # | SIS2018-15 |
Volume (vol) | vol.118 |
Number (no) | SIS-264 |
Page | pp.57-62(SIS) |
#Pages | 6 |
Date of Issue | 2018-10-18 (SIS) |