Presentation | 2008-03-20 Initial Evaluation of the Drivers' Japanese Speech Corpus in a Car Environment Kousuke HIRAKI, Takahiro SHINOZAKI, Koichi SHINODA, Agnieszka BETKOWSKA, Koji IWANO, Sadaoki FURUI, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Car navigation systems are getting more and more popular and many of them equip a speech recognition system for hands-free interface. However, the speech input interface is not widely used because of insufficient recognition performance. In order to improve the recognition performance and make the speech interface more practical, a real-car-environment speech corpus "Drivers' Japanese Speech Corpus in a Car Environment" is under construction by a project supported by the Japanese Ministry of Economy, Trade and Industry. In this study, we used the command task portion of the corpus recorded under three conditions: idling, running in a city, and running on a highway. We used the data from the corpus only as a test set and made a recognition system by optimally combining several existing corpora with several noise robustness techniques. Experimental results show that using an HMM trained on multiple conditions with spectral subtraction is the best for the car noises. Recognition performance was largely improved and more than 90% word accuracy was achieved for all the recording conditions. In particular, over a 50% absolute improvement in accuracy was observed for speeches given by female speakers uttered when driving on a highway. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | car navigation system / noise robustness / spectral subtraction / tree-structured clustering |
Paper # | SP2007-202 |
Date of Issue |
Conference Information | |
Committee | SP |
---|---|
Conference Date | 2008/3/13(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Speech (SP) |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Initial Evaluation of the Drivers' Japanese Speech Corpus in a Car Environment |
Sub Title (in English) | |
Keyword(1) | car navigation system |
Keyword(2) | noise robustness |
Keyword(3) | spectral subtraction |
Keyword(4) | tree-structured clustering |
1st Author's Name | Kousuke HIRAKI |
1st Author's Affiliation | Department of Computer Science, Tokyo Institute of Technology() |
2nd Author's Name | Takahiro SHINOZAKI |
2nd Author's Affiliation | Department of Computer Science, Tokyo Institute of Technology |
3rd Author's Name | Koichi SHINODA |
3rd Author's Affiliation | Department of Computer Science, Tokyo Institute of Technology |
4th Author's Name | Agnieszka BETKOWSKA |
4th Author's Affiliation | Department of Computer Science, Tokyo Institute of Technology |
5th Author's Name | Koji IWANO |
5th Author's Affiliation | Department of Computer Science, Tokyo Institute of Technology |
6th Author's Name | Sadaoki FURUI |
6th Author's Affiliation | Department of Computer Science, Tokyo Institute of Technology |
Date | 2008-03-20 |
Paper # | SP2007-202 |
Volume (vol) | vol.107 |
Number (no) | 551 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |