Presentation 2008-03-20
Initial Evaluation of the Drivers' Japanese Speech Corpus in a Car Environment
Kousuke HIRAKI, Takahiro SHINOZAKI, Koichi SHINODA, Agnieszka BETKOWSKA, Koji IWANO, Sadaoki FURUI,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Car navigation systems are getting more and more popular and many of them equip a speech recognition system for hands-free interface. However, the speech input interface is not widely used because of insufficient recognition performance. In order to improve the recognition performance and make the speech interface more practical, a real-car-environment speech corpus "Drivers' Japanese Speech Corpus in a Car Environment" is under construction by a project supported by the Japanese Ministry of Economy, Trade and Industry. In this study, we used the command task portion of the corpus recorded under three conditions: idling, running in a city, and running on a highway. We used the data from the corpus only as a test set and made a recognition system by optimally combining several existing corpora with several noise robustness techniques. Experimental results show that using an HMM trained on multiple conditions with spectral subtraction is the best for the car noises. Recognition performance was largely improved and more than 90% word accuracy was achieved for all the recording conditions. In particular, over a 50% absolute improvement in accuracy was observed for speeches given by female speakers uttered when driving on a highway.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) car navigation system / noise robustness / spectral subtraction / tree-structured clustering
Paper # SP2007-202
Date of Issue

Conference Information
Committee SP
Conference Date 2008/3/13(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Speech (SP)
Language ENG
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Initial Evaluation of the Drivers' Japanese Speech Corpus in a Car Environment
Sub Title (in English)
Keyword(1) car navigation system
Keyword(2) noise robustness
Keyword(3) spectral subtraction
Keyword(4) tree-structured clustering
1st Author's Name Kousuke HIRAKI
1st Author's Affiliation Department of Computer Science, Tokyo Institute of Technology()
2nd Author's Name Takahiro SHINOZAKI
2nd Author's Affiliation Department of Computer Science, Tokyo Institute of Technology
3rd Author's Name Koichi SHINODA
3rd Author's Affiliation Department of Computer Science, Tokyo Institute of Technology
4th Author's Name Agnieszka BETKOWSKA
4th Author's Affiliation Department of Computer Science, Tokyo Institute of Technology
5th Author's Name Koji IWANO
5th Author's Affiliation Department of Computer Science, Tokyo Institute of Technology
6th Author's Name Sadaoki FURUI
6th Author's Affiliation Department of Computer Science, Tokyo Institute of Technology
Date 2008-03-20
Paper # SP2007-202
Volume (vol) vol.107
Number (no) 551
Page pp.pp.-
#Pages 6
Date of Issue