Presentation 2013-10-07
A 2.4x-Real-Time VLSI Processor for 60-kWord Continuous Speech Recognition
Guangji HE, Yuki MIYAMOTO, Shintaro IZUMI, Hiroshi KAWAGUCHI, Masahiko YOSHIMOTO
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper describes a low-power VLSI chip for speaker-independent 60-kWord continuous speech recognition based on a context-dependent Hidden Markov Model (HMM). We implement a parallel and pipelined architecture for GMM computation and Viterbi processing. The design includes an 8-path Viterbi transition architecture to maximize the processing speed and adopts a trigram language model to improve the recognition accuracy. A two-level cache architecture is implemented for the demo system. The test chip, fabricated in 40 nm CMOS technology, occupies 1.77 mm × 2.18 mm and contains 2.98 M transistors for logic and 4.29 Mbit of on-chip memory. Measured results show that, compared with the previous work, our implementation achieves a 25% reduction in required frequency (62.5 MHz) and a 26% reduction in power consumption (54.8 mW) for 60-kWord real-time continuous speech recognition. At 200 MHz, the chip can process speech up to 3.02× and 2.25× faster than real time using the bigram and trigram language models, respectively.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) 40nm VLSI / Hidden Markov Model (HMM) / large vocabulary continuous speech recognition (LVCSR)
Paper # VLD2013-52,ICD2013-76,IE2013-52
Date of Issue

Conference Information
Committee ICD
Conference Date 2013/9/30 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Integrated Circuits and Devices (ICD)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A 2.4x-Real-Time VLSI Processor for 60-kWord Continuous Speech Recognition
Sub Title (in English)
Keyword(1) 40nm VLSI
Keyword(2) Hidden Markov Model (HMM)
Keyword(3) large vocabulary continuous speech recognition (LVCSR)
1st Author's Name Guangji HE
1st Author's Affiliation Graduate School of System Informatics, Kobe University
2nd Author's Name Yuki MIYAMOTO
2nd Author's Affiliation
3rd Author's Name Shintaro IZUMI
3rd Author's Affiliation
4th Author's Name Hiroshi KAWAGUCHI
4th Author's Affiliation
5th Author's Name Masahiko YOSHIMOTO
5th Author's Affiliation
Date 2013-10-07
Paper # VLD2013-52,ICD2013-76,IE2013-52
Volume (vol) vol.113
Number (no) 236
Page pp.-
#Pages 6
Date of Issue