講義音声認識における講義スライド情報の利用(Session-6 音声認識,第8回音声言語シンポジウム)

Presentation	2006/12/15 Using presentation slide information for lecture speech recognition Hiroki YAMAZAKI, Koji IWANO, Koichi SHINODA, Sadaoki FURUI, Haruo YOKOTA,
PDF Download Page	PDF download Page Link
Abstract(in Japanese)	(See Japanese page)
Abstract(in English)	We propose a dynamic language model adaptation method for lecture speech recognition in which the information of text on slides for lectures is used. The speech data corresponding to each slide are recognized with a language model adapted to them by using the slide texts as adaptation data. We evaluated the proposed method by using the speech data of three classroom courses in Japanese, and confirmed its effectiveness. The average speech recognition error was reduced by 3.1% by the global adaptation using all slides used in a cource. The error rates of recall and precision for keywords were also reduced by 21.5% and 13.8% respectively. Furthermore, we achieved the improvement of keyword detection performance by the adaptation using each slide locally. The error rates of recall and precision for keywords were reduced by 3.1% and 1.4% respectively from global adaptation.
Keyword(in Japanese)	(See Japanese page)
Keyword(in English)	Language model adaptation / speech recognition / classroom lecture speech
Paper #	NLC2006-66,SP2006-122
Date of Issue

Paper Information
Registration To	Natural Language Understanding and Models of Communication (NLC)
Language	JPN
Title (in Japanese)	(See Japanese page)
Sub Title (in Japanese)	(See Japanese page)
Title (in English)	Using presentation slide information for lecture speech recognition
Sub Title (in English)
Keyword(1)	Language model adaptation
Keyword(2)	speech recognition
Keyword(3)	classroom lecture speech
1st Author's Name	Hiroki YAMAZAKI
1st Author's Affiliation	Department of Computer Science, Tokyo Institute of Technology()
2nd Author's Name	Koji IWANO
2nd Author's Affiliation	Department of Computer Science, Tokyo Institute of Technology
3rd Author's Name	Koichi SHINODA
3rd Author's Affiliation	Department of Computer Science, Tokyo Institute of Technology
4th Author's Name	Sadaoki FURUI
4th Author's Affiliation	Department of Computer Science, Tokyo Institute of Technology
5th Author's Name	Haruo YOKOTA
5th Author's Affiliation	Department of Computer Science, Tokyo Institute of Technology
Date	2006/12/15
Paper #	NLC2006-66,SP2006-122
Volume (vol)	vol.106
Number (no)	442
Page	pp.pp.-
#Pages	6
Date of Issue