Presentation 2007/11/21
Open-Vocabulary Spoken Utterance Retrieval Using Confusion Networks and Its Evaluation
Takaaki HORI, I. Lee HETHERINGTON, Timothy J. HAZEN, James R. GLASS,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) We recently proposed an open-vocabulary spoken utterance retrieval method using confusion networks. In this paper, we particularly focus on its technical details and experimental results with a large-scale corpus of approximately 120 hours. Spoken utterance retrieval is a task to find utterances including a key word or key phrase as a query from a large spoken archive. The proposed method generates word or phone confusion networks from the archive, and translates them into the Inverted Index. By using this index structure, we can quickly retrieve a set of utterances including the query. To deal with out-of-vocabulary (OOV) words in queries and the archive, we apply phone confusion networks and combine them with word confusion networks. With this approach, we enables robust keyword matching for queries including both in-vocabulary and out-of-vocabulary words. In the retrieval experiments with speech recordings in MIT lecture corpus, our method using word-phone-combined confusion networks outperformed conventional indexing and retrieval methods.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Open Vocabulary / Utterance Retrieval / Confusion Network
Paper # SP2007-93
Date of Issue

Conference Information
Committee SP
Conference Date 2007/11/21(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Speech (SP)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Open-Vocabulary Spoken Utterance Retrieval Using Confusion Networks and Its Evaluation
Sub Title (in English)
Keyword(1) Open Vocabulary
Keyword(2) Utterance Retrieval
Keyword(3) Confusion Network
1st Author's Name Takaaki HORI
1st Author's Affiliation NTT Communication Science Laboratories, NTT Corporation()
2nd Author's Name I. Lee HETHERINGTON
2nd Author's Affiliation MIT Computer Science and Artificial Intelligence Laboratory
3rd Author's Name Timothy J. HAZEN
3rd Author's Affiliation MIT Computer Science and Artificial Intelligence Laboratory
4th Author's Name James R. GLASS
4th Author's Affiliation MIT Computer Science and Artificial Intelligence Laboratory
Date 2007/11/21
Paper # SP2007-93
Volume (vol) vol.107
Number (no) 356
Page pp.pp.-
#Pages 6
Date of Issue