Presentation 2006-02-03
Automatic Generation of Domain-specific Vocabulary Template and Integration of Web Pages
Masayuki SUDA, Koji IWANUMA, Hidetomo NABESHIMA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Demand of comparing products or services provided in Internet frequently occurs. In this paper, we propose an approach of automatic integration of Web pages in a specific field for reducing the time and effort of comparison by hand between the Web pages. Our integration approach is based on a vocabulary template which consists of a set of related words in a certain field. The vocabulary template is automatically generated by (1) removing common words in other fields from words in web pages of the certain field, and (2) clustering words using co-occurrence information. We implemented the integration system for comparing web pages based on the vocabulary template. The preliminary experimental results show the usefulness of our approach based on the vocabulary template.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) information integration / word co-occurrence / information retrieval
Paper # NLC2005-117
Date of Issue

Conference Information
Committee NLC
Conference Date 2006/1/27(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Automatic Generation of Domain-specific Vocabulary Template and Integration of Web Pages
Sub Title (in English)
Keyword(1) information integration
Keyword(2) word co-occurrence
Keyword(3) information retrieval
1st Author's Name Masayuki SUDA
1st Author's Affiliation Computer Science and Media Engineering, Interdisciplinary Graduate School of Medical and Engineering, University of Yamanashi()
2nd Author's Name Koji IWANUMA
2nd Author's Affiliation Interdisciplinary Graduate School of Medical and Engineering, University of Yamanashi
3rd Author's Name Hidetomo NABESHIMA
3rd Author's Affiliation Interdisciplinary Graduate School of Medical and Engineering, University of Yamanashi
Date 2006-02-03
Paper # NLC2005-117
Volume (vol) vol.105
Number (no) 595
Page pp.pp.-
#Pages 6
Date of Issue