Presentation | 2006-02-03 Automatic Generation of Domain-specific Vocabulary Template and Integration of Web Pages Masayuki SUDA, Koji IWANUMA, Hidetomo NABESHIMA, |
---|---|
PDF Download Page | ![]() |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Demand of comparing products or services provided in Internet frequently occurs. In this paper, we propose an approach of automatic integration of Web pages in a specific field for reducing the time and effort of comparison by hand between the Web pages. Our integration approach is based on a vocabulary template which consists of a set of related words in a certain field. The vocabulary template is automatically generated by (1) removing common words in other fields from words in web pages of the certain field, and (2) clustering words using co-occurrence information. We implemented the integration system for comparing web pages based on the vocabulary template. The preliminary experimental results show the usefulness of our approach based on the vocabulary template. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | information integration / word co-occurrence / information retrieval |
Paper # | NLC2005-117 |
Date of Issue |
Conference Information | |
Committee | NLC |
---|---|
Conference Date | 2006/1/27(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Natural Language Understanding and Models of Communication (NLC) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Automatic Generation of Domain-specific Vocabulary Template and Integration of Web Pages |
Sub Title (in English) | |
Keyword(1) | information integration |
Keyword(2) | word co-occurrence |
Keyword(3) | information retrieval |
1st Author's Name | Masayuki SUDA |
1st Author's Affiliation | Computer Science and Media Engineering, Interdisciplinary Graduate School of Medical and Engineering, University of Yamanashi() |
2nd Author's Name | Koji IWANUMA |
2nd Author's Affiliation | Interdisciplinary Graduate School of Medical and Engineering, University of Yamanashi |
3rd Author's Name | Hidetomo NABESHIMA |
3rd Author's Affiliation | Interdisciplinary Graduate School of Medical and Engineering, University of Yamanashi |
Date | 2006-02-03 |
Paper # | NLC2005-117 |
Volume (vol) | vol.105 |
Number (no) | 595 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |