Presentation 2005/7/25
Feature Analysis and Keyword Extraction for Automatic Classification of Web Site
Takatomo HONDA, Masahito YAMAMOTO, Hidenori KAWAMURA, Azuma OHUCHI
Abstract(in Japanese) (See Japanese page)
Abstract(in English) A website can be regarded as belonging to a certain category, such as accommodations, restaurants, or facilities, as in directory-type search engines. In general, a website contains information that is valuable for category classification, such as links to and from other sites, files with various extensions, and a large amount of text data. In this paper, we investigate whether websites belonging to the same category share common features. In particular, we show that certain keywords are very important for classifying websites into categories. Using this keyword information, we show that many websites can be classified into the appropriate category with high precision, taking three tourism-related categories (museum, restaurant, and accommodations) as examples.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Web site / Automatic Classification / Feature Analysis / Keyword Extraction
Paper # AI2005-7
Date of Issue

Conference Information
Committee AI
Conference Date 2005/7/25 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Artificial Intelligence and Knowledge-Based Processing (AI)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Feature Analysis and Keyword Extraction for Automatic Classification of Web Site
Sub Title (in English)
Keyword(1) Web site
Keyword(2) Automatic Classification
Keyword(3) Feature Analysis
Keyword(4) Keyword Extraction
1st Author's Name Takatomo HONDA
1st Author's Affiliation Graduate School of Information Science and Technology, Hokkaido University
2nd Author's Name Masahito YAMAMOTO
2nd Author's Affiliation Graduate School of Information Science and Technology, Hokkaido University
3rd Author's Name Hidenori KAWAMURA
3rd Author's Affiliation Graduate School of Information Science and Technology, Hokkaido University
4th Author's Name Azuma OHUCHI
4th Author's Affiliation Graduate School of Information Science and Technology, Hokkaido University
Date 2005/7/25
Paper # AI2005-7
Volume (vol) vol.105
Number (no) 224
Page pp.-
#Pages 4
Date of Issue