Presentation 2008-06-30
A Method for Extraction Regional Information from HTML Documents
Susumu KONNO, Shigeru FUJITA, Yusuke WATANABE,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In recent years, the local government is offering regional information through the Web. However, the website is not made in consideration of machine processing. Some the methods of extracting information intended for the HTML document are developed for this problem. However, in the case of the regional information web pages, some problems are left in the existing method. The first is "Only same template web site". The second is "Only information of one every one page". Because there is such a problem the existing method are not applicable in the regional information web page. In this paper, we propose a method of information extraction of the regional information by the attribute word. This method grasps the meaning of the word by attribute word including the meaning of the word. And, regional information is extracted from the resemblance of case information and HTML document. In addition, the information extraction expands the object range to a hyperlink document.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) HTML Processing / Regional Information / Information Extraction
Paper # AI2008-8
Date of Issue

Conference Information
Committee AI
Conference Date 2008/6/23(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Artificial Intelligence and Knowledge-Based Processing (AI)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A Method for Extraction Regional Information from HTML Documents
Sub Title (in English)
Keyword(1) HTML Processing
Keyword(2) Regional Information
Keyword(3) Information Extraction
1st Author's Name Susumu KONNO
1st Author's Affiliation Chiba Institute of Technology()
2nd Author's Name Shigeru FUJITA
2nd Author's Affiliation Chiba Institute of Technology
3rd Author's Name Yusuke WATANABE
3rd Author's Affiliation Chiba Institute of Technology:NTT Comware Corporation
Date 2008-06-30
Paper # AI2008-8
Volume (vol) vol.108
Number (no) 119
Page pp.pp.-
#Pages 6
Date of Issue