Presentation | 2008-06-30 A Method for Extraction Regional Information from HTML Documents Susumu KONNO, Shigeru FUJITA, Yusuke WATANABE, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In recent years, the local government is offering regional information through the Web. However, the website is not made in consideration of machine processing. Some the methods of extracting information intended for the HTML document are developed for this problem. However, in the case of the regional information web pages, some problems are left in the existing method. The first is "Only same template web site". The second is "Only information of one every one page". Because there is such a problem the existing method are not applicable in the regional information web page. In this paper, we propose a method of information extraction of the regional information by the attribute word. This method grasps the meaning of the word by attribute word including the meaning of the word. And, regional information is extracted from the resemblance of case information and HTML document. In addition, the information extraction expands the object range to a hyperlink document. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | HTML Processing / Regional Information / Information Extraction |
Paper # | AI2008-8 |
Date of Issue |
Conference Information | |
Committee | AI |
---|---|
Conference Date | 2008/6/23(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Artificial Intelligence and Knowledge-Based Processing (AI) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | A Method for Extraction Regional Information from HTML Documents |
Sub Title (in English) | |
Keyword(1) | HTML Processing |
Keyword(2) | Regional Information |
Keyword(3) | Information Extraction |
1st Author's Name | Susumu KONNO |
1st Author's Affiliation | Chiba Institute of Technology() |
2nd Author's Name | Shigeru FUJITA |
2nd Author's Affiliation | Chiba Institute of Technology |
3rd Author's Name | Yusuke WATANABE |
3rd Author's Affiliation | Chiba Institute of Technology:NTT Comware Corporation |
Date | 2008-06-30 |
Paper # | AI2008-8 |
Volume (vol) | vol.108 |
Number (no) | 119 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |