Presentation | 2003/7/24 An Automated Synthesis System of HTML Wrappers, Which Can Easly Be Used by Anyone Ken MITSUl, Koji IWANUMA, Hidetomo NABESHIMA, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | In this paper, we propose an automated synthesis system of HTML wrappers, which can easily be usedby anyone. The proposed synthesis system fully use .the text information embedded in a HTML documents, andnever demand an expert knowledge on the HTML language from an end user. An intended wrapper is specifiedthrough an information extraction example which can easily be made with the very familiar "cut&paste" operation.We also show, through experiments, that automatically-synthesised HTML wrappers can achieve high accuracy ofextracting informations from real WEB sites. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | WEB / HTML / XML / wrapper / text information / cut&paste / automated synthesis |
Paper # | AI2003-17 |
Date of Issue |
Conference Information | |
Committee | AI |
---|---|
Conference Date | 2003/7/24(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Artificial Intelligence and Knowledge-Based Processing (AI) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | An Automated Synthesis System of HTML Wrappers, Which Can Easly Be Used by Anyone |
Sub Title (in English) | |
Keyword(1) | WEB |
Keyword(2) | HTML |
Keyword(3) | XML |
Keyword(4) | wrapper |
Keyword(5) | text information |
Keyword(6) | cut&paste |
Keyword(7) | automated synthesis |
1st Author's Name | Ken MITSUl |
1st Author's Affiliation | Yamanashi University, Graduate School, Dept. of Computer Scienc and Media Engineering() |
2nd Author's Name | Koji IWANUMA |
2nd Author's Affiliation | Yamanashi University, Graduate School, Dept. of Computer Scienc and Media Engineering |
3rd Author's Name | Hidetomo NABESHIMA |
3rd Author's Affiliation | Yamanashi University, Graduate School, Dept. of Computer Scienc and Media Engineering |
Date | 2003/7/24 |
Paper # | AI2003-17 |
Volume (vol) | vol.103 |
Number (no) | 243 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |