Presentation 2003/7/24
An Automated Synthesis System of HTML Wrappers, Which Can Easily Be Used by Anyone
Ken MITSUI, Koji IWANUMA, Hidetomo NABESHIMA
Abstract(in Japanese) (See Japanese page)
Abstract(in English) In this paper, we propose an automated synthesis system for HTML wrappers that can easily be used by anyone. The proposed system makes full use of the text information embedded in HTML documents and never demands expert knowledge of the HTML language from the end user. An intended wrapper is specified through an information extraction example, which can easily be made with the very familiar "cut&paste" operation. We also show, through experiments, that automatically synthesised HTML wrappers can achieve high accuracy in extracting information from real WEB sites.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) WEB / HTML / XML / wrapper / text information / cut&paste / automated synthesis
Paper # AI2003-17
Date of Issue

Conference Information
Committee AI
Conference Date 2003/7/24 (1 day)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Artificial Intelligence and Knowledge-Based Processing (AI)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) An Automated Synthesis System of HTML Wrappers, Which Can Easily Be Used by Anyone
Sub Title (in English)
Keyword(1) WEB
Keyword(2) HTML
Keyword(3) XML
Keyword(4) wrapper
Keyword(5) text information
Keyword(6) cut&paste
Keyword(7) automated synthesis
1st Author's Name Ken MITSUI
1st Author's Affiliation Yamanashi University, Graduate School, Dept. of Computer Science and Media Engineering
2nd Author's Name Koji IWANUMA
2nd Author's Affiliation Yamanashi University, Graduate School, Dept. of Computer Science and Media Engineering
3rd Author's Name Hidetomo NABESHIMA
3rd Author's Affiliation Yamanashi University, Graduate School, Dept. of Computer Science and Media Engineering
Date 2003/7/24
Paper # AI2003-17
Volume (vol) vol.103
Number (no) 243
Page pp.-
#Pages 6
Date of Issue