Presentation 2001/7/12
Extraction of Structured Partial Documents Based on IR Technique
KENJI HATANO, HIROKO KINUTANI, MASATOSHI YOSHIKAWA, SHUNSUKE UEMURA,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Until now, a lot of researches were appearard concerned with extraction of structured partial documents. These researches can be classified into two categories - database-based approach using query language and IR-based approach. However, some text nodes, leaf nodes of structured partial documents searched by information retrieval systems, are not suitable for user's query in many cases. In this paper, we proposeed an approach of removing such nodes from the structureed partial documents searched by the IR systems, and checked the validity of our proposed method. We also proposed an evaluation method for our system of structured partial documents, and got some useful knowlege for its establishment. If our proposed method is established, people can retrieve structured partial documents relevant to user's query effectively from XHTML documents which are emerging as the standard format representing documents on the Internet.
Keyword(in Japanese) (See Japanese page)
Keyword(in English)
Paper # DE2001-96
Date of Issue

Conference Information
Committee DE
Conference Date 2001/7/12(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Data Engineering (DE)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Extraction of Structured Partial Documents Based on IR Technique
Sub Title (in English)
Keyword(1)
1st Author's Name KENJI HATANO
1st Author's Affiliation Graduate School of Information Science, Nara Institute of Science and Technology(NAIST)()
2nd Author's Name HIROKO KINUTANI
2nd Author's Affiliation Graduate School of Information Science, Nara Institute of Science and Technology(NAIST)
3rd Author's Name MASATOSHI YOSHIKAWA
3rd Author's Affiliation Graduate School of Information Science, Nara Institute of Science and Technology(NAIST):Software Research Division, National Institute of Informatics(NII)
4th Author's Name SHUNSUKE UEMURA
4th Author's Affiliation Graduate School of Information Science, Nara Institute of Science and Technology(NAIST)
Date 2001/7/12
Paper # DE2001-96
Volume (vol) vol.101
Number (no) 193
Page pp.pp.-
#Pages 8
Date of Issue