Presentation | 2001/7/12 Extraction of Structured Partial Documents Based on IR Technique KENJI HATANO, HIROKO KINUTANI, MASATOSHI YOSHIKAWA, SHUNSUKE UEMURA |
---|---|
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | Until now, much research has been published on the extraction of structured partial documents. This research can be classified into two categories: the database-based approach, which uses a query language, and the IR-based approach. However, some text nodes, that is, leaf nodes of the structured partial documents retrieved by information retrieval systems, are in many cases not suitable for the user's query. In this paper, we propose an approach that removes such nodes from the structured partial documents retrieved by IR systems, and we check the validity of the proposed method. We also propose an evaluation method for our structured partial document system and obtain some useful knowledge for its establishment. If our proposed method is established, people will be able to effectively retrieve structured partial documents relevant to a user's query from XHTML documents, which are emerging as the standard format for representing documents on the Internet. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | |
Paper # | DE2001-96 |
Date of Issue |
Conference Information | |
Committee | DE |
---|---|
Conference Date | 2001/7/12 (1 day) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Data Engineering (DE) |
---|---|
Language | JPN |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Extraction of Structured Partial Documents Based on IR Technique |
Sub Title (in English) | |
Keyword(1) | |
1st Author's Name | KENJI HATANO |
1st Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology(NAIST) |
2nd Author's Name | HIROKO KINUTANI |
2nd Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology(NAIST) |
3rd Author's Name | MASATOSHI YOSHIKAWA |
3rd Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology(NAIST):Software Research Division, National Institute of Informatics(NII) |
4th Author's Name | SHUNSUKE UEMURA |
4th Author's Affiliation | Graduate School of Information Science, Nara Institute of Science and Technology(NAIST) |
Date | 2001/7/12 |
Paper # | DE2001-96 |
Volume (vol) | vol.101 |
Number (no) | 193 |
Page | pp.- |
#Pages | 8 |
Date of Issue |