Presentation 2000/1/13
Automatic Collection of People's Information from the World Wide Web
Ayumi YAMAMOTO, Satoshi SATO,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) This paper proposes two methods for collecting people's information from the World Wide Web. From the given occupation category such as Seijika(politicians), the first method collects web pages that include tables whose content is people lists of the given occupation, and extract personal properties such as name and birthday for each person by using table analysis. The second method accepts a person name and her occupation as an input, and collects her profile in text form by using layout analysis of HTML texts.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) World Wide Web / automatic extraction of people's information / table analysis / information extraction / search engine
Paper # AI99-89
Date of Issue

Conference Information
Committee AI
Conference Date 2000/1/13(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Artificial Intelligence and Knowledge-Based Processing (AI)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Automatic Collection of People's Information from the World Wide Web
Sub Title (in English)
Keyword(1) World Wide Web
Keyword(2) automatic extraction of people's information
Keyword(3) table analysis
Keyword(4) information extraction
Keyword(5) search engine
1st Author's Name Ayumi YAMAMOTO
1st Author's Affiliation School of Information Science, Japan Advanced Institute of Science and Technology()
2nd Author's Name Satoshi SATO
2nd Author's Affiliation School of Information Science, Japan Advanced Institute of Science and Technology
Date 2000/1/13
Paper # AI99-89
Volume (vol) vol.99
Number (no) 534
Page pp.pp.-
#Pages 8
Date of Issue