Extracting Information from the Web

講演名	2001/8/23 Extracting Information from the Web ,
PDFダウンロードページ	PDFダウンロードページへ
抄録(和)
抄録(英)	A wealth of information is available in Internet, how to get the desired information from Web page quickly, precisely and automatically is a job of information extraction(IE) system. RegTab can extract information from tabular Web pages and SemiTxt can extract information from semi-structured HTML text. Both of them use Machine Learning method to generate extraction rules, which is a bottleneck in IE system.
キーワード(和)
キーワード(英)	information extraction / machine learning / semi-structured text
資料番号	KBSE2001-5
発行日

講演論文情報詳細
申込み研究会	Knowledge-Based Software Engineering (KBSE)
本文の言語	ENG
タイトル（和）
サブタイトル（和）
タイトル（英）	Extracting Information from the Web
サブタイトル（和）
キーワード(1)（和/英）	/ information extraction
第 1 著者氏名（和/英）	/ Junqing Zhang
第 1 著者所属（和/英）	School of Computer Science Beijing Polytechnic University
発表年月日	2001/8/23
資料番号	KBSE2001-5
巻番号（vol）	vol.101
号番号（no）	268
ページ範囲	pp.-
ページ数	8
発行日