Presentation 1998/1/23
Automatic HTML Document Producing from Tabular Form Document Using Node Classification
Toru Tanaka, Shinji Tsuruoka, Xinkai Chen, Muneaki Ishida,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) Dealing with lines of a table is something different from dealing with word data. And they are obstacles very often for OCR. When we handle these tables with OCR, we usually have to give it information such as the location of the table or the form of the table as a preprocessing by a mouse. In this paper, we propose a new method of understanding of tabular form document. In the process of the node classification, we can obtain the information of the location. And the node classification can give us an output file in HTML.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Line Extraction / Node Classification / HTML / Document Analysis
Paper # PRMU97-213
Date of Issue

Conference Information
Committee PRMU
Conference Date 1998/1/23(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Pattern Recognition and Media Understanding (PRMU)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) Automatic HTML Document Producing from Tabular Form Document Using Node Classification
Sub Title (in English)
Keyword(1) Line Extraction
Keyword(2) Node Classification
Keyword(3) HTML
Keyword(4) Document Analysis
1st Author's Name Toru Tanaka
1st Author's Affiliation Department of Electrical and Electronic Engineering, Faculty of Engineering, Mie University()
2nd Author's Name Shinji Tsuruoka
2nd Author's Affiliation Department of Electrical and Electronic Engineering, Faculty of Engineering, Mie University
3rd Author's Name Xinkai Chen
3rd Author's Affiliation Department of Electrical and Electronic Engineering, Faculty of Engineering, Mie University
4th Author's Name Muneaki Ishida
4th Author's Affiliation Department of Electrical and Electronic Engineering, Faculty of Engineering, Mie University
Date 1998/1/23
Paper # PRMU97-213
Volume (vol) vol.97
Number (no) 501
Page pp.pp.-
#Pages 8
Date of Issue