Presentation 1994/10/20
A Method for Extracting Logical Structure from a Document Image
Yuka Tateishi, Nobuyasu Itoh,
PDF Download Page PDF download Page Link
Abstract(in Japanese) (See Japanese page)
Abstract(in English) A method of stochastic syntactic analysis is applied to extracting the logical structure of a printed document from their physical layout and keywords indicating logical components.The document is parsed as a sentence consisting of text lines and graphic objects according to a stochastic regular grammar with attributes.By using stochastic analysis,the parser can retain possible results in order of their probability,so that it selects an optimal result more appropriately than deterministic systems if ambiguity occurs.
Keyword(in Japanese) (See Japanese page)
Keyword(in English) Logical Structure Extraction / Stochastic Grammar
Paper # NLC94-17,PRU94-42
Date of Issue

Conference Information
Committee NLC
Conference Date 1994/10/20(1days)
Place (in Japanese) (See Japanese page)
Place (in English)
Topics (in Japanese) (See Japanese page)
Topics (in English)
Chair
Vice Chair
Secretary
Assistant

Paper Information
Registration To Natural Language Understanding and Models of Communication (NLC)
Language JPN
Title (in Japanese) (See Japanese page)
Sub Title (in Japanese) (See Japanese page)
Title (in English) A Method for Extracting Logical Structure from a Document Image
Sub Title (in English)
Keyword(1) Logical Structure Extraction
Keyword(2) Stochastic Grammar
1st Author's Name Yuka Tateishi
1st Author's Affiliation IBM Japan Ltd()
2nd Author's Name Nobuyasu Itoh
2nd Author's Affiliation IBM Japan Ltd
Date 1994/10/20
Paper # NLC94-17,PRU94-42
Volume (vol) vol.94
Number (no) 291
Page pp.pp.-
#Pages 8
Date of Issue