Presentation | 2013-03-15 Automatic Word Ground Truth Generation for Camera Captured Documents Sheraz AHMED, Koichi KISE, Masakazu IWAMURA, Marcus LIWICKI, Andreas DENGEL, |
---|---|
PDF Download Page | PDF download Page Link |
Abstract(in Japanese) | (See Japanese page) |
Abstract(in English) | A database for camera captured documents is useful to train OCRs to obtain better performance. However, no dataset exists for camera captured documents because it is very laborious and costly to build these datasets manually. In this paper, a fully automatic approach allowing building the very large scale (i. e., millions of images) labeled camera captured documents dataset is proposed. The proposed approach does not require any human intervention in labeling. Evaluation of samples generated by the proposed approach shows that more than 97% of the images are correctly labeled. Novelty of the proposed approach lies in the use of document image retrieval for automatic labeling, especially for camera captured documents, which contain different distortions specific to camera, e.g., blur, perspective distortion, etc. |
Keyword(in Japanese) | (See Japanese page) |
Keyword(in English) | Ground truth / Locally Likely Arrangement Hashing (LLAH) / Camera Captured Documents / Perspective distortion / Blur |
Paper # | PRMU2012-204 |
Date of Issue |
Conference Information | |
Committee | PRMU |
---|---|
Conference Date | 2013/3/7(1days) |
Place (in Japanese) | (See Japanese page) |
Place (in English) | |
Topics (in Japanese) | (See Japanese page) |
Topics (in English) | |
Chair | |
Vice Chair | |
Secretary | |
Assistant |
Paper Information | |
Registration To | Pattern Recognition and Media Understanding (PRMU) |
---|---|
Language | ENG |
Title (in Japanese) | (See Japanese page) |
Sub Title (in Japanese) | (See Japanese page) |
Title (in English) | Automatic Word Ground Truth Generation for Camera Captured Documents |
Sub Title (in English) | |
Keyword(1) | Ground truth |
Keyword(2) | Locally Likely Arrangement Hashing (LLAH) |
Keyword(3) | Camera Captured Documents |
Keyword(4) | Perspective distortion |
Keyword(5) | Blur |
1st Author's Name | Sheraz AHMED |
1st Author's Affiliation | German Research Center for Artificial Intelligence (DFKI)() |
2nd Author's Name | Koichi KISE |
2nd Author's Affiliation | Osaka Prefecture University |
3rd Author's Name | Masakazu IWAMURA |
3rd Author's Affiliation | Osaka Prefecture University |
4th Author's Name | Marcus LIWICKI |
4th Author's Affiliation | University of Fribourg |
5th Author's Name | Andreas DENGEL |
5th Author's Affiliation | German Research Center for Artificial Intelligence (DFKI) |
Date | 2013-03-15 |
Paper # | PRMU2012-204 |
Volume (vol) | vol.112 |
Number (no) | 495 |
Page | pp.pp.- |
#Pages | 6 |
Date of Issue |