University of La Rochelle - SmartDoc QA (SmartDoc-QA)

2017-01-11 (v. 1)

Contact author

Nibal Nayef

University of La Rochelle, France

nibal.nayef@univ-lr.fr

03300000000

03300000000

You can cite this dataset as: Nibal Nayef, University of La Rochelle - SmartDoc QA (SmartDoc-QA) ,1,ID:SmartDoc-QA_1,URL:http://tc11.cvc.uab.es/datasets/SmartDoc-QA_1

Dataset Information

Keywords

smartphone, mobile document capture, quality assessment

Description

odern smartphones have a revolutionary impact on the way people digitize the paper documents. The wide ownership of smartphones and their ease of use for digitizing paper documents has resulted into massive amount of imagery data of digitized paper documents. The goal of digitizing the paper documents is not only to archive them for sharing but also, most of the times, to process them by automated document image processing systems. The latter extracts the content of the document images for recognizing it, indexing it, verifying it, comparing it with a database etc. However, it is a known fact that the cameras of the smartphones are optimized for capturing natural scene images. Taking a simple photo of a paper document does not ensure that its content would be exploitable by automated document image processing systems. This could happen because of the light conditions, the resolution of the image, the camera noise, the perspective distortion, the physical distortions (folds etc.) of the paper, the out-of-focus blur and/or the motion blur during capture. To ensure that the content of a captured document image is exploitable by automated systems, it is important to automatically assess the quality of a captured document image in real-time. Otherwise most of the times it is not possible to re-capture the document image later on, because the original document is not available anymore. Assessing the quality of a captured document image is also required in situations where the captured document images are to-be transmitted for further processing.

 

The quality assessment step is an important part of both the acquisition and the digitization processes. Assessing document quality could aid users during the capture process or help improve image enhancement methods after a document has been captured. Current state-of-the-art works lack databases in the field of document image quality assessment. 

 

In order to provide a baseline benchmark for quality assessment methods for mobile captured documents, we present a database for quality assessment that contains both single- and multiply-distorted document images.

 

capture system

 

Sample 1

Sample 2

Magnified view of blurry document

 

Comments

No comments on this dataset yet.

Add your comment

In order to comment on a dataset you need to be logged on
Register Now!

Valoration

In order to rate this dataset you need to be logged on
Register Now!

University of La Rochelle - SmartDoc QA