Introduction
To verify results and build new research on them, the Document Image Analysis and Recognition (DIAR) community must be able to cross-check and reproduce the results described in published papers in the field. To achieve this, any dataset used as the basis for a publication should be publicly available, as is the norm in many other disciplines.
Authors are actively encouraged to submit the datasets they used to train and/or evaluate their algorithms to their TC(s), so that the datasets can be published on the corresponding websites.
This initiative is not restricted to datasets. We are interested in archiving online any piece of data (ground-truth data, software, etc.) that would make it easy to reproduce results, set new targets, foster healthy competition, encourage collaboration and generally advance the DIAR field as a whole.
Datasets
Per topic
Complex Text Containers (1)
Complex Text Containers: Scene Text (7)
Complex Text Containers: Overlaid text on images (2)
Machine-printed Documents (2)
Mixed Content Documents (2)
Handwritten Documents (2)
Handwritten Documents: On-line and Off-line (2)
Handwritten Documents: On-line (6)
Handwritten Documents: Off-line (22)
Graphical Documents: Sketched Documents (1)
Graphical Documents: Maps (1)
Graphical Documents: Scientific papers (1)
Graphical Documents: Identity Documents (1)
Tables: Tables (1)
Electronic Documents: Tables (1)
Charts (3)
Document Retrieval (1)
Historical Documents: Registers (1)