Datasets per Publication

avg.: 4.4 by 12 users
Task: Text Detection in Natural Images13-01-2014 (v. 1)
TradeMarks Image Database24-06-2014 (v. 1)

avg.: 4.2 by 20 users
Ground Truth: Image annotation13-01-2014 (v. 1)
Table Ground Truth for the UW3 and UNLV datasets24-06-2014 (v. 1)
Ground Truth: Table structure and OCR GT dataset for UW3 and UNLV datasets24-06-2014 (v. 1)

Ground Truth: Ground truth word and character bounding boxes for IIIT-5K word18-02-2015 (v. 1)
Task: Open and closed vocabulary scene text recognition, scene character recognition 18-02-2015 (v. 1)
Signature Verification and Writer Identification Competitions for On- and Offline Skilled Forgeries 24-02-2015 (v. 1)
Task: Task SigDutch (Dutch Signatures: Off-line)24-02-2015 (v. 1)
Task: Task SigJapanese (Japanese Signatures: Off-line)24-02-2015 (v. 1)
Task: Task SigJapanese (Japanese Signatures: On-line)24-02-2015 (v. 1)
Task: Task Wi (Writer Identification & Retrieval)24-02-2015 (v. 1)
Persian Heritage Image Binarization Dataset (PHIBD 2012)18-07-2017 (v. 1)
Ground Truth: Binarized images for PHIBD 2012 dataset03-08-2018 (v. 1)
Task: Binarization of PHIBD 2012 dataset03-08-2018 (v. 1)

Ground Truth: Ground truth tables are in the zip files containing the data set.06-01-2020 (v. 1)
Task: Handwritten digit and date string recognition06-01-2020 (v. 1)

Ground Truth: Transcription for the LSVT dataset29-05-2019 (v. 1)
Task: Text detection29-05-2019 (v. 1)
Task: End-to-end text spotting29-05-2019 (v. 1)

Ground Truth: Transcription for the ArT dataset29-05-2019 (v. 1)
Task: Scene Text Detection29-05-2019 (v. 1)
Task: Scene Text Recognition29-05-2019 (v. 1)
Task: Scene Text Spotting29-05-2019 (v. 1)
ImageCLEF 2016 Handwritten Scanned Document Retrieval Task25-06-2019 (v. 1)
Malayalam Character Image Database21-07-2019 (v. 1)
Ground Truth: Labels for the Character Images21-07-2019 (v. 1)
Task: Character Recognition for Malayalam Document Images21-07-2019 (v. 1)
Tobacco 800 Dataset13-09-2019 (v. 1)

Ground Truth: Multiply oriented and curved handwritten text line dataset25-11-2019 (v. 1)
Task: Text line segmentation of multiply oriented and curved handwritten text lines25-11-2019 (v. 1)
UHaT Dataset: Urdu Handwritten Text Dataset17-02-2020 (v. 1)

Task: Gender Classification from Offline Handwritten Images30-04-2021 (v. 1)


Ground Truth: Ground Truth for Chart Recognition09-01-2021 (v. 1)
Task: Chart Image Classification09-01-2021 (v. 1)
Task: Text Detection and Recognition09-01-2021 (v. 1)
Task: Text Role Classification09-01-2021 (v. 1)
Task: Axis Analysis09-01-2021 (v. 1)
Task: Legend Analysis09-01-2021 (v. 1)
Task: Data Extraction09-01-2021 (v. 1)
Task: End-to-End Data Extraction09-01-2021 (v. 1)

Ground Truth: XML file for LSVTD dataset01-06-2021 (v. 1)
Task: Video Text Detection01-06-2021 (v. 1)
Task: Video Text Tracking01-06-2021 (v. 1)
Task: End-to-end Video Text Spotting01-06-2021 (v. 1)

Ground Truth: Semi-annotated Birth Record14-09-2021 (v. 1)
ASAR 2017 - 1st International Workshop on Arabic Script Analysis and Recognition
VML-HD: The Historical Arabic Documents Dataset for Recognition Systems12-06-2018 (v. 1)
Ground Truth: Subword level annotation for the VML-HD Dataset12-06-2018 (v. 1)
Task: Segmentation Free Recognition Track12-06-2018 (v. 1)
Task: Segmentation Based Recognition Track12-06-2018 (v. 1)
DRR 2011 - Document Recognition and Retrieval XVIII

Ground Truth: Flowchart recognition20-06-2018 (v. 1)
ICDAR 2009 - 10th International Conference on Document Analysis and Recognition, Barcelona, Spain

Ground Truth: Ground Truth Information for the ICDAR 2009 Signature Verification competition (SigComp2009)06-03-2015 (v. 1)
Task: Signature Verification23-02-2015 (v. 1)
ICDAR 2013 - 12th International Conference on Document Analysis and Recognition, Washington, DC, USA

Ground Truth: Genders of all writers25-01-2015 (v. 1)
Task: Gender identification using all documents25-01-2015 (v. 1)

Ground Truth: Critical cells for table header segmentation16-03-2016 (v. 1)
Task: table segmentation16-03-2016 (v. 1)
![]() | ICDAR 2015 - 13th IAPR International Conference on Document Analysis and Recognition, Nancy, France |

Ground Truth: Global xml file16-03-2016 (v. 1)
Ground Truth: Detection Ground-truth files16-03-2016 (v. 1)
Task: Text Detection in Arabic NewsVideo Frames16-03-2016 (v. 1)
Task: Text Tracking in Arabic NewsVideo16-03-2016 (v. 1)
Task: Text Recognition in Arabic NewsVideo Frames16-03-2016 (v. 1)

Ground Truth: GT for the HTR Competition 201520-01-2017 (v. 1)
Task: Handwritten Text Recognition20-01-2017 (v. 1)
ICDAR2015 Competition on Signature Verification and Writer Identification for On- and Off-line Skilled Forgeries04-12-2017 (v. 1)
Synchromedia Multispectral Ancient Document Images Dataset05-08-2018 (v. 1)
Ground Truth: Ground Truth images 15-08-2018 (v. 1)
Task: ICDAR 2015 MultiSpectral Text Extraction Contest (MS-TEx 2015)15-08-2018 (v. 1)
ICDAR 2017 - 14th IAPR International Conference on Document Analysis and Recognition, Kyoto, Japan
Total-Text21-02-2018 (v. 1)
Ground Truth: Ground Truth for Total-Text dataset30-01-2019 (v. 1)
Task: Scene Text Detection in Natural Scene Images23-07-2018 (v. 1)
Task: Scene Text Recognition in Natural Scene Images26-07-2018 (v. 1)
Task: Text Segmentation in Natural Scene Images26-07-2018 (v. 1)

Ground Truth: Ground Truth for DIB Platform05-12-2017 (v. 1)
ICDAR2017 Competition on Historical Document Writer Identification (Historical-WI)02-08-2018 (v. 1)
Ground Truth: ID of the Writer02-08-2018 (v. 1)
Task: Writer identification02-08-2018 (v. 1)

Ground Truth: Transcription for the competition on Post-OCR Text Correction 201728-05-2019 (v. 1)
![]() | ICDAR 2019 - 15th International Conference on Document Analysis and Recognition, Sidney, Autralia |

Ground Truth: Chart Elements Annotations for ICDAR CHART 201918-06-2019 (v. 1)
Task: Chart Image Classification18-06-2019 (v. 1)
Task: Text Detection and Recognition18-06-2019 (v. 1)
Task: Text Role Classification18-06-2019 (v. 1)
Task: Axis Analysis18-06-2019 (v. 1)
Task: Legend Analysis18-06-2019 (v. 1)
ICDAR 2019 Historical Document Reading Challenge on Large Structured Chinese Family Records29-08-2019 (v. 1)
Ground Truth: Handwritten Character Recognition on extracted textlines29-08-2019 (v. 1)
Ground Truth: Layout Analysis on structured historical document images29-08-2019 (v. 1)
Ground Truth: Complete, integrated textline detection and recognition on a large dataset29-08-2019 (v. 1)
Task: Handwritten Character Recognition on extracted textlines30-12-2019 (v. 2)
Task: Layout Analysis on structured historical document images30-12-2019 (v. 2)
Task: Complete, integrated textline detection and recognition on a large dataset30-12-2019 (v. 2)

Ground Truth: Transcription for the competition on Post-OCR Text Correction 201920-10-2019 (v. 1)
Task: Detection of OCR errors20-10-2019 (v. 1)
Task: Correction of OCR errors20-10-2019 (v. 1)

Ground Truth: The ground truth is provided in PAGE format10-12-2019 (v. 1)
Task: Word Spotting10-12-2019 (v. 1)
ICDAR 2019 Competition on Image Retrieval for Historical Handwritten Documents Dataset06-01-2020 (v. 1)
Ground Truth: Writer associations06-01-2020 (v. 1)
Task: Image Retrieval in Historical Documents06-01-2020 (v. 1)

Ground Truth: Math expression for online and off handwriting29-01-2020 (v. 1)
Ground Truth: Typeset Formula Detection29-01-2020 (v. 1)
Task: Online Handwritten Formula Recognition29-01-2020 (v. 1)
Task: Offline Handwritten Formula Recognition29-01-2020 (v. 1)
Task: Detection of Formulas in Document Pages29-01-2020 (v. 1)

Task: Handwritten Mathematical Expression Clustering30-07-2020 (v. 1)
![]() | ICDAR 2021 – 16th International Conference on Document Analysis and Recognition |

Ground Truth: Ground Truth of the images for SBR-Doc Database22-08-2021 (v. 1)
Task: Document Boundary Segmentation22-08-2021 (v. 1)
Task: Zone Text Segmentation30-08-2021 (v. 1)
Task: Signature Segmentation31-08-2021 (v. 1)

Ground Truth: Labels for Handwritten Chess Moves04-07-2021 (v. 1)
Task: Latin Handwriting Recognition in Chess Scoresheets04-07-2021 (v. 1)

Task: Natural Scenes Text Recognition under Occlusion04-09-2021 (v. 1)
ICFHR 2010 - 12th International Conference on Frontiers in Handwriting Recognition, Kolkata, India

Ground Truth: Writer ID Information for the 4NSigComp2010 dataset06-03-2015 (v. 1)
Task: Signature Verification23-02-2015 (v. 1)
![]() | ICFHR 2014 - 14th International Conference on Frontiers in Handwriting Recognition, Crete Island, Greece |

Ground Truth: CROHME16-02-2015 (v. 1)
Task: Mathematical Expression Recognition16-02-2015 (v. 1)
Task: Isolated Mathematical Symbol Recognition16-02-2015 (v. 1)
Task: Matrix Recognition16-02-2015 (v. 1)

Ground Truth: Ground truth for the HTR Competition 201420-01-2017 (v. 1)
Task: Handwritten Text Recogntion on Historical Documents20-01-2017 (v. 1)
ICFHR 2016 - 15th International Conference on Frontiers in Handwriting Recognition, China

Ground Truth: HANDS-VNOnDB25-11-2016 (v. 1)
Task: Writer Independent Handwritten Text Recognition25-11-2016 (v. 1)

Ground Truth: Ground Truth for HTR Competition 201620-01-2017 (v. 1)
Task: Handwritten Text Recognition of Historical Documents20-01-2017 (v. 1)

Ground Truth: CROHME18-07-2017 (v. 1)
Task: CROHME2016-Formulas18-07-2017 (v. 1)
Task: CROHME2016-Symbols18-07-2017 (v. 1)
Task: CROHME2016-Structure18-07-2017 (v. 1)
Task: CROHME2016-Matrices18-07-2017 (v. 1)
![]() | ICFHR 2018 |
Thai Student Signature and Name Components Datasets (TSSNCDs)23-01-2018 (v. 1)
Ground Truth: ICFHR2018_TSSNCDs_ground truth12-12-2019 (v. 1)
ICFHR 2018 - 16th International Conference on Frontiers in Handwriting Recognition, Niagara Falls, USA

Ground Truth: PAGE-XML and PNG files for the AnnotationDB dataset31-05-2018 (v. 1)
Task: Handwritten Annotation Detection and Segmentation31-05-2018 (v. 1)

Ground Truth: Groundtruth28-02-2018 (v. 1)
Task: ICFHR2018 Competition on Vietnamese Online Handwriting Recognition28-02-2018 (v. 1)