ImageCLEF 2016 Handwritten Scanned Document Retrieval Task (IMAGECLEF16-HSDR)
Dataset Information
Dataset URL
http://dx.doi.org/10.5281/zenodo.52994
Keywords
Text retrieval, Handwritten documents, Broken words
Description
Download link: http://doi.org/10.5281/zenodo.52994
Overview
Dataset compiled for the ImageCLEF 2016 Handwritten Scanned Document Retrieval challenge. It is derived from a subset of pages from unpublished manuscripts written by the philosopher and reformer Jeremy Bentham, that have been digitised and transcribed under the Transcribe Bentham project [Causer 2012]. More details about the dataset and the challenge are found in the overview paper at http://ceur-ws.org/Vol-1609/16090233.pdf the slides of the overview presentation at http://imageclef.org/system/files/Villegas16_CLEF_Handwritten-Overview_presentation.pdf or the evaluation web page http://imageclef.org/2016/handwritten.
Cite dataset as
Villegas, Mauricio, Puigcerver, Joan, & Toselli, Alejandro H. (2016). ImageCLEF 2016 Bentham Handwritten Retrieval Dataset [Data set]. Zenodo. http://doi.org/10.5281/zenodo.52994
References
[Causer 2012] T. Causer and V. Wallace, Building a Volunteer Community: Results and Findings from Transcribe Bentham, Digital Humanities Quarterly, Vol. 6 (2012), http://www.digitalhumanities.org/dhq/vol/6/2/000125/000125.html
Examples
IMAGECLEF16-HSDR
- ImageCLEF 2016 Handwritten Scanned Document Retrieval Task v.1
- Ground Truth
- Research Tasks