Handwritten Text Recognition on tranScriptorium Datasets: Bentham R0 (HTR Competition 2014)

Research Tasks

Handwritten Text Recogntion on Historical Documents

2017-01-20 (v. 1)

Contact author

Joan Andreu Sánchez

Pattern Recognition and Human Language Technologies - Universitat Politècnica de València

jandreu@prhlt.upv.es

(+34) 96 387 7358

(+34) 96 387 7359


This work is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License.

Description

A contest on Handwritten Text Recognition organised in the context of the ICFHR 2014 conference was proposed. Two tracks with increased freedom on the use of training data were proposed. The handwritten images for this contest were drawn from an English data set which was considered in the tranScriptorium project.

The so-called ``Bentham collection'' was considered in tranScriptorium.  It encompassed a large set of manuscripts written by the renowned English philosopher and reformer Jeremy Bentham (1748-1832).  A small subset of this collection has been chosen for this HTR competition. The selected subset has been written by several hands (Bentham himself and his secretaries) and entails significant varibilities and difficulties regarding the quality of text images and writting styles. Training and test data were provided in the form of carefully segmented line images, along with the corresponding transcripts.

Cite this dataset as:

"ICFHR2014 competition on handwritten text recognition on transcriptorium datasets (HTRtS)", Joan Andreu Sánchez, Verónica Romero, Alejandro H Toselli, Enrique Vidal.  International Conference on Frontiers in Handwriting Recognition (ICFHR), 2014, pp. 785-790

 

Protocol

 Word Error Rate and Character Error Rate

Comments

No comments on this dataset yet.

Add your comment

In order to comment on a dataset you need to be logged on
Register Now!

Valoration

In order to rate this dataset you need to be logged on
Register Now!