Handwritten Text Recognition on tranScriptorium Datasets: Bentham R0 (HTR Competition 2014)

2017-01-19 (v. 1)

Contact author

Joan Andreu Sánchez

Pattern Recognition and Human Language Technologies - Universitat Politècnica de València

jandreu@prhlt.upv.es

(+34) 96 387 7358

(+34) 96 387 7359


This work is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License.
You can cite this dataset as: Joan Andreu Sánchez, Handwritten Text Recognition on tranScriptorium Datasets: Bentham R0 (HTR Competition 2014), 1, ID: HTR Competition 2014_1, URL: http://tc11.cvc.uab.es/datasets/HTR Competition 2014_1

Dataset Information

Keywords

Historical Handwritten Text Recognition

Description

The Bentham collection consists of images of works on law and moral philosophy written by the philosopher Jeremy Bentham.

The selected subset was written by several hands (Bentham himself and his secretaries) and entails significant variability and difficulties regarding the quality of the text images and the writing styles. Training and test data were provided in the form of carefully segmented line images, along with the corresponding transcripts.

This dataset is freely available for research purposes and is provided in two parts: the images and the ground truth (GT). The GT includes information about the layout and the line-level transcription of each image in PAGE format.
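Since the GT is distributed in PAGE format, a short sketch of how line-level transcripts might be read from such a file may help. This is a minimal illustration, not the dataset's own tooling: it assumes the usual PAGE XML layout (`TextLine` elements carrying an `id` attribute and a `TextEquiv`/`Unicode` child), matches tags by local name so that any PAGE schema version is accepted, and uses a made-up sample document. The function name `read_page_lines` and the sample content are hypothetical; the actual files in this dataset may differ in detail.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix that ElementTree puts on tags."""
    return tag.rsplit("}", 1)[-1]


def read_page_lines(xml_text):
    """Return a list of (line_id, transcript) pairs from a PAGE XML string."""
    root = ET.fromstring(xml_text)
    lines = []
    for elem in root.iter():
        if local_name(elem.tag) != "TextLine":
            continue
        transcript = ""
        # Inspect direct children only, so any word-level TextEquiv is ignored.
        for child in elem:
            if local_name(child.tag) == "TextEquiv":
                for sub in child:
                    if local_name(sub.tag) == "Unicode":
                        transcript = sub.text or ""
        lines.append((elem.get("id"), transcript))
    return lines


# Hypothetical sample, made up for illustration; not taken from the dataset.
SAMPLE = """
<PcGts xmlns="http://schema.primaresearch.org/PAGE/gts/pagecontent/2013-07-15">
  <Page imageFilename="example.jpg" imageWidth="100" imageHeight="100">
    <TextRegion id="r1">
      <Coords points="0,0 100,0 100,100 0,100"/>
      <TextLine id="r1_l1">
        <Coords points="0,0 100,0 100,20 0,20"/>
        <TextEquiv><Unicode>first example line</Unicode></TextEquiv>
      </TextLine>
      <TextLine id="r1_l2">
        <Coords points="0,20 100,20 100,40 0,40"/>
        <TextEquiv><Unicode>second example line</Unicode></TextEquiv>
      </TextLine>
    </TextRegion>
  </Page>
</PcGts>
"""

if __name__ == "__main__":
    for line_id, text in read_page_lines(SAMPLE):
        print(line_id, text, sep="\t")
```

For a file on disk, the same function could be called on the file's contents, for example `read_page_lines(open(path, encoding="utf-8").read())`, where `path` points to one of the GT files.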

 

Technical Details

The dataset includes a README describing the amount of training and test data.

 

File             Type      Size     Downloads   Description
PID3223135.pdf   article   359 KB   10
