Multiply oriented and curved handwritten text line dataset (VML-MOC)

Ground Truth

Multiply oriented and curved handwritten text line dataset

2019-11-25 (v. 1)

Contact author

Irina Rabaev

Sami Shamoon College of Engineering, Beer Sheva, Israel

irinar@ac.sce.ac.il

+972-8-6475620

Keywords

curved and skewed text lines, Arabic historical documents, historical documents

Description

The ground truth is provided in three forms: raw pixel labeling, DIVA pixel labeling [1], and PAGE [2] XML files. It is available, together with the document images, at https://www.cs.bgu.ac.il/~berat/data/moc_dataset.zip
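As an illustration of how the PAGE form of the ground truth might be read, the following minimal sketch extracts one polygon per text line from a PAGE XML document. It assumes the files follow the usual PAGE layout, in which each TextLine element has a Coords child holding either a points attribute ("x1,y1 x2,y2 ...") or a list of Point elements with integer x and y attributes. The function name read_text_line_polygons is hypothetical and not part of the dataset, and the layout of the files inside the archive is not assumed here.

```python
import xml.etree.ElementTree as ET


def local_name(tag):
    """Strip the '{namespace}' prefix so the code works across PAGE schema versions."""
    return tag.rsplit("}", 1)[-1]


def read_text_line_polygons(xml_text):
    """Return a list of polygons, one per TextLine, each a list of (x, y) tuples.

    Assumption: coordinates are integers, stored either in a 'points'
    attribute on Coords or as Point child elements of Coords.
    """
    root = ET.fromstring(xml_text)
    polygons = []
    for elem in root.iter():
        if local_name(elem.tag) != "TextLine":
            continue
        for child in elem:
            if local_name(child.tag) != "Coords":
                continue
            points_attr = child.get("points")
            if points_attr:
                # Newer PAGE versions: points="x1,y1 x2,y2 ..."
                polygon = [
                    tuple(int(v) for v in pair.split(","))
                    for pair in points_attr.split()
                ]
            else:
                # Older PAGE versions: <Point x="..." y="..."/> children
                polygon = [
                    (int(p.get("x")), int(p.get("y")))
                    for p in child
                    if local_name(p.tag) == "Point"
                ]
            polygons.append(polygon)
            break  # use only the TextLine's own Coords element
    return polygons
```

To use it, read the contents of one PAGE XML file into a string and pass it to the function. Matching on local element names rather than a fixed namespace is a deliberate choice, since the PAGE namespace URI differs between schema versions.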

References

[1] F. Simistira, M. Bouillon, M. Seuret, M. Würsch, M. Alberti, R. Ingold, and M. Liwicki, "ICDAR2017 competition on layout analysis for challenging medieval manuscripts," in 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1. IEEE, 2017, pp. 1361–1370.

[2] S. Pletschacher and A. Antonacopoulos, "The PAGE (Page Analysis and Ground-truth Elements) format framework," in 2010 20th International Conference on Pattern Recognition. IEEE, 2010, pp. 257–260.
