Multiply oriented and curved handwritten text line dataset (VML-MOC)

Research Tasks

Text line segmentation of multiply oriented and curved handwritten text lines

2019-11-25 (v. 1)

Contact author

Irina Rabaev

Sami Shamoon College of Engineering, Beer Sheva, Israel

irinar@ac.sce.ac.il

+972-8-6475620

Description

To the best of our knowledge, VML-MOC dataset is the first publicly available dataset that introduces the problem of segmenting multiply oriented and curved handwritten text lines.

Experiments made [2]  have shown that ordinary text line segmentation methods are not successful on VML-MOC dataset, and text line segmentation methods without horizontal/straight line assumption has to be developed.

Protocol

The evalustion protocol is ICDAR2017 line segmentation evaluator tool [1].
The tool is freely available as open source.

References

[1] F. Simistira, M. Bouillon, M. Seuret, M. W¨ursch, M. Alberti, R. Ingold, and M. Liwicki Icdar2017 competition on layout analysis for challenging medieval manuscripts in 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR), vol. 1. IEEE, 2017, pp. 1361–1370.

[2] B.Kurar, Rafi Cohen, I. Rabaev, and J. El-Sana VML-MOC: Segmenting a multiply oriented and curved handwritten text lines dataset. In the 3rd International workshop on Arabic and derived Script Analysis and Recognition (ASAR), pp. 13 - 18, 2019. (PDF)

Comments

No comments on this dataset yet.

Add your comment

In order to comment on a dataset you need to be logged on
Register Now!

Valoration

In order to rate this dataset you need to be logged on
Register Now!