Synthetic Brazilian Documents Database (SBR-Doc Database)

Research Tasks

Zone Text Segmentation

2021-08-30 (v. 1)

Contact author

Celso A M Lopes Junior

Universidade de Pernambuco

+55 81 992469364

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.


The algorithm should be capable of detecting patterns in the provided dataset; that is, to receive an image (input data) of a document (without a background), and return a picture of the same dimension with non-interest regions in black pixels and regions of interest (text regions) in white pixels.

Task 2

Reference paper:

A Fast Fully Octave Convolutional Neural Network for Document Image Segmentation


Database division:
Statistics from the dataset and experimental partition used in the Train and Test, where C1, C2, and C3 correspond to the 1st, 2nd, and 3rd Tasks, respectively.

To evaluate the methods, the following similarity metrics will be considered:

Dice Similarity Coefficient (DSC);
Scale Invariant Feature Transform (SIFT);
More information in the paper - Access to paper:
Competition on Components Segmentation Task of Document Photos


No comments on this dataset yet.

Add your comment

In order to comment on a dataset you need to be logged on
Register Now!


In order to rate this dataset you need to be logged on
Register Now!