TANGO-DocLab web tables from international statistical sites (Troy_200)

Research Tasks

table segmentation

2016-03-16 (v. 1)

Contact author

George Nagy



1 518 2710 6885


Table segmentation includes labeling cells as table title, row or column header, notes, footnotes, footnote markers, footnote references, and empty rows or columns.

The posted ground truth (GT) suffices only for finding the minimal indexing row and column headers and the data region.


The critical cells resulting from automated segmentation can be compared to the GT critical cells. The error rate can be based either on the 800 critical cells of the 200 tables or on the number of correctly segmented tables. In the cited IJDAR 2016 article we present both.
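
The two error rates described above can be sketched as follows. This is only an illustrative sketch, not the evaluation code used for the article. It assumes each table's critical cells are stored as an ordered list of (row, column) addresses, four per table (800 cells over 200 tables), and that a table counts as correctly segmented only when all of its critical cells match. The function name `error_rates`, the dictionary layout, and the table identifiers are hypothetical and do not reflect the format of the posted GT files.

```python
from typing import Dict, List, Tuple

Cell = Tuple[int, int]  # assumed (row, column) address of one critical cell


def error_rates(gt: Dict[str, List[Cell]],
                pred: Dict[str, List[Cell]]) -> Tuple[float, float]:
    """Return (cell-level error rate, table-level error rate).

    gt and pred map a table identifier to its ordered critical cells.
    A table missing from pred counts as entirely wrong.
    """
    if not gt:
        raise ValueError("ground truth is empty")

    total_cells = 0
    wrong_cells = 0
    wrong_tables = 0
    for table_id, gt_cells in gt.items():
        pred_cells = pred.get(table_id, [])
        # Compare position by position; a missing prediction is an error.
        misses = sum(
            1
            for i, cell in enumerate(gt_cells)
            if i >= len(pred_cells) or pred_cells[i] != cell
        )
        total_cells += len(gt_cells)
        wrong_cells += misses
        if misses > 0:
            wrong_tables += 1

    if total_cells == 0:
        raise ValueError("ground truth contains no critical cells")
    return wrong_cells / total_cells, wrong_tables / len(gt)


# Toy example with two tables; the second has one wrong critical cell.
example_gt = {
    "t1": [(0, 0), (1, 1), (2, 2), (5, 5)],
    "t2": [(0, 0), (2, 1), (3, 2), (9, 4)],
}
example_pred = {
    "t1": [(0, 0), (1, 1), (2, 2), (5, 5)],
    "t2": [(0, 0), (2, 1), (3, 2), (8, 4)],
}
cell_err, table_err = error_rates(example_gt, example_pred)
print(cell_err, table_err)  # 0.125 0.5
```

In the toy example one of eight critical cells is wrong, so the cell-level rate is 0.125, while one of two tables is wrong, so the table-level rate is 0.5. The table-level rate is the stricter of the two, since a single misplaced critical cell makes the whole table count as an error.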

