TANGO-DocLab web tables from international statistical sites (Troy_200)
Table segmentation includes labeling cells as table title, row or column header, Notes, Footnotes, Footnote markers, Footnote references Empty row or columns.
The posted GT suffices only for finding minimal indexing row and column headers and the data region.
The critical cells resulting from automated segmentation can be cmpared to the GT critical cells. The error rate can be based on either the 800 critical cells of the 200 tables, or on the number of correctly segmented tables. In our IJDAR 2016 article cited we present both.
No comments on this dataset yet.