A Swedish Historical Handwritten Digit Dataset (ARDIS)
Handwritten digit recognition, ARDIS dataset, Machine learning methods, Benchmark
This is a new image-based handwritten historical digit dataset named ARDIS (Arkiv Digital Sweden). The images in ARDIS dataset are extracted from 15.000 Swedish church records which were written by different priests with various handwriting styles in the nineteenth and twentieth centuries. The constructed dataset consists of three single digit datasets and one digit strings dataset. The digit strings dataset includes 10.000 samples in Red-Green-Blue (RGB) color space, whereas, the other datasets contain 7.600 single digit images in different color spaces. Figure 1 illustrates handwritten digit images from different datasets in ARDIS.
If you use any of these data sets, please cite that as: H. Kusetogullari, A. Yavariabdi, A. Cheddad, H. Grahn and J. Hall, "ARDIS: A Swedish Historical Handwritten Digit Dataset," Neural Computing and Applications, 2019, Springer. DOI: 10.1007/s00521-019-04163-3
URL for data sets download: