ICDAR 2013 - Gender Identification Competition Dataset (GenderIdentifify2013)

Ground Truth

Genders of all writers

2015-01-25 (v. 1)

Contact author

Abdelaali Hassaine

Qatar University




Gender recognition


The genders of the writers of the training set are in train_answers.csv and those of the writers of the test set are in the test_answers.csv

Important Note: This competition has been hosted in kaggle. This website allows participants to see how well
each of their submissions performs on a part of the test set called the public test set (37% for this competition).
However, only the remaining part of the test set (called the private test set) counts towards the final standing. Therefore,
the ranking on the public test set is not necessarily similar to that of the private test set. This is mainly due to participants overfitting to the public test set. In the test_answers.csv there is a column called Usage, this column indicates for each row whether it is used in the public or private leaderboard. Consequently, in order to have results which are comparable with those of the competition, the public test set must be used as a validation set and the private test set as a final test set.

train_answers.csvdata(2 KB)17Genders of writers of the training set
test_answers.csvdata(4 KB)15Genders of writers of the test set


No comments on this dataset yet.

Add your comment

In order to comment on a dataset you need to be logged on
Register Now!


In order to rate this dataset you need to be logged on
Register Now!