Arbitrary-Shaped Text (ICDAR-2019 ArT)

2019-05-29 (v. 1)

Contact author

Yipeng Sun

Baidu Inc, Beijing, China


+86 10 56082834

You can cite this dataset as: Yipeng Sun, Arbitrary-Shaped Text (ICDAR-2019 ArT), 1, ID:ICDAR-2019 ArT_1, URL: ArT_1

Dataset Information

Dataset URL


Arbitrary Shaped Text, Chinese, English


Update: alternative download is available through the RRC Platform (registration required):


ArT combines Total-Text, SCUT-CTW1500 and Baidu Curved Scene Text, which were collected to introduce the arbitrary-shaped text problem to the scene text community. On top of the 3,055 existing images, 7,111 images were added to this mixture, making ArT one of the larger-scale scene text datasets today. The ArT dataset contains 10,166 images in total, split into a training set of 5,603 images and a testing set of 4,563 newly collected images. The dataset was collected with text shape diversity in mind, so all existing text shapes (i.e. horizontal, multi-oriented, and curved) are well represented.
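The image counts quoted in the description can be cross-checked with a few lines of Python. This is only a sanity-check sketch: the constant names and the `summarize_counts` helper are hypothetical and not part of any official ArT tooling, and the numbers are copied from the description rather than read from the dataset files.

```python
# Image counts as stated in the dataset description (not read from disk).
EXISTING_IMAGES = 3055   # images carried over from the earlier datasets
ADDED_IMAGES = 7111      # images added on top of the existing ones
TRAIN_IMAGES = 5603      # size of the training set
TEST_IMAGES = 4563       # size of the testing set


def summarize_counts():
    """Return the total image count and the train/test fractions."""
    total = EXISTING_IMAGES + ADDED_IMAGES
    # Both ways of partitioning the dataset should give the same total.
    if total != TRAIN_IMAGES + TEST_IMAGES:
        raise ValueError("source counts and split counts disagree")
    return {
        "total": total,
        "train_fraction": TRAIN_IMAGES / total,
        "test_fraction": TEST_IMAGES / total,
    }


if __name__ == "__main__":
    summary = summarize_counts()
    print(f"total images: {summary['total']}")
    print(f"train: {summary['train_fraction']:.1%}, "
          f"test: {summary['test_fraction']:.1%}")
```

Both partitions sum to 10,166 images, and the split works out to roughly 55% training and 45% testing.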

Alexander Illarionov 07-31-2019 14:35
The Baidu link is broken. Could you please fix it?