A Dataset for Arabic Text Detection, Tracking and Recognition in News Videos - AcTiV (AcTiV)

2016-03-16 (v. 1)

Contact author

Oussama Zayene

SAGE Lab, University of Sousse, Tunisia & DIVA Group, University of Fribourg, Switzerland

oussama.zayene@unifr.ch

0041775093187


This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License.
You can cite this dataset as: Oussama Zayene, A Dataset for Arabic Text Detection, Tracking and Recognition in News Videos - AcTiV (AcTiV), 1, ID: AcTiV_1, URL: http://tc11.cvc.uab.es/datasets/AcTiV_1

Dataset Information

Keywords

Artificial Arabic Text Detection; Artificial Arabic Text Tracking; Arabic Video-OCR; News Video; Video Indexing; Content-based Research

Description

AcTiV is the first publicly accessible annotated dataset designed to assess the performance of Arabic Video-OCR systems. The name AcTiV stands for Arabic Text in Video. The challenges addressed by the AcTiV database are the variability of text patterns and the presence of complex backgrounds containing various objects that resemble text characters. AcTiV enables users to test their systems' abilities to locate, track and read text objects in videos. The current version of the dataset includes 80 videos collected from 4 different Arabic news channels. Two types of video stream were chosen: Standard-Definition (720x576, 25 fps) and High-Definition (1920x1080, 25 fps). We mainly focus on text displayed as an overlay in news video, which can be classified into two types: static text and dynamic text.

http://www.sage-eniso.org/content/fr/20/activ-data-base.html

Two sub-datasets are created from the AcTiV database: AcTiV-D (D for detection) and AcTiV-R (R for recognition). AcTiV-D is a sub-dataset of non-redundant frames used to measure the performance of single-frame methods for detecting and localizing text regions in still HD/SD images. AcTiV-R is a sub-dataset of cropped images used to measure the performance of Arabic OCR systems in reading text in video frames.

Typical video frames from the proposed dataset. Top Sub-figures: examples of Russia Today and ElWataniya1 frames. Bottom Sub-figures: examples of Aljazeera HD and France 24 frames.

 

Comments

pengyong 05-26-2017 07:21
Hello Oussama Zayene! Can you tell me how to download your dataset?
