Annotated Bangla Handwritten Word Images for Character Detection (HandwrittenWordsB)

2024-09-04 (v. 1)

Contact author

Sumaiya Salekin

Independent Researcher

sumaiyasalekin321@gmail.com

+8801751411712

You can cite this dataset as: Sumaiya Salekin, Annotated Bangla Handwritten Word Images for Character Detection (HandwrittenWordsB) ,1,ID:HandwrittenWordsB_1,URL:https://tc11.cvc.uab.es/datasets/HandwrittenWordsB_1

Dataset Information

Keywords

bangla writing, handwritten words, characters annotated, train YOLO

Description

We collected a dataset comprising of 300 handwritten word images.

Newspaper articles containing a total of 10,278 words were collected from Prothom Alo to find the word length distribution of commonly used Bangla words. From the articles, a list of unique words were filtered out and their lengths were analyzed. This distribution was followed when generating the word list for data collection. 20 sets containing 15 words were created to distribute among 20 participants; each participant was given a unique set. The dataset contains actual handwritten text noises like ink blots, smudges, and irregularities in writing style; this guarantees that the feature learning is carried out in a real-world environment.

 

FileTypeSizeDownloadsDescription
Handwritten Words.v1i.yolov8.zipdata(12 MB)1
Comments
No comments on this dataset yet.
Valoration
In order to rate this dataset you need to be logged onLogin / Register