Arabic & Thai & Vietnamese & Hindi & English & Chinese language Dataset

Registered users can download the PDF introduction.

Category:
Enquiry

Additional information

Dataset ID

MD-OCR-008

Dataset Name

Arabic & Thai & Vietnamese & Hindi & English & Chinese Language Dataset

Data Type

Image

Volume

About 150k

Data Collection

Data collection equipment include phones, cameras and tablets. Image resolution is above 4000*3000 in JPG format. It covers Arabic, Thai, Vietnamese, Hindi, English and Chinese. Over 10 data types including product packaging, sign boards, signposts, poster, electronic devices, parking lots, clothing, buildings, road signs, menu, book covers, shopping malls and tourist attractions, etc.

Annotation

Polygon+Text

Application Scenarios

E-commerce, Retail, Tourism