Chinese & English & Tibetan & Uyghur Language Dataset

Registered users can download the PDF introduction.

Category:
Enquiry

Additional information

Dataset ID

MD-OCR-007

Dataset Name

Chinese&English&Tibetan&Uyghur Language Dataset

Data Type

Image

Volume

About 38k

Data Collection

Data collection equipment include phones, cameras and tablets. Images include product packages, store names, signposts, posters, parking lots, car stickers, food packaging, signs and book covers, etc.

Annotation

Polygon+Text

Annotation Notes

All text include simplified Chinese, English, numbers, and punctuation marks (comma, period, colon, etc.).

Application Scenarios

Retail, Tourism