en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

m.nexdata.datatang.com

29,954 Images - Southeast Asian OCR Dataset (Khmer, Lao, Burmese)

Southeast Asian OCR dataset
Khmer OCR data
Lao OCR dataset
Burmese OCR dataset
natural scene OCR
minority language text recognition
multilingual OCR dataset

29,954 Images - OCR Collection Data in Southeast Asian Languages, including Khmer (Cambodia), Lao and Burmese. The diversity of collection includes multiple languages, multiple collection types, multiple shooting angles. This set of data can be used for Southeast Asian language OCR tasks.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Data size
29,954 images, including 8,798 images in Khmer (Cambodian), 11,575 images in Lao, and 9,581 images in Burmese
Collecting environment
Natural scenes: shop signs, posters, warnings, road signs, food packages, billboards, street views, etc. Document Photograph: cards, receipts, newspapers, books (documents, newspapers, books, test pap
Data diversity
multiple languages, multiple collection types, multiple shooting angles
Device
cellphone, computer
Data format
the image format is a common one such as .png
Accuracy rate
according to the collection requirements, the collection accuracy is not less than 95%
Sample Sample
  • 29,954 Images - Southeast Asian OCR Dataset (Khmer, Lao, Burmese)
  • 29,954 Images - Southeast Asian OCR Dataset (Khmer, Lao, Burmese)
  • 29,954 Images - Southeast Asian OCR Dataset (Khmer, Lao, Burmese)
Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

9319d477-c8e7-4184-bd00-60e63c0ab619

6da5b67d-e737-4f06-8229-542a126ac40a