50,538 Questions – Test Paper VQA Data

Large Models
Multimodal
Education
Exam Questions

This OCR dataset of exam questions contains 50,538 images covering multiple subjects, question types, and collection devices (mobile phones and scanners). The text is transcribed, and formulas and tables are transcribed in LaTeX format. The dataset can be used for tasks such as intelligent exam paper marking and homework tutoring. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and use. All of our datasets comply with GDPR, CCPA, and PIPL.

Paid Datasets
This is a paid dataset for commercial use, research purposes, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Data size: 50,538 questions
Image resolution: total pixels ≥ 300,000
Subject areas: primary, middle, and high school, university, vocational education, etc.
Question types: multiple-choice (single and multiple selection), fill-in-the-blank, short answer, problem-solving, and questions/answers with illustrations
Collection devices: scanner, mobile phone
Diversity: various subjects and question types
Annotation: quadrilateral bounding boxes and transcription for question stems, options, answers, and illustrations
Data processing: equations and tables transcribed in LaTeX format
Data formats: .jpg, .json, .latex
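
The listing does not publish the annotation schema, so the following is only a minimal Python sketch of how a buyer might sanity-check one record against the specifications above. The field names (`image`, `width`, `height`, `regions`, `type`, `quad`, `text`) and the sample record are hypothetical, not the vendor's actual format. The only values taken from the listing are the resolution floor (total pixels ≥ 300,000) and the fact that regions are annotated with quadrilateral bounding boxes.

```python
import json

# From the specifications: total pixels >= 300,000.
MIN_TOTAL_PIXELS = 300_000


def meets_resolution(width, height):
    """Return True if the image satisfies the listed resolution floor."""
    return width * height >= MIN_TOTAL_PIXELS


def quad_area(points):
    """Area of a quadrilateral (or any simple polygon) via the shoelace formula.

    `points` is a sequence of (x, y) corners listed in order around the shape.
    """
    total = 0.0
    n = len(points)
    for i in range(n):
        x1, y1 = points[i]
        x2, y2 = points[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


# Hypothetical annotation record; the real .json layout may differ.
SAMPLE_RECORD = json.loads("""
{
  "image": "example.jpg",
  "width": 640,
  "height": 480,
  "regions": [
    {
      "type": "stem",
      "quad": [[10, 10], [200, 10], [200, 50], [10, 50]],
      "text": "Solve $x^2 = 4$."
    }
  ]
}
""")


def summarize(record):
    """Collect basic checks for one (hypothetical) annotation record."""
    return {
        "image": record["image"],
        "resolution_ok": meets_resolution(record["width"], record["height"]),
        "region_types": [r["type"] for r in record["regions"]],
        "region_areas": [quad_area(r["quad"]) for r in record["regions"]],
    }


if __name__ == "__main__":
    print(summarize(SAMPLE_RECORD))
```

A check like this can flag undersized images or degenerate (zero-area) boxes before training. The keys would need to be adapted once the actual sample files are available.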