[{"@type":"PropertyValue","name":"Data size","value":"204,522 images"},{"@type":"PropertyValue","name":"Subject areas","value":"primary, middle, and high school, university, vocational education, etc."},{"@type":"PropertyValue","name":"Question types","value":"multiple-choice (single and multiple selection), fill-in-the-blank, short answer, problem-solving, and questions/answers with illustrations"},{"@type":"PropertyValue","name":"Collection devices","value":"scanner, mobile phone"},{"@type":"PropertyValue","name":"Diversity","value":"various subjects and question types"},{"@type":"PropertyValue","name":"Annotation","value":"quadrilateral bounding boxes and transcription for question stems, options, answers, and illustrations"},{"@type":"PropertyValue","name":"Data processing","value":"equations and tables transcribed in LaTeX format"},{"@type":"PropertyValue","name":"Data formats","value":".jpg, .json, .latex"}]
{"id":1574,"datatype":"1","titleimg":"https://www.nexdata.ai/shujutang/static/image/index/datatang_tuxiang_default.webp","type1":"226","type1str":null,"type2":"254","type2str":null,"dataname":"204,522 Questions – Test Paper VQA Dataset with LaTeX Formulas","datazy":[{"title":"Data size","content":"204,522 images"},{"title":"Subject areas","content":"primary, middle, and high school, university, vocational education, etc."},{"title":"Question types","content":"multiple-choice (single and multiple selection), fill-in-the-blank, short answer, problem-solving, and questions/answers with illustrations"},{"title":"Collection devices","content":"scanner, mobile phone"},{"title":"Diversity","content":"various subjects and question types"},{"title":"Annotation","content":"quadrilateral bounding boxes and transcription for question stems, options, answers, and illustrations"},{"title":"Data processing","content":"equations and tables transcribed in LaTeX format"},{"title":"Data formats","content":".jpg, .json, .latex"}],"datatag":"","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":null,"samplePresentation":[],"officialSummary":"This dataset contains 204,522 test paper questions, covering multiple subjects, question types, and collection devices (mobile phones, scanners). The text is fully transcribed, and formulas and tables are transcribed using LaTeX format. This dataset can be used for tasks such as intelligent exam paper marking and homework tutoring. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":null,"datakeyword":["VQA dataset","Exam question answering dataset","Intelligent exam dataset","Homework tutoring dataset","Question-answering exam dataset","Multi-subject exam dataset","test paper dataset"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"llm","dataShowType":"[{\"code\":\"0\",\"language\":\"ZH\"},{\"code\":\"1\",\"language\":\"ZH\"},{\"code\":\"2\",\"language\":\"EN,DE,KO,FR,ES\"},{\"code\":\"3\",\"language\":\"EN\"}]","productNameEn":"50,538 Questions – Test Paper VQA Data","BGimg":"","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}
204,522 Questions – Test Paper VQA Dataset with LaTeX Formulas
VQA dataset
Exam question answering dataset
Intelligent exam dataset
Homework tutoring dataset
Question-answering exam dataset
Multi-subject exam dataset
test paper dataset
This dataset contains 204,522 test paper questions, covering multiple subjects, question types, and collection devices (mobile phones, scanners). The text is fully transcribed, and formulas and tables are transcribed using LaTeX format. This dataset can be used for tasks such as intelligent exam paper marking and homework tutoring. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Data size
204,522 images
Subject areas
primary, middle, and high school, university, vocational education, etc.
Question types
multiple-choice (single and multiple selection), fill-in-the-blank, short answer, problem-solving, and questions/answers with illustrations
Collection devices
scanner, mobile phone
Diversity
various subjects and question types
Annotation
quadrilateral bounding boxes and transcription for question stems, options, answers, and illustrations