The 10,100 Image Caption Data of Human Face dataset covers multiple races and four age groups: under 18 years old, 18~45 years old, 46~60 years old, and over 60 years old. Collection scenes include both indoor and outdoor settings. The image content is varied, including subjects wearing masks, glasses, or headphones, as well as facial expressions, gestures, and adversarial examples. The text descriptions are in English and mainly describe race, gender, age, shooting angle, lighting, and diversity content.
This is a paid dataset for commercial use, research purposes, and more. Licensed ready-made datasets help jump-start AI projects.
Specifications
Data size
10,100 images
Race distribution
Asian, Caucasian, Black, Brown
Gender distribution
male, female
Age distribution
under 18 years old, 18~45 years old, 46~60 years old, over 60 years old
Collection environment
including indoor scenes and outdoor scenes
Collection diversity
different age groups, different collection environments, and different seasons
Diversity of content
including wearing masks, adversarial samples, expression data, wearing glasses, wearing headphones, and multiple gestures
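Data format
image format is .jpg, text format is .txt
Description language
English, Chinese
Text length
in principle, 30~60 words, usually 3-5 sentences
Main description content
race, gender, age, shooting angle, lighting, diversity content
Accuracy rate
the proportion of correctly labeled images is not less than 97%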
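The dataset is delivered as .jpg images with .txt descriptions, and captions are nominally 30~60 words long. A minimal Python sketch of how a buyer might pair images with captions and flag captions outside that range is shown here. The directory layout and the rule that a caption shares its image's file name are assumptions, not something the listing documents, and all function names are hypothetical.

```python
from pathlib import Path


def pair_images_with_captions(root):
    """Return (image_path, caption_text) pairs.

    Assumption: each caption is a .txt file with the same name as its .jpg.
    Images without a matching .txt file are skipped.
    """
    pairs = []
    for image_path in sorted(Path(root).rglob("*.jpg")):
        caption_path = image_path.with_suffix(".txt")
        if caption_path.is_file():
            caption = caption_path.read_text(encoding="utf-8").strip()
            pairs.append((image_path, caption))
    return pairs


def caption_word_count(caption):
    """Count whitespace-separated words (meaningful for English captions only)."""
    return len(caption.split())


def captions_outside_range(pairs, low=30, high=60):
    """Return image paths whose caption word count falls outside [low, high]."""
    return [img for img, cap in pairs if not low <= caption_word_count(cap) <= high]
```

Because the listing describes the 30~60 word length as holding "in principle", flagged captions are candidates for review rather than errors.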
Recommended Dataset
2 Million Pairs Image Caption Data Of General Scenes
2 million pairs of images and descriptions. The pictures cover various categories, including landscapes, animals, flowers and trees, people, cars, sports, industry, and architecture, along with an aesthetic subset. The descriptions depict the overall scene of the image, the details within the scene, and the emotions conveyed by the image, and are provided in both English and Chinese.
Text description, multi-modality, general scene data set, English caption, Chinese caption
700,000 Sets Image Caption Data Of General Scenes
700,000 sets of images and descriptions. Picture categories include landscapes, animals, flowers and trees, people, cars, sports, industries, and buildings, along with an aesthetic subset. Most images have no fewer than two descriptions, each one sentence long; a small number of images have only one description. The description languages are English and Chinese.
Text description, multi-modality, general scene data set, English caption, Chinese caption
11,000 Image & Video Caption Data of Human Action
11,000 Image & Video caption data of human action contains 10,000 images and 10,000 videos of various human behaviors, captured in different seasons and from different shooting angles, in both indoor and outdoor scenes. The description language is English, mainly describing the gender, age, clothing, behavior, and body movements of the people shown.
AIGC, human behavior data, behavior recognition data, human behavior recognition data, human detection data
20,011 Image Caption Data of OCR in Natural Scenes
20,011 Image Caption Data of OCR in Natural Scenes covers 14 languages in total, including Asian and European languages. The collection environment includes shop plaques, stop signs, posters, road signs, and other scenes, captured from a variety of shooting angles. The description language is English, and the descriptions mainly cover text arrangement, text content, color, and other information.
AIGC, English caption, OCR caption, multilingual OCR data, OCR data, OCR dataset
10,000 Image Caption Data of Gestures
10,000 Image caption data of gestures, mainly of young and middle-aged people. The collection environment includes indoor and outdoor scenes, covering various seasons and collection angles. The description language is English, mainly describing hand characteristics such as hand movements and gestures, along with image acquisition angle, gender, age, etc.
10,000 Image Caption Data Of Vehicles
10,000 Image Caption Data Of Vehicles covers various types of cars, SUVs, MPVs, trucks, and buses. The images were collected by surveillance cameras on outdoor roads over multiple time periods. The description language is English, and the descriptions mainly cover vehicle type, color, vehicle orientation, scene, and similar information.
multi-modality, vehicle attribute data, security data, intelligent monitoring data, intelligent traffic data, smart city data
10,000 Image Caption Data of Diverse Scenes
10,000 Image caption data of diverse scenes, including natural scenes, urban street scenes, exhibitions, family environments, and other scenes, shot with different brands of cameras across multiple time periods and shooting angles. The description language is English, and the descriptions mainly cover the main scenes in the image, usually including both foreground and background.
multi-modality, natural scene data set, scene information data