202 People - Multi-angle Lip Multimodal Video Data

Multi-angle

lip multimodal

indoor natural light scenes

indoor fluorescent lamp scenes

13 shooting angles

Mandarin Chinese

general field

202 People - Multi-angle Lip Multimodal Video Data. The collection environments include indoor natural light scenes and indoor fluorescent lamp scenes. The recording device is a cellphone. The data diversity covers multiple scenes, different ages, and 13 shooting angles. The language is Mandarin Chinese. The recording content is general-domain, with no restriction on content. The data can be used for research on multimodal learning algorithms in the speech and image fields.

This is a paid dataset for commercial use, research purposes, and more. Licensed ready-made datasets help jump-start AI projects.

Specifications

Data size

202 people; for each person, audio and video data were collected from 13 different angles, plus 1 .txt document
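As a rough check on the stated data size, the sketch below computes how many files a complete delivery would contain. It assumes one recording per person per angle and one .txt document per person; the actual packaging of the dataset is not described here, and the function and constant names are illustrative only.

```python
# Figures taken from the "Data size" specification.
PEOPLE = 202
ANGLES_PER_PERSON = 13
TXT_PER_PERSON = 1


def expected_counts(people=PEOPLE, angles=ANGLES_PER_PERSON, txt=TXT_PER_PERSON):
    """Return expected file counts, assuming one recording per person per angle."""
    recordings = people * angles
    documents = people * txt
    return {
        "recordings": recordings,
        "documents": documents,
        "total": recordings + documents,
    }


if __name__ == "__main__":
    print(expected_counts())
```

Under these assumptions a complete set holds 2,626 recordings and 202 text documents.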

People distribution

race distribution: Asian (Indonesia); gender distribution: 89 males, 113 females; age distribution: 165 people aged 18-30, 32 people aged 31-45, and 5 people aged 46-60

Collecting environment

indoor natural light scenes, indoor fluorescent lamp scenes

Data diversity

including multiple scenes, different ages, different shooting angles

Device

cellphone; the resolution is 1,920 × 1,080

Collecting angle

audio and video data were collected simultaneously from 13 different angles: front face, 3 left-side-face angles, 3 right-side-face angles, looking down, looking up, left side face down, right side face down, left side face up, and right side face up
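The angle breakdown can be tallied to confirm that it adds up to 13. The following is a minimal sketch; the group labels are descriptive names chosen for illustration and are not identifiers used by the dataset itself.

```python
# Number of camera angles in each group named in the "Collecting angle" entry.
# The keys are illustrative labels, not file or directory names from the dataset.
ANGLE_GROUPS = {
    "front face": 1,
    "left side face": 3,
    "right side face": 3,
    "looking down": 1,
    "looking up": 1,
    "left side face down": 1,
    "right side face down": 1,
    "left side face up": 1,
    "right side face up": 1,
}


def total_angles(groups=ANGLE_GROUPS):
    """Sum the per-group angle counts."""
    return sum(groups.values())


if __name__ == "__main__":
    print(total_angles())
```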

Recording content

general field, unlimited content

Language

Mandarin Chinese; each video is longer than 20 seconds

Data format

the video data format is .mp4; the audio is sampled at 16 kHz or higher, 16-bit; the frame rate is 25-30 fps
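A buyer may want to confirm that delivered clips match the published specifications. The sketch below checks a clip's metadata against the figures on this page (.mp4, 1,920 × 1,080, 25-30 fps, audio at 16 kHz or higher and 16-bit, duration over 20 seconds). The `ClipInfo` structure and `spec_violations` function are hypothetical; reading the metadata from a file (for example with a tool such as ffprobe) is not shown, and accepting either orientation of the resolution is an assumption.

```python
from dataclasses import dataclass


@dataclass
class ClipInfo:
    """Hypothetical container for metadata already extracted from one clip."""
    container: str
    width: int
    height: int
    fps: float
    sample_rate_hz: int
    bit_depth: int
    duration_s: float


def spec_violations(clip):
    """Return a list of human-readable mismatches against the published spec."""
    problems = []
    if clip.container.lower() != "mp4":
        problems.append("container is not mp4")
    # Assumption: portrait and landscape recordings are both acceptable.
    if sorted((clip.width, clip.height)) != [1080, 1920]:
        problems.append("resolution is not 1920x1080")
    if not 25 <= clip.fps <= 30:
        problems.append("frame rate outside 25-30 fps")
    if clip.sample_rate_hz < 16000:
        problems.append("audio sample rate below 16 kHz")
    if clip.bit_depth != 16:
        problems.append("audio is not 16-bit")
    if clip.duration_s <= 20:
        problems.append("duration is not more than 20 seconds")
    return problems


if __name__ == "__main__":
    sample = ClipInfo("mp4", 1920, 1080, 30.0, 16000, 16, 24.5)
    print(spec_violations(sample))
```

An empty list means the clip is consistent with every figure checked.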

Accuracy rate

the word accuracy rate is more than 95%

Recommended Dataset

34,981 Images – Alpha Matte Human Body Segmentation Data (Fine Version)

34,981 Images – Alpha Matte Human Body Segmentation Data. The data includes indoor scenes and outdoor scenes. The dataset diversity includes multiple scenes, multiple age groups, multiple human body angles, multiple postures. In terms of annotation, alpha matte segmentation annotation was adopted for the human body. The data can be used for tasks such as alpha matte human body segmentation.

Alpha matte human body segmentation multiple scenes multiple age groups multiple human body angles multiple postures

4,458 People - 3D Facial Expressions Recognition Data

4,458 People - 3D Facial Expressions Recognition Data. The collection scenes include indoor scenes and outdoor scenes. The dataset includes males and females. The age distribution ranges from juveniles to the elderly, with young and middle-aged people in the majority. The devices include iPhone X and iPhone XR. The data diversity includes different expressions, different ages, different races, and different collecting scenes. This data can be used for tasks such as 3D facial expression recognition.

3D facial expressions recognition different expressions different ages different races different collecting scenes

5,199 People – 3D Face Recognition Images Data

5,199 People – 3D Face Recognition Images Data. The collection scenes are indoor scenes. The dataset includes males and females. The age distribution ranges from juveniles to the elderly, with young and middle-aged people in the majority. The devices include iPhone X and iPhone XR. The data diversity includes multiple facial postures, multiple light conditions, and multiple indoor scenes. This data can be used for tasks such as 3D face recognition.

3D Face Recognition multiple facial postures multiple light conditions multiple indoor scenes

1,417 People – 3D Living_Face & Anti_Spoofing Data

1,417 People – 3D Living_Face & Anti_Spoofing Data. The collection scenes include indoor and outdoor scenes. The dataset includes males and females. The age distribution ranges from juveniles to the elderly, with young and middle-aged people in the majority. The devices include iPhone X and iPhone XR. The data diversity includes various expressions, facial postures, anti-spoofing samples, multiple light conditions, and multiple scenes. This data can be used for tasks such as 3D face recognition and 3D Living_Face & Anti_Spoofing.

3D Living_Face & Anti_Spoofing various expressions facial postures anti-spoofing samples multiple light conditions multiple scenes

11,113 People - Face Recognition Data with Gauze Mask

11,113 People - Face Recognition Data with Gauze Mask. For each subject, 7 images were collected. The dataset diversity includes multiple mask types, multiple ages, multiple races, and multiple light conditions and scenes. This data can be applied to computer vision tasks such as occluded face detection and recognition.

Face recognition Face occlusion Frontal face Gauze mask

2,937 People with Occlusion and Multi-pose Face Recognition Data

2,937 People with Occlusion and Multi-pose Face Recognition Data. For each subject, 200 images were collected. The 200 images cover 4 kinds of light conditions * 10 kinds of occlusion cases (including the non-occluded case) * 5 kinds of face pose. This data can be applied to computer vision tasks such as occluded face detection and recognition.
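The per-subject count of 200 follows from crossing the three stated factors. A minimal sketch, using integer indices in place of the actual light, occlusion, and pose categories (which are not listed here):

```python
from itertools import product

# Factor sizes stated in the description; the categories themselves are not
# enumerated, so plain integer indices stand in for them.
LIGHT_CONDITIONS = 4
OCCLUSION_CASES = 10  # includes the non-occluded case
FACE_POSES = 5


def image_combinations(lights=LIGHT_CONDITIONS,
                       occlusions=OCCLUSION_CASES,
                       poses=FACE_POSES):
    """Enumerate every (light, occlusion, pose) index triple for one subject."""
    return list(product(range(lights), range(occlusions), range(poses)))


if __name__ == "__main__":
    print(len(image_combinations()))
```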

Face recognition Face occlusion Multi-pose per person Face with mask Multiple light conditions Multiple scenes

4,082 Families-Family Face Data

4,082 Families-Family Face Data. The data includes various scenes, different families, and 11 kinds of kinship pairs. One family photo was collected for each family, and each family includes at least three family members. The 11 kinds of kinship pairs, the key points of the two pupils, and the bounding box of each face were annotated. The data can be used for tasks such as kinship verification, searching for missing family members, and organizing family photo albums.

family face 11 kinds of kinship pairs different families family image direct relative pairs key points of two pupils bounding box of face kinship verification searching for missing family members organizing family photo albums and genealogy research

144,810 Images Multi-class Fashion Item Detection Data

144,810 Images Multi-class Fashion Item Detection Data. This dataset includes 19,968 images of males and 124,842 images of females. The fashion items were divided into 4 parts based on the season (spring, summer, autumn and winter). In terms of annotation, rectangular bounding boxes were adopted to annotate fashion items. The data can be used for tasks such as fashion item detection and fashion recommendation.

Fashion Item Detection Multiple scenes Different seasons Different races