{"id":1298,"datatype":"1","titleimg":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/asset/productNew/nexdata/APY230627001.jpg?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=APun%2FFClw%2Fz8gHzlhCPPBIQlbos%3D","type1":"147","type1str":null,"type2":"149","type2str":null,"dataname":"202 People - Multi-angle Lip Multimodal Video Data","datazy":[{"title":"Data size","desc":"Data size","content":"202 people, each person collects the audio and video data from 13 different angles +1 txt document"},{"title":"People distribution","desc":"People distribution","content":"race distribution: Asian (Indonesia), gender distribution: 89 males, 113 females, age distribution: 165 people aged 18-30, 32 people aged 31-45, and 5 people aged 46-60"},{"title":"Collecting environment","desc":"Collecting environment","content":"indoor natural light scenes, indoor fluorescent lamp scenes"},{"title":"Data diversity","desc":"Data diversity","content":"including multiple scenes, different ages, different shooting angles"},{"title":"Device","desc":"Device","content":"cellphone, the resolution is 1,920*1,080"},{"title":"Collecting angle","desc":"Collecting angle","content":"audio and video data of front face, 3 angles left side face, 3 angles right side face, looking down, looking up, left side face down, right side face down, left side face up and right side face up all 13 different angles were collected at the same time"},{"title":"Recording content","desc":"Recording content","content":"general field, unlimited content"},{"title":"Language","desc":"Language","content":"Mandarin Chinese, each video is more than 20 seconds"},{"title":"Data format","desc":"Data format","content":"the video data format is .mp4, the audio is greater than or equal to 16KHz, 16bit, the frame rate is 25-30 fps"},{"title":"Accuracy rata","desc":"Accuracy rata","content":"the accuracy rate of word is more than 95%"}],"datatag":"Lip multimodal,Mandarin Chinese,Multiple scenes,Different ages,Different shooting angles","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[{"name":"/data/apps/damp/temp/ziptemp/APY230627001_demo1715767204254/APY230627001_demo/002_male_29.png","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230627001_demo1715767204254/APY230627001_demo/002_male_29.png?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=ALASNNOKRu%2FsdItuxWu7btO8Gqs%3D","intro":"","size":0,"progress":100,"type":"jpg"},{"name":"/data/apps/damp/temp/ziptemp/APY230627001_demo1715767204254/APY230627001_demo/001_female_30.png","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230627001_demo1715767204254/APY230627001_demo/001_female_30.png?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=mZRLnTYk5W0s3jRzP7Um81hhRvw%3D","intro":"","size":0,"progress":100,"type":"jpg"},{"name":"/data/apps/damp/temp/ziptemp/APY230627001_demo1715767204254/APY230627001_demo/156_male_42.png","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230627001_demo1715767204254/APY230627001_demo/156_male_42.png?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=2wVvKW6e6XgkYOi9kPqptswFKGs%3D","intro":"","size":0,"progress":100,"type":"jpg"}],"officialSummary":"202 People - Multi-angle Lip Multimodal Video Data. The collection environments include indoor natural light scenes and indoor fluorescent lamp scenes. The device is cellphone. The diversity includes multiple scenes, different ages, 13 shooting angles. The language is Mandarin Chinese. The recording content is general field, unlimited content. The data can be used in multi-modal learning algorithms research in speech and image fields.","dataexampl":null,"datakeyword":["Lip multimodal","Mandarin Chinese","Multiple scenes","Different ages","Different shooting angles"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Data Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","BGimg":"","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"firstList":[{"name":"/data/apps/damp/temp/ziptemp/APY230627001_demo1715767204254/APY230627001_demo/090_female_38.png","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230627001_demo1715767204254/APY230627001_demo/090_female_38.png?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=c6Jkb362VMrtxemlPNPSW%2FkEH%2Fk%3D","intro":"","size":0,"progress":100,"type":"jpg"}]}

en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

202 People - Multi-angle Lip Multimodal Video Data

Lip multimodal

Mandarin Chinese

Multiple scenes

Different ages

Different shooting angles

202 People - Multi-angle Lip Multimodal Video Data. The collection environments include indoor natural light scenes and indoor fluorescent lamp scenes. The device is cellphone. The diversity includes multiple scenes, different ages, 13 shooting angles. The language is Mandarin Chinese. The recording content is general field, unlimited content. The data can be used in multi-modal learning algorithms research in speech and image fields.

This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.

Specifications

Specifications

Data size

202 people, each person collects the audio and video data from 13 different angles +1 txt document

People distribution

race distribution: Asian (Indonesia), gender distribution: 89 males, 113 females, age distribution: 165 people aged 18-30, 32 people aged 31-45, and 5 people aged 46-60

Collecting environment

indoor natural light scenes, indoor fluorescent lamp scenes

Data diversity

including multiple scenes, different ages, different shooting angles

Device

cellphone, the resolution is 1,920*1,080

Collecting angle

audio and video data of front face, 3 angles left side face, 3 angles right side face, looking down, looking up, left side face down, right side face down, left side face up and right side face up all 13 different angles were collected at the same time

Recording content

general field, unlimited content

Language

Mandarin Chinese, each video is more than 20 seconds

Data format

the video data format is .mp4, the audio is greater than or equal to 16KHz, 16bit, the frame rate is 25-30 fps

Accuracy rata

the accuracy rate of word is more than 95%

Sample

Sample

Recommended Datasets

Recommended Dataset

21,300 Images - Human Body Segmentation Data

21,300 Images - Human Body Segmentation Data. The data includes indoor scenes and outdoor scenes. The data covers female people and male people. The race distribution includes Asian, black race and Caucasian. The age distribution ranges from teenager to the elderly, the middle-aged and young people are the majorities. The dataset diversity includes multiple scenes, ages, races, postures, and appendages. In terms of annotation, we adpoted pixel-wise segmentation annotations on human body. The data can be used for tasks such as human body segmentation.

Multiple scenes Multiple ages Multiple races Multiple postures Multiple appendages

5,993 People – Infrared Face Recognition Data

5,993 People – Infrared Face Recognition Data. The collecting scenes of this dataset include indoor scenes and outdoor scenes. The data includes male and female. The age distribution ranges from child to the elderly, the young people and the middle aged are the majorities. The collecting device is realsense D453i. The data diversity includes multiple age periods, multiple facial postures, multiple scenes. The data can be used for tasks such as infrared face recognition.

Multiple age periods Multiple facial postures Multiple scenes

34,981 Images – Alpha Matte Human Body Segmentation Data (Fine Version)

34,981 Images – Alpha Matte Human Body Segmentation Data. The data includes indoor scenes and outdoor scenes. The dataset diversity includes multiple scenes, multiple age groups, multiple human body angles, multiple postures. In terms of annotation, alpha matte segmentation annotation was adopted for the human body. The data can be used for tasks such as alpha matte human body segmentation.

Multiple scenes Multiple age groups Multiple human body angles Multiple postures

4,458 People - 3D Facial Expressions Recognition Data

4,458 People - 3D Facial Expressions Recognition Data. The collection scenes include indoor scenes and outdoor scenes. The dataset includes males and females. The age distribution ranges from juvenile to the elderly, the young people and the middle aged are the majorities. The device includes iPhone X, iPhone XR. The data diversity includes different expressions, different ages, different races, different collecting scenes. This data can be used for tasks such as 3D facial expression recognition.

Different expressions Different ages Different races Different collecting scenes

5,199 People – 3D Face Recognition Images Data

5,199 People – 3D Face Recognition Images Data. The collection scene is indoor scene. The dataset includes males and females. The age distribution ranges from juvenile to the elderly, the young people and the middle aged are the majorities. The device includes iPhone X, iPhone XR. The data diversity includes multiple facial postures, multiple light conditions, multiple indoor scenes. This data can be used for tasks such as 3D face recognition.

3D Face Recognition Multiple facial postures Multiple light conditions Multiple indoor scenes

1,417 People – 3D Living_Face & Anti_Spoofing Data

1,417 People – 3D Living_Face & Anti_Spoofing Data. The collection scenes include indoor and outdoor scenes. The dataset includes males and females. The age distribution ranges from juvenile to the elderly, the young people and the middle aged are the majorities. The device includes iPhone X, iPhone XR. The data diversity includes various expressions, facial postures, anti-spoofing samples, multiple light conditions, multiple scenes. This data can be used for tasks such as 3D face recognition, 3D Living_Face & Anti_Spoofing.

3D Living_Face & Anti_Spoofing Various expressions Facial postures Anti-spoofing samples Multiple light conditions Multiple scenes

11,113 People - Face Recognition Data with Gauze Mask

11,113 People - Face Recognition Data with Gauze Mask, for each subject, 7 images were collected. The dataset diversity includes multiple mask types, multiple ages, multiple races, multiple light conditions and scenes.This data can be applied to computer vision tasks such as occluded face detection and recognition.

Face recognition Face occlusion Frontal face Gause mask Multiple mask types Multiple ages Multiple races or nationalities Multiple light conditions and multiple collection scenes

2,937 People with Occlusion and Multi-pose Face Recognition Data

2,937 People with Occlusion and Multi-pose Face Recognition Data, for each subject, 200 images were collected. The 200 images includes 4 kinds of light conditions * 10 kinds of occlusion cases (including non-occluded case) * 5 kinds of face pose. This data can be applied to computer vision tasks such as occluded face detection and recognition.

Face recognition Face occlusion Multi-pose per person Face with mask

Tell Us Your Special Needs

Full Name *

Contact Phone No. *

Company name *

Company Email *

Data Requirements *

By submitting, I agree to the Privacy Protection

Subscribe to our newsletter

Be the first to receive Nexdata latest product releases, data solutions and enterprise news.

Off-the-Shelf Datasets: All Category Datasets; LLM Datasets; Computer Vision Datasets; Speech Recognition Datasets; Speech Synthesis Datasets; OCR Datasets; Pronunciation Dictionary; NLU Datasets

Data Service: 3D Point Cloud Data; Street View Data; OCR Data; Behavior Recognition Data; Identity Recognition Data; Speech Recognition Data; Speech Synthesis Data; Multimodal Data

Industries: Generative AI; Autonomous Vehicles; AR/VR; Conversational AI; Smart Home; Retail; Intelligent Healthcare

Company: About Us; News; Partners; Quality & Security; Event
Links: OPENMPD; DataPlus; Datarade

Platform: Platform
Competition: Competition
Resources: Sponsored Datasets

Sharpen Your AI with Better Data

+1(626)594-5598

[email protected]

nexdata_ai facebook

nexdata_ai twitter

nexdata_ai linkedin

nexdata_ai youtube

Copyright © 2023 NEXDATA TECHNOLOGY INC

Sitemap Terms and Conditions

We use cookies to enhance your browsing experience, serve personalized ads or content, and analyze our traffic. By clicking "Accept All", you consent to our use of cookies.

22e2baf7-c8d8-4c8b-b233-6aa6f90eeba7

bafd9966-32b4-4b49-b0e4-d70e6d2fb94d