High-Quality Training Datasets

Boost the performance of your AI models with our high-quality, ready-to-use training datasets.

2 People - Korean Average Tone Speech Synthesis Corpus

2 People - Korean Average Tone Speech Synthesis Corpus, recorded by native Korean speakers with authentic accents. It contains general-domain text in news and colloquial styles, with balanced phoneme coverage. Professional phoneticians took part in the annotation. The corpus is well suited to speech synthesis research and development.
TTS Korean Average Tone

105 Hours - Italian(Italy) Gaming Real-world Casual Conversation and Monologue speech dataset

Italian (Italy) Gaming Real-world Casual Conversation and Monologue speech dataset. It covers spontaneous dialogue about popular and evergreen games, including player discussions of battle strategies, social interactions, and esports news, and mirrors real-world interactions. Transcriptions include text content, speaker ID, gender, accent, offensive-expression labels, and other attributes. The data was collected from a large, geographically diverse pool of speakers, which is intended to improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.
Italy Spontaneous Dialogue Gaming Italian

206 Hours - English Financial Entities Real-world Casual Conversation and Monologue speech dataset

English Financial Entities Real-world Casual Conversation and Monologue speech dataset. It covers a range of professional financial terminology, focusing primarily on macroeconomics and microeconomics, and mirrors real-world interactions. Transcriptions include text content, speaker ID, gender, common entities, and other attributes. The data was collected from a large, geographically diverse pool of speakers, which is intended to improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.
English Entity Spontaneous Dialogue Financial

198 Hours - Spanish Gaming Real-world Casual Conversation and Monologue speech dataset

Spanish Gaming Real-world Casual Conversation and Monologue speech dataset. It covers spontaneous dialogue about popular and evergreen games, including player discussions of battle strategies, social interactions, and esports news, and mirrors real-world interactions. Transcriptions include text content, speaker ID, gender, accent, offensive-expression labels, and other attributes. The data was collected from a large, geographically diverse pool of speakers, which is intended to improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.
Spanish Spontaneous Dialogue Gaming Latin Spanish

203 Hours - Korean(Korea) Medical Entities Real-world Casual Conversation and Monologue speech dataset

Korean (Korea) Medical Entities Real-world Casual Conversation and Monologue speech dataset. It covers a range of professional medical terminology, focusing primarily on medical consultations, medical education, and medical academic conferences and lectures, and mirrors real-world interactions. Transcriptions include text content, speaker ID, gender, common entities, and other attributes. The data was collected from a large, geographically diverse pool of speakers, which is intended to improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.
Korean Entity Spontaneous Dialogue Medical

215 Hours - Korean(Korea) Financial Entities Real-world Casual Conversation and Monologue speech dataset

Korean (Korea) Financial Entities Real-world Casual Conversation and Monologue speech dataset. It covers a range of professional financial terminology, focusing primarily on macroeconomics and microeconomics, and mirrors real-world interactions. Transcriptions include text content, speaker ID, gender, common entities, and other attributes. The data was collected from a large, geographically diverse pool of speakers, which is intended to improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.
Korean Entity Spontaneous Dialogue Financial

5,000 People Multi-race – Infrared Face Recognition Data

This dataset contains infrared face recognition data from 5,000 people of multiple races. Collection scenes include both indoor and outdoor settings, and subjects include both males and females. The race distribution covers Asian, Black, Caucasian, and Brown people. Ages range from children to the elderly, with young and middle-aged people in the majority. The collection device is DV-DH4,044S305AD. The data spans multiple age groups, facial poses, and scenes, and can be used for tasks such as infrared face recognition. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.
Multi-race infrared face binocular camera multiple age periods multiple facial postures multiple scenes

Millions of Foreign Face Data_Single Image

Millions of Foreign Face Data, with one frontal face image per person. The race distribution covers Asian, Black, Caucasian, and Brown people. Ages range from infants to the elderly, with young and middle-aged people in the majority. The collection environment includes indoor and outdoor scenes. The data spans multiple age groups, scenes, facial poses, and expressions, and can be used for tasks such as face recognition. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.
Million id foreign face single image

4 People - Northeastern dialect Average Tone Speech Synthesis Corpus

4 People - Northeastern dialect Average Tone Speech Synthesis Corpus, recorded by native speakers from Northeast China. About 40% of the corpus contains words unique to Northeast China, and phonemes and tones are balanced. Professional phoneticians took part in the annotation. The corpus is well suited to speech synthesis research and development.
Synthesis Corpus TTS Average Tone General Northeast Dialect

424 Hours - Romanian(Romania) Real-world Casual Conversation and Monologue speech dataset

Romanian (Romania) Real-world Casual Conversation and Monologue speech dataset. It covers self-media, conversation, live streams, variety shows, and other generic domains, and mirrors real-world interactions. Transcriptions include text content, speaker ID, gender, age, and other attributes. The data was collected from a large, geographically diverse pool of speakers, which is intended to improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.
Romania Romanian Casual Conversation Monologue Asr

373 Hours - Dari(Afghanistan) Spontaneous Dialogue Smartphone speech dataset

Dari (Afghanistan) Spontaneous Dialogue Smartphone speech dataset, collected from dialogues on given topics and covering 20+ domains. Transcriptions include text content, speaker ID, gender, age, and other attributes. The data was collected from a large, geographically diverse pool of speakers (504 native speakers), which is intended to improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use. All our datasets comply with GDPR, CCPA, and PIPL.
asr audio conversational asr data dari

116,048 Sets - 3D Handpose Dataset

This dataset contains 116,048 sets of 3D handpose data. Each set includes a hand mask image (RGB, 24-bit), a depth image (16-bit), a camera intrinsic parameter file (TXT), a 3D keypoints file (OBJ), a mesh file (OBJ), a gesture type file (TXT), a keypoints demo image (JPG), and a mesh demo image (JPG). The data was collected indoors using the right hand (no handheld objects) and covers first-person and third-person perspectives as well as multiple gesture types, finger poses, overall hand rotation poses, individuals, and Kinect devices. The dataset contains no personally identifiable facial information, and the hand mask images and depth images are aligned. It can be used for tasks such as handpose recognition, 3D hand reconstruction, and hand keypoint detection.
VR Handpose recognition 3D reconstruction Keypoints detection
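
The listing states that the 3D keypoints and mesh files are in OBJ format but does not describe their exact layout. Assuming they follow the standard Wavefront OBJ convention, where each geometric vertex is a line of the form `v x y z`, a minimal Python sketch for reading the vertex coordinates might look like the following. The function name `parse_obj_vertices` and the sample text are hypothetical and are not taken from the dataset.

```python
from typing import List, Tuple

Point3D = Tuple[float, float, float]


def parse_obj_vertices(obj_text: str) -> List[Point3D]:
    """Return the (x, y, z) coordinates of every `v` line in OBJ text.

    Assumption: the file uses standard Wavefront OBJ vertex lines.
    """
    vertices: List[Point3D] = []
    for line in obj_text.splitlines():
        parts = line.split()
        # Geometric vertices start with the token "v";
        # other records such as "vn", "vt", and "f" are skipped.
        if len(parts) >= 4 and parts[0] == "v":
            vertices.append((float(parts[1]), float(parts[2]), float(parts[3])))
    return vertices


# Hypothetical sample content, made up for illustration only.
sample = """# example keypoints
v 0.0 0.0 0.5
v 0.01 0.02 0.48
vn 0.0 0.0 1.0
"""
keypoints = parse_obj_vertices(sample)
print(len(keypoints))
```

The same function would also return the vertex list of a mesh file under the same assumption; face records (`f` lines) would need separate handling.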