en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

1,351 Hours - Mandarin Chinese(China) Spontaneous Dialogue (Smartphone+Voice Recorder) speech dataset

Mandarin natural dialogue audio data
Mandarin conversational data
Mandarin conversational dataset
Mandarin conversational video data
Mandarin conversational video dataset
Mandarin conversational graphical data
Mandarin conversational graphical dataset
Mandarin conversational recording data
Mandarin conversational recording dataset
Mandarin conversational visual data
Mandarin conversational visual dataset
Mandarin conversational tape data
Mandarin conversational tape dataset
Mandarin commonsense data
Mandarin commonsense dataset
Mandarin commonsense video data
Mandarin commonsense video dataset
Mandarin commonsense graphical data
Mandarin commonsense graphical dataset
Mandarin commonsense recording data
Mandarin commonsense recording dataset
Mandarin commonsense visual data
Mandarin commonsense visual dataset
Mandarin commonsense tape data
Mandarin commonsense tape dataset
Mandarin small talk data
Mandarin small talk dataset
Mandarin small talk video data
Mandarin small talk video dataset
Mandarin small talk graphical data
Mandarin small talk graphical dataset
Mandarin small talk recording data
Mandarin small talk recording dataset
Mandarin small talk visual data
Mandarin small talk visual dataset
Mandarin small talk tape data
Mandarin small talk tape dataset
Mandarin daily talk data
Mandarin daily talk dataset
Mandarin daily talk video data
Mandarin daily talk video dataset
Mandarin daily talk graphical data
Mandarin daily talk graphical dataset
Mandarin daily talk recording data
Mandarin daily talk recording dataset
Mandarin daily talk visual data
Mandarin daily talk visual dataset
Mandarin daily talk tape data
Mandarin daily talk tape dataset
Mandarin daily conversation data
Mandarin daily conversation dataset
Mandarin daily conversation video data
Mandarin daily conversation video dataset
Mandarin daily conversation graphical data
Mandarin daily conversation graphical dataset
Mandarin daily conversation recording data
Mandarin daily conversation recording dataset
Mandarin daily conversation visual data
Mandarin daily conversation visual dataset
Mandarin daily conversation tape data
Mandarin daily conversation tape dataset
Mandarin dialects conversational data
Mandarin dialects conversational dataset
Mandarin dialects conversational video data
Mandarin dialects conversational video dataset
Mandarin dialects conversational graphical data
Mandarin dialects conversational graphical dataset
Mandarin dialects conversational recording data
Mandarin dialects conversational recording dataset
Mandarin dialects conversational visual data
Mandarin dialects conversational visual dataset
Mandarin dialects conversational tape data
Mandarin dialects conversational tape dataset
Mandarin dialects commonsense data
Mandarin dialects commonsense dataset
Mandarin dialects commonsense video data
Mandarin dialects commonsense video dataset
Mandarin dialects commonsense graphical data
Mandarin dialects commonsense graphical dataset
Mandarin dialects commonsense recording data
Mandarin dialects commonsense recording dataset
Mandarin dialects commonsense visual data
Mandarin dialects commonsense visual dataset
Mandarin dialects commonsense tape data
Mandarin dialects commonsense tape dataset
Mandarin dialects small talk data
Mandarin dialects small talk dataset
Mandarin dialects small talk video data
Mandarin dialects small talk video dataset
Mandarin dialects small talk graphical data
Mandarin dialects small talk graphical dataset
Mandarin dialects small talk recording data
Mandarin dialects small talk recording dataset
Mandarin dialects small talk visual data
Mandarin dialects small talk visual dataset
Mandarin dialects small talk tape data
Mandarin dialects small talk tape dataset
Mandarin dialects daily talk data
Mandarin dialects daily talk dataset
Mandarin dialects daily talk video data
Mandarin dialects daily talk video dataset
Mandarin dialects daily talk graphical data
Mandarin dialects daily talk graphical dataset
Mandarin dialects daily talk recording data
Mandarin dialects daily talk recording dataset
Mandarin dialects daily talk visual data
Mandarin dialects daily talk visual dataset
Mandarin dialects daily talk tape data
Mandarin dialects daily talk tape dataset
Mandarin dialects daily conversation data
Mandarin dialects daily conversation dataset
Mandarin dialects daily conversation video data
Mandarin dialects daily conversation video dataset
Mandarin dialects daily conversation graphical data
Mandarin dialects daily conversation graphical dataset
Mandarin dialects daily conversation recording data
Mandarin dialects daily conversation recording dataset
Mandarin dialects daily conversation visual data
Mandarin dialects daily conversation visual dataset
Mandarin dialects daily conversation tape data
Mandarin dialects daily conversation tape dataset
Mandarin regional language conversational data
Mandarin regional language conversational dataset
Mandarin regional language conversational video data
Mandarin regional language conversational video dataset
Mandarin regional language conversational graphical data
Mandarin regional language conversational graphical dataset
Mandarin regional language conversational recording data
Mandarin regional language conversational recording dataset
Mandarin regional language conversational visual data
Mandarin regional language conversational visual dataset
Mandarin regional language conversational tape data
Mandarin regional language conversational tape dataset
Mandarin regional language commonsense data
Mandarin regional language commonsense dataset
Mandarin regional language commonsense video data
Mandarin regional language commonsense video dataset
Mandarin regional language commonsense graphical data
Mandarin regional language commonsense graphical dataset
Mandarin regional language commonsense recording data
Mandarin regional language commonsense recording dataset
Mandarin regional language commonsense visual data
Mandarin regional language commonsense visual dataset
Mandarin regional language commonsense tape data
Mandarin regional language commonsense tape dataset
Mandarin regional language small talk data
Mandarin regional language small talk dataset
Mandarin regional language small talk video data
Mandarin regional language small talk video dataset
Mandarin regional language small talk graphical data
Mandarin regional language small talk graphical dataset
Mandarin regional language small talk recording data
Mandarin regional language small talk recording dataset
Mandarin regional language small talk visual data
Mandarin regional language small talk visual dataset
Mandarin regional language small talk tape data
Mandarin regional language small talk tape dataset
Mandarin regional language daily talk data
Mandarin regional language daily talk dataset
Mandarin regional language daily talk video data
Mandarin regional language daily talk video dataset
Mandarin regional language daily talk graphical data
Mandarin regional language daily talk graphical dataset
Mandarin regional language daily talk recording data
Mandarin regional language daily talk recording dataset
Mandarin regional language daily talk visual data
Mandarin regional language daily talk visual dataset
Mandarin regional language daily talk tape data
Mandarin regional language daily talk tape dataset
Mandarin regional language daily conversation data
Mandarin regional language daily conversation dataset
Mandarin regional language daily conversation video data
Mandarin regional language daily conversation video dataset
Mandarin regional language daily conversation graphical data
Mandarin regional language daily conversation graphical dataset
Mandarin regional language daily conversation recording data
Mandarin regional language daily conversation recording dataset
Mandarin regional language daily conversation visual data
Mandarin regional language daily conversation visual dataset
Mandarin regional language daily conversation tape data
Mandarin regional language daily conversation tape dataset
Mandarin vernacular conversational data
Mandarin vernacular conversational dataset
Mandarin vernacular conversational video data
Mandarin vernacular conversational video dataset
Mandarin vernacular conversational graphical data
Mandarin vernacular conversational graphical dataset
Mandarin vernacular conversational recording data
Mandarin vernacular conversational recording dataset
Mandarin vernacular conversational visual data
Mandarin vernacular conversational visual dataset
Mandarin vernacular conversational tape data
Mandarin vernacular conversational tape dataset
Mandarin vernacular commonsense data
Mandarin vernacular commonsense dataset
Mandarin vernacular commonsense video data
Mandarin vernacular commonsense video dataset
Mandarin vernacular commonsense graphical data
Mandarin vernacular commonsense graphical dataset
Mandarin vernacular commonsense recording data
Mandarin vernacular commonsense recording dataset
Mandarin vernacular commonsense visual data
Mandarin vernacular commonsense visual dataset
Mandarin vernacular commonsense tape data
Mandarin vernacular commonsense tape dataset
Mandarin vernacular small talk data
Mandarin vernacular small talk dataset
Mandarin vernacular small talk video data
Mandarin vernacular small talk video dataset
Mandarin vernacular small talk graphical data
Mandarin vernacular small talk graphical dataset
Mandarin vernacular small talk recording data
Mandarin vernacular small talk recording dataset
Mandarin vernacular small talk visual data
Mandarin vernacular small talk visual dataset
Mandarin vernacular small talk tape data
Mandarin vernacular small talk tape dataset
Mandarin vernacular daily talk data
Mandarin vernacular daily talk dataset
Mandarin vernacular daily talk video data
Mandarin vernacular daily talk video dataset
Mandarin vernacular daily talk graphical data
Mandarin vernacular daily talk graphical dataset
Mandarin vernacular daily talk recording data
Mandarin vernacular daily talk recording dataset
Mandarin vernacular daily talk visual data
Mandarin vernacular daily talk visual dataset
Mandarin vernacular daily talk tape data
Mandarin vernacular daily talk tape dataset
Mandarin vernacular daily conversation data
Mandarin vernacular daily conversation dataset
Mandarin vernacular daily conversation video data
Mandarin vernacular daily conversation video dataset
Mandarin vernacular daily conversation graphical data
Mandarin vernacular daily conversation graphical dataset
Mandarin vernacular daily conversation recording data
Mandarin vernacular daily conversation recording dataset
Mandarin vernacular daily conversation visual data
Mandarin vernacular daily conversation visual dataset
Mandarin vernacular daily conversation tape data
Mandarin vernacular daily conversation tape dataset
Mandarin patois conversational data
Mandarin patois conversational dataset
Mandarin patois conversational video data
Mandarin patois conversational video dataset
Mandarin patois conversational graphical data
Mandarin patois conversational graphical dataset
Mandarin patois conversational recording data
Mandarin patois conversational recording dataset
Mandarin patois conversational visual data
Mandarin patois conversational visual dataset
Mandarin patois conversational tape data
Mandarin patois conversational tape dataset
Mandarin patois commonsense data
Mandarin patois commonsense dataset
Mandarin patois commonsense video data
Mandarin patois commonsense video dataset
Mandarin patois commonsense graphical data
Mandarin patois commonsense graphical dataset
Mandarin patois commonsense recording data
Mandarin patois commonsense recording dataset
Mandarin patois commonsense visual data
Mandarin patois commonsense visual dataset
Mandarin patois commonsense tape data
Mandarin patois commonsense tape dataset
Mandarin patois small talk data
Mandarin patois small talk dataset
Mandarin patois small talk video data
Mandarin patois small talk video dataset
Mandarin patois small talk graphical data
Mandarin patois small talk graphical dataset
Mandarin patois small talk recording data
Mandarin patois small talk recording dataset
Mandarin patois small talk visual data
Mandarin patois small talk visual dataset
Mandarin patois small talk tape data
Mandarin patois small talk tape dataset
Mandarin patois daily talk data
Mandarin patois daily talk dataset
Mandarin patois daily talk video data
Mandarin patois daily talk video dataset
Mandarin patois daily talk graphical data
Mandarin patois daily talk graphical dataset
Mandarin patois daily talk recording data
Mandarin patois daily talk recording dataset
Mandarin patois daily talk visual data
Mandarin patois daily talk visual dataset
Mandarin patois daily talk tape data
Mandarin patois daily talk tape dataset
Mandarin patois daily conversation data
Mandarin patois daily conversation dataset
Mandarin patois daily conversation video data
Mandarin patois daily conversation video dataset
Mandarin patois daily conversation graphical data
Mandarin patois daily conversation graphical dataset
Mandarin patois daily conversation recording data
Mandarin patois daily conversation recording dataset
Mandarin patois daily conversation visual data
Mandarin patois daily conversation visual dataset
Mandarin patois daily conversation tape data
Mandarin patois daily conversation tape dataset
Mandarin idiom conversational data
Mandarin idiom conversational dataset
Mandarin idiom conversational video data
Mandarin idiom conversational video dataset
Mandarin idiom conversational graphical data
Mandarin idiom conversational graphical dataset
Mandarin idiom conversational recording data
Mandarin idiom conversational recording dataset
Mandarin idiom conversational visual data
Mandarin idiom conversational visual dataset
Mandarin idiom conversational tape data
Mandarin idiom conversational tape dataset
Mandarin idiom commonsense data
Mandarin idiom commonsense dataset
Mandarin idiom commonsense video data
Mandarin idiom commonsense video dataset
Mandarin idiom commonsense graphical data
Mandarin idiom commonsense graphical dataset
Mandarin idiom commonsense recording data
Mandarin idiom commonsense recording dataset
Mandarin idiom commonsense visual data
Mandarin idiom commonsense visual dataset
Mandarin idiom commonsense tape data
Mandarin idiom commonsense tape dataset
Mandarin idiom small talk data
Mandarin idiom small talk dataset
Mandarin idiom small talk video data
Mandarin idiom small talk video dataset
Mandarin idiom small talk graphical data
Mandarin idiom small talk graphical dataset
Mandarin idiom small talk recording data
Mandarin idiom small talk recording dataset
Mandarin idiom small talk visual data
Mandarin idiom small talk visual dataset
Mandarin idiom small talk tape data
Mandarin idiom small talk tape dataset
Mandarin idiom daily talk data
Mandarin idiom daily talk dataset
Mandarin idiom daily talk video data
Mandarin idiom daily talk video dataset
Mandarin idiom daily talk graphical data
Mandarin idiom daily talk graphical dataset
Mandarin idiom daily talk recording data
Mandarin idiom daily talk recording dataset
Mandarin idiom daily talk visual data
Mandarin idiom daily talk visual dataset
Mandarin idiom daily talk tape data
Mandarin idiom daily talk tape dataset
Mandarin idiom daily conversation data
Mandarin idiom daily conversation dataset
Mandarin idiom daily conversation video data
Mandarin idiom daily conversation video dataset
Mandarin idiom daily conversation graphical data
Mandarin idiom daily conversation graphical dataset
Mandarin idiom daily conversation recording data
Mandarin idiom daily conversation recording dataset
Mandarin idiom daily conversation visual data
Mandarin idiom daily conversation visual dataset
Mandarin idiom daily conversation tape data
Mandarin idiom daily conversation tape dataset
Mandarin phonology conversational data
Mandarin phonology conversational dataset
Mandarin phonology conversational video data
Mandarin phonology conversational video dataset
Mandarin phonology conversational graphical data
Mandarin phonology conversational graphical dataset
Mandarin phonology conversational recording data
Mandarin phonology conversational recording dataset
Mandarin phonology conversational visual data
Mandarin phonology conversational visual dataset
Mandarin phonology conversational tape data
Mandarin phonology conversational tape dataset
Mandarin phonology commonsense data
Mandarin phonology commonsense dataset
Mandarin phonology commonsense video data
Mandarin phonology commonsense video dataset
Mandarin phonology commonsense graphical data
Mandarin phonology commonsense graphical dataset
Mandarin phonology commonsense recording data
Mandarin phonology commonsense recording dataset
Mandarin phonology commonsense visual data
Mandarin phonology commonsense visual dataset
Mandarin phonology commonsense tape data
Mandarin phonology commonsense tape dataset
Mandarin phonology small talk data
Mandarin phonology small talk dataset
Mandarin phonology small talk video data
Mandarin phonology small talk video dataset
Mandarin phonology small talk graphical data
Mandarin phonology small talk graphical dataset
Mandarin phonology small talk recording data
Mandarin phonology small talk recording dataset
Mandarin phonology small talk visual data
Mandarin phonology small talk visual dataset
Mandarin phonology small talk tape data
Mandarin phonology small talk tape dataset
Mandarin phonology daily talk data
Mandarin phonology daily talk dataset
Mandarin phonology daily talk video data
Mandarin phonology daily talk video dataset
Mandarin phonology daily talk graphical data
Mandarin phonology daily talk graphical dataset
Mandarin phonology daily talk recording data
Mandarin phonology daily talk recording dataset
Mandarin phonology daily talk visual data
Mandarin phonology daily talk visual dataset
Mandarin phonology daily talk tape data
Mandarin phonology daily talk tape dataset
Mandarin phonology daily conversation data
Mandarin phonology daily conversation dataset
Mandarin phonology daily conversation video data
Mandarin phonology daily conversation video dataset
Mandarin phonology daily conversation graphical data
Mandarin phonology daily conversation graphical dataset
Mandarin phonology daily conversation recording data
Mandarin phonology daily conversation recording dataset
Mandarin phonology daily conversation visual data
Mandarin phonology daily conversation visual dataset
Mandarin phonology daily conversation tape data
Mandarin phonology daily conversation tape dataset
Chinese conversational data
Chinese conversational dataset
Chinese conversational video data
Chinese conversational video dataset
Chinese conversational graphical data
Chinese

Mandarin Chinese(China) Spontaneous Dialogue (Smartphone+Voice Recorder) speech dataset, collected from dialogues based on given topics, covering dozens of generic domain. Transcribed with text content, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers(1,950 people in total), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
Mobile phone: 16kHz, 16bit, mono channel, wav; Voice recorder: 44.1kHz, 16bit, dual channel, wav;
Recording condition
Low background noise, without echo;
Content category
generic domain(dozens of topics);
Recording device
Smartphone, voice recorder;
Speaker
1,950 people in total; 66% speakers of all are in the age group of 16-25; 962 speakers of them spoke in groups of two speakers, 312 speakers of them spoke in groups of three speakers, 396 speakers of them spoke in groups of four speakers, and the other 280 speakers spoke in groups of five speakers;
Country
China(CHN);
Language(Region) Code
zh-CN;
Language
Mandarin Chinese;
Features of annotation
Transcription text, speaker ID, gender;
Accuracy Rate
Sentence Accuracy Rate (SAR) 97%
Sample Sample
  • Audio

    你现在一般儿都玩什么游戏呀

  • Audio

    没改的时候的他的一技能是可以放个旋风

  • Audio

    我最喜欢玩儿之前的宫本了宫本武藏之前没改

  • Audio

    都玩儿那个王者荣耀

  • Audio

    王者荣耀哎我也玩儿那里边儿那个英雄都你喜欢玩儿什么

Recommended DatasetsRecommended Dataset
93 Hours - Russian(Russia) Spontaneous Dialogue Telephony speech dataset

Russian(Russia) Spontaneous Dialogue Telephony speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(126 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Conversational speech Russian asr data russian asr dataset russia
283 Hours - Indonesian(Indonesia) Spontaneous Dialogue Telephony speech dataset

Indonesian(Indonesia) Spontaneous Dialogue Telephony speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(376 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

audio data dataset conversational asr data Indonesian telephone
163 Hours - Russian(Russia) Children Real-world Casual Conversation and Monologue speech dataset

Russian(Russia) Children Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, lecture, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age, accent and other attributes. Our dataset was collected from extensive and diversify speakers(12 years old and younger children), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Russian Child Spontaneous Speech
162 Hours - French(France) Children Real-world Casual Conversation and Monologue speech dataset

French(France) Children Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, lecture, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age, accent and other attributes. Our dataset was collected from extensive and diversify speakers(12 years old and younger children), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

French Spontaneous Speech Child
107 Hours - Spanish(Mexico) Spontaneous Dialogue Smartphone speech dataset

Spanish(Mexico) Spontaneous Dialogue Smartphone speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(126 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

audio data dataset conversational asr data spanish mexican
80 Hours - French(Canada) Spontaneous Dialogue Smartphone speech dataset

French(Canada) Spontaneous Dialogue Smartphone speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(126 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

audio data dataset conversational asr data french Canadian
104 Hours - Portuguese(Portugal) Spontaneous Dialogue Smartphone speech dataset

Portuguese(Portugal) Spontaneous Dialogue Smartphone speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(124 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

audio data dataset conversational asr data Portuguese European
101 Hours - Italian(Italy) Children Real-world Casual Conversation and Monologue speech dataset

Italian(Italy) Children Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, lecture, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age, accent and other attributes. Our dataset was collected from extensive and diversify speakers(12 years old and younger children), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Spontaneous Speech Data text annotation Italian
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

0a305176-37a2-492e-b200-f010967dfddf

93012c64-d58d-4087-8221-f25a1270d30e