en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

1,420 Hours - Mandarin Chinese(China) Spontaneous Monologue Smartphone speech dataset

Mandarin asr data
Mandarin asr dataset
Mandarin asr collection
Mandarin language data
Mandarin language dataset
Mandarin language collection
Mandarin speech data
Mandarin speech dataset
Mandarin speech collection
Mandarin discuss asr data
Mandarin discuss asr dataset
Mandarin discuss asr collection
Mandarin discuss language data
Mandarin discuss language dataset
Mandarin discuss language collection
Mandarin discuss speech data
Mandarin discuss speech dataset
Mandarin discuss speech collection
Mandarin small talk asr data
Mandarin small talk asr dataset
Mandarin small talk asr collection
Mandarin small talk language data
Mandarin small talk language dataset
Mandarin small talk language collection
Mandarin small talk speech data
Mandarin small talk speech dataset
Mandarin small talk speech collection
Mandarin conversational asr data
Mandarin conversational asr dataset
Mandarin conversational asr collection
Mandarin conversational language data
Mandarin conversational language dataset
Mandarin conversational language collection
Mandarin conversational speech data
Mandarin conversational speech dataset
Mandarin conversational speech collection
Mandarin chat asr data
Mandarin chat asr dataset
Mandarin chat asr collection
Mandarin chat language data
Mandarin chat language dataset
Mandarin chat language collection
Mandarin chat speech data
Mandarin chat speech dataset
Mandarin chat speech collection
Mandarin communication asr data
Mandarin communication asr dataset
Mandarin communication asr collection
Mandarin communication language data
Mandarin communication language dataset
Mandarin communication language collection
Mandarin communication speech data
Mandarin communication speech dataset
Mandarin communication speech collection
Mandarin speech asr data
Mandarin speech asr dataset
Mandarin speech asr collection
Mandarin speech language data
Mandarin speech language dataset
Mandarin speech language collection
Mandarin speech speech data
Mandarin speech speech dataset
Mandarin speech speech collection
Mandarin talk asr data
Mandarin talk asr dataset
Mandarin talk asr collection
Mandarin talk language data
Mandarin talk language dataset
Mandarin talk language collection
Mandarin talk speech data
Mandarin talk speech dataset
Mandarin talk speech collection
Mandarin conversation asr data
Mandarin conversation asr dataset
Mandarin conversation asr collection
Mandarin conversation language data
Mandarin conversation language dataset
Mandarin conversation language collection
Mandarin conversation speech data
Mandarin conversation speech dataset
Mandarin conversation speech collection
Mandarin impromptu asr data
Mandarin impromptu asr dataset
Mandarin impromptu asr collection
Mandarin impromptu language data
Mandarin impromptu language dataset
Mandarin impromptu language collection
Mandarin impromptu speech data
Mandarin impromptu speech dataset
Mandarin impromptu speech collection
Mandarin free speech asr data
Mandarin free speech asr dataset
Mandarin free speech asr collection
Mandarin free speech language data
Mandarin free speech language dataset
Mandarin free speech language collection
Mandarin free speech speech data
Mandarin free speech speech dataset
Mandarin free speech speech collection
Mandarin natural speech asr data
Mandarin natural speech asr dataset
Mandarin natural speech asr collection
Mandarin natural speech language data
Mandarin natural speech language dataset
Mandarin natural speech language collection
Mandarin natural speech speech data
Mandarin natural speech speech dataset
Mandarin natural speech speech collection
Mandarin common speech asr data
Mandarin common speech asr dataset
Mandarin common speech asr collection
Mandarin common speech language data
Mandarin common speech language dataset
Mandarin common speech language collection
Mandarin common speech speech data
Mandarin common speech speech dataset
Mandarin common speech speech collection
Mandarin immediate monologue asr data
Mandarin immediate monologue asr dataset
Mandarin immediate monologue asr collection
Mandarin immediate monologue language data
Mandarin immediate monologue language dataset
Mandarin immediate monologue language collection
Mandarin immediate monologue speech data
Mandarin immediate monologue speech dataset
Mandarin immediate monologue speech collection
Mandarin Spontaneous asr data
Mandarin Spontaneous asr dataset
Mandarin Spontaneous asr collection
Mandarin Spontaneous language data
Mandarin Spontaneous language dataset
Mandarin Spontaneous language collection
Mandarin Spontaneous speech data
Mandarin Spontaneous speech dataset
Mandarin Spontaneous speech collection
chinese asr data
chinese asr dataset
chinese asr collection
chinese language data
chinese language dataset
chinese language collection
chinese speech data
chinese speech dataset
chinese speech collection
chinese discuss asr data
chinese discuss asr dataset
chinese discuss asr collection
chinese discuss language data
chinese discuss language dataset
chinese discuss language collection
chinese discuss speech data
chinese discuss speech dataset
chinese discuss speech collection
chinese small talk asr data
chinese small talk asr dataset
chinese small talk asr collection
chinese small talk language data
chinese small talk language dataset
chinese small talk language collection
chinese small talk speech data
chinese small talk speech dataset
chinese small talk speech collection
chinese conversational asr data
chinese conversational asr dataset
chinese conversational asr collection
chinese conversational language data
chinese conversational language dataset
chinese conversational language collection
chinese conversational speech data
chinese conversational speech dataset
chinese conversational speech collection
chinese chat asr data
chinese chat asr dataset
chinese chat asr collection
chinese chat language data
chinese chat language dataset
chinese chat language collection
chinese chat speech data
chinese chat speech dataset
chinese chat speech collection
chinese communication asr data
chinese communication asr dataset
chinese communication asr collection
chinese communication language data
chinese communication language dataset
chinese communication language collection
chinese communication speech data
chinese communication speech dataset
chinese communication speech collection
chinese speech asr data
chinese speech asr dataset
chinese speech asr collection
chinese speech language data
chinese speech language dataset
chinese speech language collection
chinese speech speech data
chinese speech speech dataset
chinese speech speech collection
chinese talk asr data
chinese talk asr dataset
chinese talk asr collection
chinese talk language data
chinese talk language dataset
chinese talk language collection
chinese talk speech data
chinese talk speech dataset
chinese talk speech collection
chinese conversation asr data
chinese conversation asr dataset
chinese conversation asr collection
chinese conversation language data
chinese conversation language dataset
chinese conversation language collection
chinese conversation speech data
chinese conversation speech dataset
chinese conversation speech collection
chinese impromptu asr data
chinese impromptu asr dataset
chinese impromptu asr collection
chinese impromptu language data
chinese impromptu language dataset
chinese impromptu language collection
chinese impromptu speech data
chinese impromptu speech dataset
chinese impromptu speech collection
chinese free speech asr data
chinese free speech asr dataset
chinese free speech asr collection
chinese free speech language data
chinese free speech language dataset
chinese free speech language collection
chinese free speech speech data
chinese free speech speech dataset
chinese free speech speech collection
chinese natural speech asr data
chinese natural speech asr dataset
chinese natural speech asr collection
chinese natural speech language data
chinese natural speech language dataset
chinese natural speech language collection
chinese natural speech speech data
chinese natural speech speech dataset
chinese natural speech speech collection
chinese common speech asr data
chinese common speech asr dataset
chinese common speech asr collection
chinese common speech language data
chinese common speech language dataset
chinese common speech language collection
chinese common speech speech data
chinese common speech speech dataset
chinese common speech speech collection
chinese immediate monologue asr data
chinese immediate monologue asr dataset
chinese immediate monologue asr collection
chinese immediate monologue language data
chinese immediate monologue language dataset
chinese immediate monologue language collection
chinese immediate monologue speech data
chinese immediate monologue speech dataset
chinese immediate monologue speech collection
chinese Spontaneous asr data
chinese Spontaneous asr dataset
chinese Spontaneous asr collection
chinese Spontaneous language data
chinese Spontaneous language dataset
chinese Spontaneous language collection
chinese Spontaneous speech data
chinese Spontaneous speech dataset
chinese Spontaneous speech collection

Mandarin Chinese(China) Spontaneous Monologue Smartphone speech dataset, collected from dialogues without given topics, close to casual conversation, covering generic domain. Transcribed with text content, noise and other attributes. Our dataset was collected from extensive and diversify speakers(700 Chinese in total), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
16kHz, 16bit, uncompressed wav, mono channel;
Recording condition
Low background noise;
Content category
generic domain(without given topics);
Recording device
Android Smartphone;
Speaker
700 people, 35%male and 65% femal;
Country
China(CHN);
Language(Region) Code
zh-CN;
Language
Mandarin Chinese;
Features of annotation
Transcription text; 4 noise symbols; mainly annotates for near-end speech
Accuracy Rate
Sentence Accuracy Rate (SAR) 95%
Sample Sample
  • Audio

    你觉得我说话语速快吗

  • Audio

    看看到时间了然后我给你发过去

  • Audio

    你这几天你[P]你都几点睡觉呀

  • Audio

    嗯可以了是吧

  • Audio

    然后那个得准备好检查的东西

Recommended DatasetsRecommended Dataset
501 Hours - Indonesian(Indonesia) Real-world Casual Conversation and Monologue speech dataset

Indonesian(Indonesia) Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Indonesian Colloquial Video text annotation
1,013 Hours - English(Britain) Real-world Casual Conversation and Monologue speech dataset

English(Britain) Real-world Casual Conversation and Monologue speech dataset, covers conversation, self-media, etc, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Spontaneous Speech british english
157 Hours - Uyghur Spontaneous Dialogue Microphone speech dataset

Uyghur Spontaneous Dialogue Microphone speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(326 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

维吾尔语 维语 维吾尔 自然对话 自然对话语音数据 自然对话数据 对话数据集 对话数据 对话语音 对话式AI数据 自然对话语音数据 AI对话语音数据 AI自然语音对话 外语自然对话数据 电话
136 Hours - Korean(Korea) Spontaneous Dialogue Telephony speech dataset

Korean(Korea) Spontaneous Dialogue Telephony speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(216 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Conversational telephone korean
128 Hours - English(Australia) Children Real-world Casual Conversation and Monologue speech dataset

English(Australia) Children Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, lecture, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age, accent and other attributes. Our dataset was collected from extensive and diversify speakers(12 years old and younger children), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Australian English Spontaneous Speech text annotation
149 Hours - English(the United Kindom) Children Real-world Casual Conversation and Monologue speech dataset

English(United Kindom) Children Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, lecture, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age, accent and other attributes. Our dataset was collected from extensive and diversify speakers(12 years old and younger children), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Spontaneous Speech text annotation British English
145 Hours - Spanish(spain) Children Real-world Casual Conversation and Monologue speech dataset

Spanish(spain) Children Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, lecture, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age, accent and other attributes. Our dataset was collected from extensive and diversify speakers(12 years old and younger children), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Spanish Spontaneous Speech text annotation
189 Hours - Spanish(Latin America) Children Real-world Casual Conversation and Monologue speech dataset

Spanish(Latin America) Children Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, lecture, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age, accent and other attributes. Our dataset was collected from extensive and diversify speakers(12 years old and younger children), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Latin American Spanish Spontaneous Speech
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

2edda514-7d94-401d-bdbb-6746cac5f7d9

078187dd-4067-4d30-bfa3-dc37a4ebc2c8