en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

290 Hours - Korean(Korea) Spontaneous Dialogue Smartphone speech dataset

korean
Conversational speech
Korean asr data
Korean asr dataset
Korean asr collection
Korean language data
Korean language dataset
Korean language collection
Korean discuss asr data
Korean discuss asr dataset
Korean discuss asr collection
Korean discuss language data
Korean discuss language dataset
Korean discuss language collection
Korean small talk asr data
Korean small talk asr dataset
Korean small talk asr collection
Korean small talk language data
Korean small talk language dataset
Korean small talk language collection
Korean conversational asr data
Korean conversational asr dataset
Korean conversational asr collection
Korean conversational language data
Korean conversational language dataset
Korean conversational language collection
Korean chat asr data
Korean chat asr dataset
Korean chat asr collection
Korean chat language data
Korean chat language dataset
Korean chat language collection
Korean communication asr data
Korean communication asr dataset
Korean communication asr collection
Korean communication language data
Korean communication language dataset
Korean communication language collection
Korean speech asr data
Korean speech asr dataset
Korean speech asr collection
Korean speech language data
Korean speech language dataset
Korean speech language collection
Korean talk asr data
Korean talk asr dataset
Korean talk asr collection
Korean talk language data
Korean talk language dataset
Korean talk language collection
Korean conversation asr data
Korean conversation asr dataset
Korean conversation asr collection
Korean conversation language data
Korean conversation language dataset
Korean conversation language collection
Korea asr data
Korea asr dataset
Korea asr collection
Korea language data
Korea language dataset
Korea language collection
Korea discuss asr data
Korea discuss asr dataset
Korea discuss asr collection
Korea discuss language data
Korea discuss language dataset
Korea discuss language collection
Korea small talk asr data
Korea small talk asr dataset
Korea small talk asr collection
Korea small talk language data
Korea small talk language dataset
Korea small talk language collection
Korea conversational asr data
Korea conversational asr dataset
Korea conversational asr collection
Korea conversational language data
Korea conversational language dataset
Korea conversational language collection
Korea chat asr data
Korea chat asr dataset
Korea chat asr collection
Korea chat language data
Korea chat language dataset
Korea chat language collection
Korea communication asr data
Korea communication asr dataset
Korea communication asr collection
Korea communication language data
Korea communication language dataset
Korea communication language collection
Korea speech asr data
Korea speech asr dataset
Korea speech asr collection
Korea speech language data
Korea speech language dataset
Korea speech language collection
Korea talk asr data
Korea talk asr dataset
Korea talk asr collection
Korea talk language data
Korea talk language dataset
Korea talk language collection
Korea conversation asr data
Korea conversation asr dataset
Korea conversation asr collection
Korea conversation language data
Korea conversation language dataset
Korea conversation language collection
Seoul asr data
Seoul asr dataset
Seoul asr collection
Seoul language data
Seoul language dataset
Seoul language collection
Seoul discuss asr data
Seoul discuss asr dataset
Seoul discuss asr collection
Seoul discuss language data
Seoul discuss language dataset
Seoul discuss language collection
Seoul small talk asr data
Seoul small talk asr dataset
Seoul small talk asr collection
Seoul small talk language data
Seoul small talk language dataset
Seoul small talk language collection
Seoul conversational asr data
Seoul conversational asr dataset
Seoul conversational asr collection
Seoul conversational language data
Seoul conversational language dataset
Seoul conversational language collection
Seoul chat asr data
Seoul chat asr dataset
Seoul chat asr collection
Seoul chat language data
Seoul chat language dataset
Seoul chat language collection
Seoul communication asr data
Seoul communication asr dataset
Seoul communication asr collection
Seoul communication language data
Seoul communication language dataset
Seoul communication language collection
Seoul speech asr data
Seoul speech asr dataset
Seoul speech asr collection
Seoul speech language data
Seoul speech language dataset
Seoul speech language collection
Seoul talk asr data
Seoul talk asr dataset
Seoul talk asr collection
Seoul talk language data
Seoul talk language dataset
Seoul talk language collection
Seoul conversation asr data
Seoul conversation asr dataset
Seoul conversation asr collection
Seoul conversation language data
Seoul conversation language dataset
Seoul conversation language collection

Korean(Korea) Spontaneous Dialogue Smartphone speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(442 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
16kHz, 16 bit, wav, mono channel;
Content category
Dialogue based on given topics;
Recording condition
Low background noise (indoor);
Recording device
Android smartphone, iPhone;
Speaker
442 native speakers in total, 43% male and 57% female;
Country
Korea(KOR);
Language(Region) Code
ko-KR;
Language
Korean;
Features of annotation
Transcription text, timestamp, speaker ID, gender, PII redacted.
Accuracy Rate
Sentence Accuracy Rate (SAR) 98%
Sample Sample
  • Audio

    어 맞아요 전여빈 배우도 너무 좋고 천우희 정말 좋아해요

  • Audio

    그친구 이름이 되게 흔했는데

  • Audio

    예를들면 강하늘 강하늘 배우도 되게 좋아하는데

  • Audio

    그분이 되게 제가 좋아하는 작품이랑 안좋아하는 작품을 되게 거의 번갈아가면서 많이 하셨어요.

  • Audio

    무거운 연기도하고 현실연기도 하고 되게 다 잘하시고 소화를 일단 너무 잘 하시는것 같아요.

Recommended DatasetsRecommended Dataset
93 Hours - Russian(Russia) Spontaneous Dialogue Telephony speech dataset

Russian(Russia) Spontaneous Dialogue Telephony speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(126 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Conversational speech Russian asr data russian asr dataset russia
283 Hours - Indonesian(Indonesia) Spontaneous Dialogue Telephony speech dataset

Indonesian(Indonesia) Spontaneous Dialogue Telephony speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(376 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

audio data dataset conversational asr data Indonesian telephone
163 Hours - Russian(Russia) Children Real-world Casual Conversation and Monologue speech dataset

Russian(Russia) Children Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, lecture, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age, accent and other attributes. Our dataset was collected from extensive and diversify speakers(12 years old and younger children), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Russian Child Spontaneous Speech
162 Hours - French(France) Children Real-world Casual Conversation and Monologue speech dataset

French(France) Children Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, lecture, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age, accent and other attributes. Our dataset was collected from extensive and diversify speakers(12 years old and younger children), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

French Spontaneous Speech Child
107 Hours - Spanish(Mexico) Spontaneous Dialogue Smartphone speech dataset

Spanish(Mexico) Spontaneous Dialogue Smartphone speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(126 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

audio data dataset conversational asr data spanish mexican
80 Hours - French(Canada) Spontaneous Dialogue Smartphone speech dataset

French(Canada) Spontaneous Dialogue Smartphone speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(126 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

audio data dataset conversational asr data french Canadian
104 Hours - Portuguese(Portugal) Spontaneous Dialogue Smartphone speech dataset

Portuguese(Portugal) Spontaneous Dialogue Smartphone speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(124 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

audio data dataset conversational asr data Portuguese European
101 Hours - Italian(Italy) Children Real-world Casual Conversation and Monologue speech dataset

Italian(Italy) Children Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, lecture, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age, accent and other attributes. Our dataset was collected from extensive and diversify speakers(12 years old and younger children), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Spontaneous Speech Data text annotation Italian
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

1947a79a-6bd0-4d16-8dea-06ecb746ff9a

9236a6dd-a4c0-42e8-ae5c-175ce3fa982e