Korean Telephony Speech Dataset – 136 Hours of Spontaneous Calls
This Korean Telephony Speech Dataset contains 136 hours of spontaneous dialogue recorded over phone calls. It covers more than 20 real-life domains, including customer service, e-commerce, finance, travel, and daily conversation, and consists of natural two-speaker conversations collected over diverse telephony channels. Each sample is transcribed and annotated with speaker ID, gender, age, and other metadata. The recordings come from 216 native Korean speakers from different regions, which provides speaker and regional variety that can help models generalize. The dataset is suited to automatic speech recognition (ASR), speaker diarization, and call center conversational AI systems. All data complies with GDPR, CCPA, and PIPL for responsible and legal AI development.
Korean telephony speech dataset, Korean telephone audio, telephone conversation Korean, call center voice dataset Korean, Korean spoken dialogue corpus, multilingual telephony dataset, Korean voice dataset, speech-to-text Korean, phone call, spontaneous Korean speech data