Mandarin Chinese(China) Spontaneous Dialogue 48KHZ Smartphone speech dataset

Mandarin Chinese

Spontaneous Dialogue

Conversation

48khz

Mandarin Chinese(China) Spontaneous Dialogue 48KHZ Smartphone speech dataset, including at least 20 topics, covering a wide range of vocabulary and grammatical structures, encompassing various dialect regions of China, mirrors real-world interactions. Transcribed with text content, timestamp, speaker's ID and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.

Recommended Dataset

500 Hours - Japanese Full-Duplex Multi-Channel Speech Dataset (48khz)

This data collected from dialogues based on given topics. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Full-Duplex Speech Dataset Multi-Channel Speech Dataset Japanese Speech Dataset Japanese Audio Dataset

601 Hours - Spanish(Argentina) Real-world Casual Conversation and Monologue speech dataset

Spanish(Argentina) Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Spanish Casual Conversation ASR Argentina

INTERSPEECH 2025 MLC-SLM Challenge Dataset

The INTERSPEECH 2025 MLC-SLM Challenge Dataset, curated by Nexdata, is derived from fifteen proprietary conversational speech corpora. Distinguished by exceptional annotation accuracy and operational reliability, this dataset is engineered to address critical challenges in multilingual automatic speech recognition (ASR) and long-context comprehension. It meticulously replicates real-world complexities including spontaneous interruptions and speaker overlaps across 11 languages (1500 hours total duration), thereby providing robust training resources for developing world-ready ASR systems. All data collection and processing strictly comply with international privacy regulations including GDPR, CCPA and PIPL, with rigorous protocols ensuring participant anonymity and ethical data usage throughout the lifecycle.

workshop audio dataset mlc-slm dataset ASR speech recognition data

581 Hours Greek Speech Dataset – Real world Casual Conversation & Monologue for ASR

The 600 Hours Greek Real-World Speech Dataset includes both casual conversations and monologues, covers self-media, conversation, live, variety show and other generic domains, mirroring real-world interactions. Transcribed with text content, speaker's ID, and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

greek speech dataset greek ASR training data greek conversation corpus greek monologue speech greek speech recognition dataset speech-to-text greek data greek voice dataset greek transcription dataset

600 Hours Norwegian Speech Dataset – Real-world Casual Conversation & Monologue for ASR

The 600 Hours Norwegian Real-World Speech Dataset includes both casual conversations and monologues, covering domains such as self-media, live shows, and other generic domains, mirroring real-world interactions. Transcribed with text content, speaker's ID, and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

norwegian speech dataset norwegian ASR training data norwegian conversation corpus norwegian monologue speech norwegian speech recognition dataset speech-to-text norwegian data norwegian voice dataset multilingual speech data norwegian transcription dataset

Gujatati(India) Speech Dataset (Scripted Dialogue)

This dataset contains Gujarati speech, covers several domains, mirrors real-world interactions. Transcribed with text content, and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

gujarati audio dataset gujarati asr dataset gujarati speech dataset gujarati tts dataset

Spanish(Mexico) Real-world Casual Conversation and Monologue speech dataset

Spanish(Mexico) Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Mexico Spanish Casual Conversation ASR

460 Hour Swedish Speech Dataset - Casual Conversations & Monologues for ASR & TTS

This dataset contains 460 hours of Swedish speech, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Swedish speech dataset Swedish conversation corpus Real-world Swedish audio Swedish ASR training data TTS training dataset Swedish Swedish NLP corpus

Mandarin Chinese(China) Spontaneous Dialogue 48KHZ Smartphone speech dataset

Mandarin Chinese Spontaneous Dialogue Conversation 48khz

Current Project Maturity

Mandarin Chinese

Spontaneous Dialogue

Conversation

48khz