401 Hours - Sichuan&Chongqing Dialect Conversation (Bilingual Annotated) Speech Data by Mobile Phone

Dialect

Dialogue

Sichuan dialect

Chongqing dialect

Sichuan&Chongqing Dialect Conversation (Bilingual Annotated) Speech Data by Mobile Phone was collected from dialogues on given topics. It is transcribed with text content, timestamps, speaker ID, gender, and other attributes. The data was collected from a large, geographically diverse pool of speakers, which helps improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and usage; all our datasets comply with GDPR, CCPA, and PIPL.

This is a paid dataset for commercial use, research, and more. Licensed, ready-made datasets help jump-start AI projects.

Specifications

Format

24 kHz, 16-bit, uncompressed WAV, mono channel;

Recording Environment

quiet indoor environment, without echo;

Recording content

dozens of topics are specified, and the speakers hold dialogues on those topics while being recorded;

Demographics

486 speakers; balanced gender ratio among speakers, with age distribution ranging from 18 to 60 years old;

Annotation

individual sentences are extracted and annotated with start and end timestamps, speaker ID, and spoken text content; noise is also annotated;

Device

Android mobile phone, iPhone;

Language

Sichuan&Chongqing dialect;

Application scenarios

speech recognition; voiceprint recognition;

Accuracy rate

word accuracy rate of 98%.
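The specifications above can be spot-checked on a delivered sample with a short script. The following is a minimal Python sketch, not the vendor's tooling: the function names `check_wav_format` and `token_accuracy` are hypothetical, the expected values are copied from the Format entry, and the accuracy formula (one minus edit distance divided by reference length) is an assumption, since the listing does not say how the 98% word accuracy rate is computed or how text is tokenized.

```python
import wave

# Expected values copied from the "Format" entry above.
EXPECTED_RATE_HZ = 24_000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16-bit samples
EXPECTED_CHANNELS = 1            # mono


def check_wav_format(path):
    """Return a list of mismatches between a WAV file and the listed format."""
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_RATE_HZ:
            problems.append(f"sample rate {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH_BYTES:
            problems.append(f"sample width {wav.getsampwidth() * 8} bit")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"{wav.getnchannels()} channels")
        if wav.getcomptype() != "NONE":
            problems.append(f"compression {wav.getcomptype()}")
    return problems


def token_accuracy(reference, hypothesis):
    """Assumed metric: 1 - edit_distance / len(reference) over token sequences."""
    if not reference:
        raise ValueError("reference must not be empty")
    # Standard dynamic-programming Levenshtein distance, one row at a time.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_tok in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_tok in enumerate(hypothesis, start=1):
            cost = 0 if ref_tok == hyp_tok else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return 1.0 - prev[-1] / len(reference)
```

An empty list from `check_wav_format` means the file header matches the listed format. For `token_accuracy`, the reference would be the delivered transcript of a sentence and the hypothesis an independent re-transcription; for Chinese text, passing `list(text)` compares character by character, which is one possible reading of "word" here.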

Recommended Dataset

500 Hours - Japanese(Japan) 48khz Full-Duplex Spontaneous Dialogue Smartphone speech dataset

The Japanese(Japan) 48 kHz Full-Duplex Spontaneous Dialogue Smartphone speech dataset was collected from dialogues on given topics. It is transcribed with text content, speaker ID, gender, age, and other attributes. The data was collected from a large, geographically diverse pool of speakers, which helps improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and usage; all our datasets comply with GDPR, CCPA, and PIPL.

Japanese Japan Dialogue Full-Duplex 48khz

601 Hours - Spanish(Argentina) Real-world Casual Conversation and Monologue speech dataset

The Spanish(Argentina) Real-world Casual Conversation and Monologue speech dataset covers self-media, conversation, variety shows, and other generic domains, mirroring real-world interactions. It is transcribed with text content, speaker ID, gender, and other attributes. The data was collected from a large, geographically diverse pool of speakers, which helps improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and usage; all our datasets comply with GDPR, CCPA, and PIPL.

Spanish Casual Conversation ASR Argentina

INTERSPEECH 2025 MLC-SLM Challenge Dataset

The INTERSPEECH 2025 MLC-SLM Challenge Dataset, curated by Nexdata, is derived from fifteen proprietary conversational speech corpora. Distinguished by exceptional annotation accuracy and operational reliability, this dataset is engineered to address critical challenges in multilingual automatic speech recognition (ASR) and long-context comprehension. It meticulously replicates real-world complexities including spontaneous interruptions and speaker overlaps across 11 languages (1500 hours total duration), thereby providing robust training resources for developing world-ready ASR systems. All data collection and processing strictly comply with international privacy regulations including GDPR, CCPA and PIPL, with rigorous protocols ensuring participant anonymity and ethical data usage throughout the lifecycle.

workshop audio dataset mlc-slm dataset ASR speech recognition data

4600 Hours - Mandarin Full-Duplex Multi-Channel Speech Dataset

The 4600 Hours Mandarin Full-Duplex Multi-Channel Speech Dataset was collected from dialogues on given topics. It is transcribed with text content, speaker ID, gender, age, and other attributes. The data was collected from a large, geographically diverse pool of speakers, which helps improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and usage; all our datasets comply with GDPR, CCPA, and PIPL.

Mandarin speech dataset multi-stream Mandarin audio data conversational Mandarin corpus Chinese voice dataset full-duplex speech dataset multi-stream speech dataset multi-channel audio dataset

600 Hours Greek Speech Dataset – Real world Casual Conversation & Monologue for ASR

The 600 Hours Greek Real-World Speech Dataset includes both casual conversations and monologues, covering self-media, conversation, live shows, variety shows, and other generic domains, mirroring real-world interactions. It is transcribed with text content, speaker ID, and other attributes. The data was collected from a large, geographically diverse pool of speakers, which helps improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and usage; all our datasets comply with GDPR, CCPA, and PIPL.

greek speech dataset greek ASR training data greek conversation corpus greek monologue speech greek speech recognition dataset speech-to-text greek data greek voice dataset greek transcription dataset

600 Hours Norwegian Speech Dataset – Real-world Casual Conversation & Monologue for ASR

The 600 Hours Norwegian Real-World Speech Dataset includes both casual conversations and monologues, covering self-media, live shows, and other generic domains, mirroring real-world interactions. It is transcribed with text content, speaker ID, and other attributes. The data was collected from a large, geographically diverse pool of speakers, which helps improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and usage; all our datasets comply with GDPR, CCPA, and PIPL.

norwegian speech dataset norwegian ASR training data norwegian conversation corpus norwegian monologue speech norwegian speech recognition dataset speech-to-text norwegian data norwegian voice dataset multilingual speech data norwegian transcription dataset

1300 Hours Gujarati(India) Speech Dataset (Scripted Dialogue)

This dataset contains 1,300 hours of Gujarati speech covering several domains and mirroring real-world interactions. It is transcribed with text content and other attributes. The data was collected from a large, geographically diverse pool of speakers, which helps improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and usage; all our datasets comply with GDPR, CCPA, and PIPL.

gujarati audio dataset gujarati asr dataset gujarati speech dataset gujarati tts dataset

352 Hours - Urdu Full-Duplex Spontaneous Dialogue Smartphone speech dataset

The Urdu Full-Duplex Spontaneous Dialogue Smartphone speech dataset was collected from dialogues on given topics. It is transcribed with text content, speaker ID, gender, age, and other attributes. The data was collected from a large, geographically diverse pool of speakers, which helps improve model performance on real-world, complex tasks. Quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and usage; all our datasets comply with GDPR, CCPA, and PIPL.

full duplex Dialogue