Home > All Category Datasets > Speech Recognition Datasets > 499 Hours - Vietnamese Scripted Monologue Smartphone speech dataset.

499 Hours - Vietnamese Scripted Monologue Smartphone speech dataset.

Vietnamese

Vietnamese Scripted Monologue Smartphone speech dataset, collected from monologue based on given prompts. Transcribed with text content. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.

Recommended Dataset

262 Hours - Japanese Children Speech Dataset for ASR and Pronunciation Training

This dataset contains approximately 262 hours of Japanese children's speech data collected from 411 speakers aged 6 to 13, including 147,668 scripted utterances with transcriptions. The speakers are categorized into lower-grade (ages 6–9, 179 speakers) and upper-grade (ages 10–13, 232 speakers) groups, with balanced gender distribution. Recordings were collected using smartphones in 16kHz/16bit mono WAV format and include both utterance transcriptions and read-aloud scripts. The dataset is applicable to tasks such as Japanese children's ASR, TTS, speaker recognition, and pronunciation assessment.

japanese children speech dataset pediatric speech dataset children speech dataset kids speech dataset children tts dataset

103 Hours Dutch Speech Dataset with Entity Annotations

This Dutch speech dataset covers a wide range of entity types—such as personal names, phone numbers, addresses, alphanumeric sequences, email addresses, product model numbers, product serial numbers, and monetary amounts—authentically reflecting real-life interaction scenarios, and includes corresponding transcriptions and other attribute information. Our dataset was collected from speakers with diverse geographical and background profiles, thereby enhancing the model's performance in real-world, complex tasks. The dataset has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

dutch ner dataset dutch asr dataset dutch speech dataset spoken entity dataset entity recognition dataset voice assistant dataset

107 Hours Thai Speech Dataset with Entity Annotations

This Thai speech dataset contains a wide range of entity categories, including person names, phone numbers, addresses, alphanumeric sequences, email addresses, product models, product serial numbers, and monetary values. The recordings are collected through scripted monologues and are designed to reflect real-world speech scenarios. The dataset includes high-quality smartphone recordings, transcriptions, and relevant metadata. Our dataset was collected from speakers with diverse geographical and background profiles, thereby enhancing the model's performance in real-world, complex tasks; the dataset has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

thai speech dataset thai asr dataset entity recognition dataset spoken entity dataset voice assistant dataset

122 Hours Japanese Speech Dataset – Entity-Annotated Monologue Audio for ASR & AI Training

This dataset contains 122 hours of high-quality Japanese scripted monologue speech collected from diverse speakers across multiple geographic regions.The dataset includes rich structured entity coverage such as person names, phone numbers, addresses, alphanumeric sequences, Emails, product Models, product serial numbers, and money entities, mirrors real-world interactions. The speech transcriptions include text content and other attributes. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

japanese speech dataset speech recognition dataset japanese japanese ASR dataset speech to text dataset japanese entity annotated speech dataset monologue speech dataset japanese

150 Hours Italian Speech Dataset with Entity Annotations

This Italian speech dataset covers a wide range of entity types—such as personal names, phone numbers, addresses, alphanumeric sequences, email addresses, product model numbers, product serial numbers, and monetary amounts—authentically reflecting real-life interaction scenarios, and includes corresponding transcriptions and other attribute information. Our dataset was collected from speakers with diverse geographical and background profiles, thereby enhancing the model's performance in real-world, complex tasks. The dataset has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

italian speech dataset italian asr dataset italian ner dataset named entity recognition dataset entity extraction dataset entity recognition dataset

141 Hours Germany Speech Dataset with Entity Annotations

This Germany speech dataset covers a wide range of entity types—such as personal names, phone numbers, addresses, alphanumeric sequences, email addresses, product model numbers, product serial numbers, and monetary amounts—authentically reflecting real-life interaction scenarios, and includes corresponding transcriptions and other attribute information. Our dataset was collected from speakers with diverse geographical and background profiles, thereby enhancing the model's performance in real-world, complex tasks. The dataset has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

germany speech dataset germany asr dataset germany ner dataset entity recognition dataset spoken entity recognition dataset entity extraction dataset

158 Hours French Speech Dataset with Entity Annotations

This French scripted speech dataset contains a wide range of entity categories, including person names, phone numbers, addresses, alphanumeric sequences, email addresses, product models, product serial numbers, and monetary values. The recordings are collected through scripted monologues and are designed to reflect real-world speech scenarios. The dataset includes high-quality smartphone recordings, transcriptions, and relevant metadata. Our dataset was collected from speakers with diverse geographical and background profiles, thereby enhancing the model's performance in real-world, complex tasks; the dataset has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

french speech dataset french asr dataset french voice dataset french ner dataset spoken entity dataset entity recognition dataset

168 Hours English Speech Dataset with Entity Annotations

This English speech dataset covers a wide range of entity types—such as personal names, phone numbers, addresses, alphanumeric sequences, email addresses, product model numbers, product serial numbers, and monetary amounts—authentically reflecting real-life interaction scenarios, and includes corresponding transcriptions and other attribute information. Our dataset was collected from speakers with diverse geographical and background profiles, thereby enhancing the model's performance in real-world, complex tasks; the dataset has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

english speech dataset english asr dataset english ner dataset spoken entity dataset entity recognition dataset voice assistant dataset

499 Hours - Vietnamese Scripted Monologue Smartphone speech dataset.

Vietnamese

Current Project Maturity