262 Hours - Japanese Children's Speech Dataset

Japanese

children

speech

This Japanese children's speech dataset contains approximately 262 hours of audio from 411 speakers, comprising 147,668 scripted utterances. Speakers are Japanese children aged 6 to 13, divided into a lower-grade group (ages 6–9, 179 speakers) and an upper-grade group (ages 10–13, 232 speakers), with a balanced gender distribution. Recordings were made on smartphones in 16 kHz, 16-bit mono WAV format and are accompanied by utterance transcriptions and the read-aloud scripts. The dataset is applicable to tasks such as Japanese children's ASR, TTS, speaker recognition, and pronunciation assessment.
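As a rough illustration of how the stated audio format could be checked after download, the sketch below uses Python's standard `wave` module to test whether a file is 16 kHz, 16-bit, mono. The function names and the file layout are assumptions made for this example and are not part of the dataset's documentation. The last lines derive the mean utterance length implied by the published totals (262 hours over 147,668 utterances, roughly 6.4 seconds each).

```python
import wave

# Expected format, taken from the dataset description: 16 kHz, 16-bit, mono.
EXPECTED_RATE_HZ = 16_000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bits per sample
EXPECTED_CHANNELS = 1


def wav_matches_spec(path):
    """Return True if the PCM WAV file at `path` is 16 kHz, 16-bit, mono.

    Hypothetical helper for illustration; `wave.open` raises `wave.Error`
    for files that are not uncompressed PCM WAV.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
        )


def wav_duration_seconds(path):
    """Return the duration of the WAV file at `path` in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


# Mean utterance length implied by the published totals.
TOTAL_HOURS = 262
TOTAL_UTTERANCES = 147_668
mean_utterance_seconds = TOTAL_HOURS * 3600 / TOTAL_UTTERANCES
```

Summing `wav_duration_seconds` over all delivered files would give a simple way to compare the actual total against the advertised figure of approximately 262 hours.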

This is a paid dataset for commercial use, research purposes, and more. Licensed, ready-made datasets help jump-start AI projects.

Recommended Dataset

103 Hours Dutch Speech Dataset with Entity Annotations

This Dutch speech dataset covers a wide range of entity types, including personal names, phone numbers, addresses, alphanumeric sequences, email addresses, product model numbers, product serial numbers, and monetary amounts, reflecting real-life interaction scenarios. It includes corresponding transcriptions and other attribute information. The dataset was collected from speakers with diverse geographical and background profiles, which is intended to improve model performance in complex real-world tasks. It has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All our datasets comply with GDPR, CCPA, and PIPL.

dutch ner dataset dutch asr dataset dutch speech dataset spoken entity dataset entity recognition dataset voice assistant dataset

107 Hours Thai Speech Dataset with Entity Annotations

This Thai speech dataset contains a wide range of entity categories, including person names, phone numbers, addresses, alphanumeric sequences, email addresses, product models, product serial numbers, and monetary values. The recordings are collected through scripted monologues and are designed to reflect real-world speech scenarios. The dataset includes high-quality smartphone recordings, transcriptions, and relevant metadata. It was collected from speakers with diverse geographical and background profiles, which is intended to improve model performance in complex real-world tasks, and it has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All our datasets comply with GDPR, CCPA, and PIPL.

thai speech dataset thai asr dataset entity recognition dataset spoken entity dataset voice assistant dataset

122 Hours Japanese Speech Dataset – Entity-Annotated Monologue Audio for ASR & AI Training

This dataset contains 122 hours of high-quality Japanese scripted monologue speech collected from diverse speakers across multiple geographic regions. It offers rich structured entity coverage, including person names, phone numbers, addresses, alphanumeric sequences, email addresses, product models, product serial numbers, and monetary amounts, mirroring real-world interactions. The speech transcriptions include text content and other attributes. The dataset has been quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All our datasets comply with GDPR, CCPA, and PIPL.

japanese speech dataset speech recognition dataset japanese japanese ASR dataset speech to text dataset japanese entity annotated speech dataset monologue speech dataset japanese

150 Hours Italian Speech Dataset with Entity Annotations

This Italian speech dataset covers a wide range of entity types, including personal names, phone numbers, addresses, alphanumeric sequences, email addresses, product model numbers, product serial numbers, and monetary amounts, reflecting real-life interaction scenarios. It includes corresponding transcriptions and other attribute information. The dataset was collected from speakers with diverse geographical and background profiles, which is intended to improve model performance in complex real-world tasks. It has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All our datasets comply with GDPR, CCPA, and PIPL.

italian speech dataset italian asr dataset italian ner dataset named entity recognition dataset entity extraction dataset entity recognition dataset

141 Hours German Speech Dataset with Entity Annotations

This German speech dataset covers a wide range of entity types, including personal names, phone numbers, addresses, alphanumeric sequences, email addresses, product model numbers, product serial numbers, and monetary amounts, reflecting real-life interaction scenarios. It includes corresponding transcriptions and other attribute information. The dataset was collected from speakers with diverse geographical and background profiles, which is intended to improve model performance in complex real-world tasks. It has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All our datasets comply with GDPR, CCPA, and PIPL.

germany speech dataset germany asr dataset germany ner dataset entity recognition dataset spoken entity recognition dataset entity extraction dataset

158 Hours French Speech Dataset with Entity Annotations

This French scripted speech dataset contains a wide range of entity categories, including person names, phone numbers, addresses, alphanumeric sequences, email addresses, product models, product serial numbers, and monetary values. The recordings are collected through scripted monologues and are designed to reflect real-world speech scenarios. The dataset includes high-quality smartphone recordings, transcriptions, and relevant metadata. It was collected from speakers with diverse geographical and background profiles, which is intended to improve model performance in complex real-world tasks, and it has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All our datasets comply with GDPR, CCPA, and PIPL.

french speech dataset french asr dataset french voice dataset french ner dataset spoken entity dataset entity recognition dataset

168 Hours English Speech Dataset with Entity Annotations

This English speech dataset covers a wide range of entity types, including personal names, phone numbers, addresses, alphanumeric sequences, email addresses, product model numbers, product serial numbers, and monetary amounts, reflecting real-life interaction scenarios. It includes corresponding transcriptions and other attribute information. The dataset was collected from speakers with diverse geographical and background profiles, which is intended to improve model performance in complex real-world tasks, and it has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All our datasets comply with GDPR, CCPA, and PIPL.

english speech dataset english asr dataset english ner dataset spoken entity dataset entity recognition dataset voice assistant dataset

112 Hours Arabic Speech Dataset – Entity-Annotated Scripted Monologue Audio for ASR & AI Training

This Arabic speech dataset contains 112 hours of high-quality scripted monologue recordings collected on smartphones. It covers several domains and includes person, phone number, address, alphanumeric sequence, email, product model, product serial number, and money entities, mirroring real-world interactions. Recordings are transcribed with text content and other attributes. The dataset was collected from a large and geographically diverse set of speakers, which is intended to improve model performance in complex real-world tasks. It has been quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All our datasets comply with GDPR, CCPA, and PIPL.

arabic speech dataset arabic speech recognition dataset arabic ASR dataset arabic speech to text dataset entity annotated speech dataset arabic NLP dataset labeled arabic speech dataset
