262 Hours - Japanese Children's Speech Dataset

Japanese
children
speech

This Japanese children's speech dataset contains approx. 262 hours of speech from 411 speakers, comprising 147,668 scripted utterances. Speakers are Japanese children aged 6 to 13, categorized into lower-grade (ages 6–9, 179 speakers) and upper-grade (ages 10–13, 232 speakers) groups with balanced gender distribution. Recordings were made on smartphones in 16 kHz, 16-bit mono WAV format and are accompanied by utterance transcriptions and read-aloud scripts. The dataset is applicable to tasks such as Japanese children's ASR, TTS, speaker recognition, and pronunciation assessment.
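As a quick sanity check on the figures quoted above, the sketch below derives the total speaker count and the rough average utterance length. The constants are copied from the description; the function name `summarize` is made up for illustration, and because the total duration is only approximate, the averages are approximate too.

```python
# Figures quoted in the dataset description (total hours is approximate).
TOTAL_HOURS = 262
UTTERANCES = 147_668
LOWER_GRADE_SPEAKERS = 179  # ages 6-9
UPPER_GRADE_SPEAKERS = 232  # ages 10-13


def summarize():
    """Return (total speakers, avg seconds per utterance, avg hours per speaker)."""
    total_speakers = LOWER_GRADE_SPEAKERS + UPPER_GRADE_SPEAKERS
    avg_utterance_sec = TOTAL_HOURS * 3600 / UTTERANCES
    avg_hours_per_speaker = TOTAL_HOURS / total_speakers
    return total_speakers, avg_utterance_sec, avg_hours_per_speaker


if __name__ == "__main__":
    speakers, avg_sec, avg_hours = summarize()
    print(f"speakers: {speakers}")
    print(f"average utterance length: {avg_sec:.1f} s")
    print(f"average speech per speaker: {avg_hours * 60:.0f} min")
```

The two group sizes add up to the stated 411 speakers, and the averages come out at roughly 6.4 seconds per utterance and about 38 minutes of speech per speaker.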

Paid Datasets
This is a paid dataset for commercial use, research, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Data size
411 speakers, approx. 262 hours, 147,668 utterances
Speaker profile
Japanese children aged 6–13
Device
Smartphone
Data format
Audio data format: WAV; transcription format: TSV
Data content
Scripted read-aloud speech, categorized into lower-grade (ages 6–9) and upper-grade (ages 10–13)
Annotation content
Utterance transcription, read-aloud scripts
Accuracy
Character Accuracy Rate 98% or above
Application Scenarios
ASR, TTS, speaker recognition, pronunciation assessment
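
A buyer may want to confirm that delivered audio matches the stated 16 kHz, 16-bit mono WAV format. The following is a minimal sketch using Python's standard `wave` module; the function name `check_wav` and the example file path are hypothetical, since the page does not describe the dataset's directory layout or the columns of the TSV transcriptions.

```python
import wave

# Audio parameters as stated in the dataset description.
EXPECTED_RATE_HZ = 16_000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16-bit samples
EXPECTED_CHANNELS = 1            # mono


def check_wav(path):
    """Return (matches_spec, duration_seconds) for a single WAV file."""
    with wave.open(path, "rb") as wav:
        matches_spec = (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
        )
        duration_seconds = wav.getnframes() / wav.getframerate()
    return matches_spec, duration_seconds


if __name__ == "__main__":
    import sys

    # Usage: python check_wav.py path/to/utterance.wav  (path is illustrative)
    ok, seconds = check_wav(sys.argv[1])
    print(f"matches spec: {ok}, duration: {seconds:.2f} s")
```

Summing the returned durations over all files would also give a way to compare the delivered audio against the approximate 262-hour total.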