en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

m.nexdata.datatang.com

299 Hours – US English Children Speech Dataset for ASR & TTS

children speech dataset
kids voice dataset
child speech corpus
US English children dataset
children TTS dataset
child speech recognition data
American English children voice dataset

This dataset includes 299 hours of US English children’s speech, recorded as scripted monologues, collected from monologue based on given scripts, covering essay stories. The data covers a variety of categories, including children's books and textbooks, and is rich in content that aligns with children's language habits.Transcribed with text content and other attributes. Our dataset is collected from a wide and diverse range of speakers geographically, which supports tasks like speech recognition, TTS, and child voice modeling. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
16kHz, 16bit, uncompressed wav, mono channel
Recording environment
quiet indoor environment, without echo
Recording content (read speech)
children's books and textbooks
Speaker
286 American children, 53% of which are female, all children are 5-12 years old
Recording device
Android smartphone, iPhone
Country
The United States of America(USA)
Language
English
Language(Region) Code
en-US
Accuracy rate
Sentence accuracy rate(SAR) 95%
Sample Sample
  • Audio

    he wore baggy trousers and a long shirt, his face was almost completely hidden by his head cloth. he did not speak or look at them.

  • Audio

    for his or her intelligence. that goes for the animals as well as the people. everything that happens to them is explained to us.

  • Audio

    arabia where the greatest horses in the world were bred!

  • Audio

    chapter three of terror at the zoo by peg kehret.

  • Audio

    jack pushed his glasses into place. who was going to believe any

Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

e5739feb-179a-41f0-92d9-32b6d39334d5

a6b73806-3234-4340-a004-46a1947216e5