en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

Speech Synthesis Datasets

Instantly enhance AI model performance with high quality off-the-shelf datasets.

Voice Type

All Female Male Children Emotion Customer Service Tone Quality Average Tone Song Front-end Text Others

Language

All Mandarin Chinese Dialects English Japanese Chinese‐English Code‐mixing Others

2 People - New Zealand English Average Tone Speech Synthesis Corpus

2 People - New Zealand English Average Tone Speech Synthesis Corpus. It is recorded by rn native New Zealanders, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTS New Zealand English Average Tone

20 Hours - American English Speech Synthesis Corpus-Male

Male audio data of American English. It is recorded by American English native speakers, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTS American English Male

2 People - Spanish Average Tone Speech Synthesis Corpus

2 People - Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Spaniard, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTS Spanish Average Tone

2 People - Mexican Spanish Average Tone Speech Synthesis Corpus

2 People - Mexican Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Mexican, with authentic accent, Covering both customer service and general styles. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTS Mexican Spanish Average Tone

19.46 Hours - American English Speech Synthesis Corpus-Female

Female audio data of American English,. It is recorded by American English native speaker, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTS American English Female

38 People - Hong Kong Cantonese Average Tone Speech Synthesis Corpus

38 People - Hong Kong Cantonese Average Tone Speech Synthesis Corpus, It is recorded by Hong Kong native speakers. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Synthesis Corpus TTS Female General Male english

10 People - British English Average Tone Speech Synthesis Corpus

10 People - British English Average Tone Speech Synthesis Corpus. It is recorded by British English native speakers, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTS British English Average Tone

50 People - Chinese-English Mixed Average Tone Speech Synthesis Corpus-Customer Service

50 People - Chinese-English Mixed Average Tone Speech Synthesis Corpus-Customer Service. It is recorded by Chinese native speakers,customer service text, and the syllables, phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTS Average Tone Synthesis Corpus Customer Service

29.4 Hours - Chinese Mandarin Synthesis Corpus-Female, General

Chinese Mandarin Synthesis Corpus-Female, General. It is recorded by Chinese native speaker. It covers oral sentences, audio books, news, advertising, customer service and movie commentary, and the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Synthesis Corpus TTS Mandarin Female General combination fusion union amalgamation blend amalgam composite mixture compound deduction alloy coalescence integration unification deductive reasoning merging blending incorporation consolidation merger organization marriage coalition formation organism combining composition integrating syntheses alliance meld synthetic aggregate assimilation fusing harmony joining mingling organisation uniting admixture agreement assembly centralization concoction creation federation liquification mix synthesized

loading

Tailor Your Data Now

Why off-the-shelf Datasets

  • Copyright

    Copyright

    Clear Coyright and Ready to Check
  • Security

    Security

    Properly Authorized Secure to Use
  • Professional

    Professional

    Designed and produced by AI data experts
  • Diversity

    Diversity

    Collected from a varity of real scenes
  • Cost Effective

    Cost Effective

    More Cost-Efficient Than Tailored Data
  • Efficiency

    Efficiency

    Ready-To-Go Deliver in Seconds
ae8fee64-c65d-4dbc-8003-540c3ca04578