en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

m.nexdata.datatang.com

Speech Synthesis Datasets

Instantly enhance AI model performance with high quality off-the-shelf datasets.

Voice Type

All
34
Average Tone
21
Emotion
3
Female
8
Front-end Text
3
Male
2
Others
2

Language

All
34
Chinese Dialects
10
Chinese-English Code-mixing
1
English
8
Japanese
2
Mandarin
15
Others
6

20 Hours - American English Male Voice TTS Dataset

This dataset contains 20 hours of American English male voice recordings. It is recorded by Americans (native English speakers) with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It is suitable for text-to-speech (TTS) model training, phoneme recognition research, and AI voice development.
TTS english dataset speech synthesis dataset TTS male voice dataset male voice dataset for tts American English speech synthesis dataset

19.46 Hours - American English Female Voice TTS Dataset

This dataset contains 19.46 hours of American English female voice recordings. It is recorded by American (native English speaker) with authentic accent and clear, sweet tone. The phoneme coverage is balanced. Professional phoneticians participate in the annotation. It is suitable for text-to-speech (TTS) model training, phoneme recognition, and AI voice development requiring natural-sounding female speech.
American English speech synthesis dataset female voice dataset for TTS American English female voice corpus speech synthesis training data female TTS dataset American English female speaker speech synthesis dataset TTS english dataset

10.4 Hours – Japanese Female Voice TTS Dataset

This dataset contains 10.4 hours of Japanese female voice recordings. It is recorded by Japanese native speaker with an authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. This corpus is ideal for tasks such as Japanese text-to-speech (TTS) training, speech synthesis research, and AI voice model development.
Japanese speech synthesis dataset Japanese tts dataset Japanese text-to-speech dataset female female japanese tts dataset

2 Speakers – Australian English TTS Dataset (Native Accent)

This dataset features recordings from 2 native Australian English speakers with authentic accents. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Australian English TTS dataset Australian speech dataset for AI Australian accent speech dataset Australian text to speech voices multi-speaker Australian English dataset Australian English phoneme balanced dataset

2 Speakers – Spanish TTS Dataset with Native Castilian Accent

This dataset includes recordings from 2 native Spanish speakers with authentic Castilian accents. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Spanish speech dataset for TTS Spanish text to speech dataset Spanish voice dataset for AI models native Spanish accent dataset Castilian Spanish TTS dataset Spanish speech synthesis dataset

3 Speakers – Italian TTS Dataset with Native Accent

This dataset includes recordings from 3 native Italian speakers with authentic accents. Covering both customer service and general speaking styles. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Italian speech dataset for TTS Italian text to speech dataset Italian voice dataset for AI Italian accent speech dataset multi-speaker Italian TTS dataset Italian TTS dataset

10 Speakers – British English TTS Dataset with Authentic Accent

This dataset contains recordings from 10 native British English speakers with an authentic UK accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the text-to-speech (TTS) systems and AI voice synthesis models.
British English speech synthesis dataset British English voice dataset for TTS British accent speech corpus UK English speech dataset female male natural British English voice dataset British English tts dataset

2 Speakers - Canadian French TTS Dataset (Native Accent)

This dataset contains recordings from 2 native Canadian French speakers with authentic accents. It is ideal for researchers and developers seeking natural Canadian French voices.
Canadian French TTS dataset Canadian French speech dataset for AI Canadian French accent speech corpus Canadian French text to speech voices Canadian French speech dataset

Cantonese TTS Dataset – 4 Native Speakers, 20+ Hours

This Cantonese speech synthesis corpus includes recordings from 4 native speakers of Guangdong. The corpus contain educational, game and general colloquial content. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Cantonese audio dataset Cantonese TTS dataset native Cantonese speech recordings Cantonese voice dataset for AI Cantonese speech dataset

loading

Tailor Your Data Now

Why off-the-shelf Datasets

  • Copyright

    Copyright

    Clear Coyright and Ready to Check
  • Security

    Security

    Properly Authorized Secure to Use
  • Professional

    Professional

    Designed and produced by AI data experts
  • Diversity

    Diversity

    Collected from a varity of real scenes
  • Cost Effective

    Cost Effective

    More Cost-Efficient Than Tailored Data
  • Efficiency

    Efficiency

    Ready-To-Go Deliver in Seconds
910c26ac-7e4c-4812-823a-63e41bd1f74d