en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

19.46 Hours - American English Speech Synthesis Corpus-Female

TTS
American English
Female

Female audio data of American English,. It is recorded by American English native speaker, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
44,100Hz, 16bit, uncompressed wav, mono channel.
Recording environment
professional recording studio.
Recording content
general narrative sentences, interrogative sentences, etc. 19,841 sentences,
Speaker
native speaker of American English, female.
Annotation Feature
word transcription, part-of-speech, phoneme boundary, four-level accents, four-level prosodic boundary.
Device
Microphone
Language
American English
Application scenarios
speech synthesis
Sample Sample
  • Audio

    Is- it|really take time?$VERB PRON ADV VERB NOUNIH1 Z/IH1 T/R IH1 L IY03/T EY1 K/T AY1 M/

  • Audio

    Was he|start a bird|as he walk$in the wood?$VERB PRON VERB DET NOUN CONJ PRON VERB ADP DET NOUNW AA1 Z/HH IY1/S T* AA1 R T3-1/AX0/B ER1 D3/AE1 Z/HH IY1/W AO1 K3/IH0 N/DH AX1/W UH1 D/

  • Audio

    Look$at the way|they hear operas$and see oil paintings.$VERB ADP DET NOUN PRON VERB NOUN CONJ VERB NOUN NOUNL UH1 K/AE1 T-1/DH AX1/W EY13/DH EY1/HH IH1 R/AA1 P R AX0 Z3/AE1 N D-1/S IY13/OY1 L/P EY1 N T AX0 NG Z/

  • Audio

    The focus-|of this chapter$is the American revolution.$DET NOUN ADP DET NOUN VERB DET ADJ NOUNDH AX1/F OW1 K AX0 S/AH1 V/DH AX1 S/CH AE1 P T ER03/IH1 Z/DH AX1/AX0 M EH1 R AX0 K AX0 N/R EH2 V AX0 L UW1 SH AX0 N/

  • Audio

    Was$the young|birds|feather?$VERB DET ADJ NOUN NOUNW AA1 Z/DH AX1/Y AH1 NG3/B ER1 D Z/F EH1 DH ER0/

Recommended DatasetsRecommended Dataset
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus

4 People - Northeastern dialect Average Tone Speech Synthesis Corpus. It is recorded by Northeast native. About 40% of the corpus contains words unique to Northeast China, the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Average Tone General Northeast Dialect
2 People - Japanese Average Tone Speech Synthesis Corpus

2 People - Japanese Average Tone Speech Synthesis Corpus. It is recorded by rn native Japan, with authentic accent. Contains news and colloquial style general corpus,the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Japanese Average Tone
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female

10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female. It is recorded by Chaozhou-Shantou Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Female General Chaozhou Dialect
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus

2 People - Mexican Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Mexican, with authentic accent, Covering both customer service and general styles. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Mexican Spanish Average Tone
2 People - Spanish Average Tone Speech Synthesis Corpus

2 People - Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Spaniard, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Spanish Average Tone
2 People - New Zealand English Average Tone Speech Synthesis Corpus

2 People - New Zealand English Average Tone Speech Synthesis Corpus. It is recorded by rn native New Zealanders, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS New Zealand English Average Tone
20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female

20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female. It is recorded by Chengdu Sichuan Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Female General Sichuan Dialect
10 People - British English Average Tone Speech Synthesis Corpus

10 People - British English Average Tone Speech Synthesis Corpus. It is recorded by British English native speakers, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS British English Average Tone
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

88c837cd-5905-47b7-b520-a25b9ffb802e

fc4bbf07-c8f5-4c07-a9c5-6cd79fb4dc40