en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

10 People - British English Average Tone Speech Synthesis Corpus

TTS
British English
Average Tone

10 People - British English Average Tone Speech Synthesis Corpus. It is recorded by British English native speakers, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
48,000Hz, 24bit, uncompressed wav, mono channel;
Recording environment
professional recording studio;
Recording content
general narrative sentences, interrogative sentences, etc;
Speaker
british native speaker, 5 male and 5 female, 2 hours per person;
Device
microphone;
Language
British English;
Annotation
word and phoneme transcription, four-level prosodic boundary annotation;
Application scenarios
speech synthesis.
Sample Sample
  • Audio

    Carville was a Louisiana Cajun and ex marine/ who had- a great strategic sense%.K AA1 . V IH2 L / W AA1 Z / AX0 / L UW0 . IY1 . Z IY0 . AE2 . N AX0 / K EY1 . JH AX0 N / AX0 N D / EH1 K S / M AX0 . R IY1 N / HH UW1 / HH AE1 D / AX0 / G R EY1 T3 / S T R AX0 . T IY1 . JH IH0 K / S EH1 N S

  • Audio

    I don't feel very well%.AY1 / D OW1 N T3 / F IY1 L / V EH1 . R IY0 / W EH1 L

  • Audio

    No American President had been there- in/ twenty years%.N OW13 / AX0 . M EH1 . R AX0 . K AX0 N / P R EH1 . Z AX0 . D EH2 N T / HH AE1 D / B IH1 N / DH EH1 R / IH0 N / T W EH1 N . T IY0 / Y IH1 R Z

  • Audio

    I don't/ like to speak- about/ things which I don't understand%.AY1 / D OW1 N T / L AY1 K / T UW1 / S P IY1 K3 / AX0 . B AW1 T / TH IH1 NG Z / W IH1 CH / AY1 / D OW1 N T / AH2 N . D AX0 . S T AE1 N D

Recommended DatasetsRecommended Dataset
42 People - Chinese Mandarin Multi-emotional Synthesis Corpus

22 People - Chinese Mandarin Multi-emotional Synthesis Corpus. It is recorded by Chinese native speaker, covering different ages and genders. seven emotional text, and the syllables, phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Chinese Emotional Multi-emotional tts Synthesis Corpus
10.4 Hours - Chinese Mandarin Synthesis Corpus-Female, Customer Service

10.4 Hours - Chinese Mandarin Synthesis Corpus-Female, Customer Service, It is recorded by Chinese native speakers, with sweet voice. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Mandarin Female Customer Service
12 Hours - Chinese Mandarin Synthesis Corpus-Female, Entertainment anchor Style, Multi-emotional

12 Hours - Chinese Mandarin Entertainment anchor Style Multi-emotional Synthesis Corpus. It is recorded by Chinese native speaker. six emotional text+modal particles, phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Chinese Emotional Multi-emotional tts Synthesis Corpus
20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female

20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female. It is recorded by Chengdu Sichuan Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Female General Sichuan Dialect
2 People - New Zealand English Average Tone Speech Synthesis Corpus

2 People - New Zealand English Average Tone Speech Synthesis Corpus. It is recorded by rn native New Zealanders, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS New Zealand English Average Tone
2 People - Spanish Average Tone Speech Synthesis Corpus

2 People - Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Spaniard, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Spanish Average Tone
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus

2 People - Mexican Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Mexican, with authentic accent, Covering both customer service and general styles. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Mexican Spanish Average Tone
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female

10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female. It is recorded by Chaozhou-Shantou Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Female General Chaozhou Dialect
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

e0645358-f0c7-4120-96a4-1b1fc75c6bc2

1dc982a7-a5fb-443f-a868-1be7bb4bc2cc