en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

50 People - Chinese-English Mixed Average Tone Speech Synthesis Corpus-Customer Service

TTS
Average Tone
Synthesis Corpus
Customer Service

50 People - Chinese-English Mixed Average Tone Speech Synthesis Corpus-Customer Service. It is recorded by Chinese native speakers,customer service text, and the syllables, phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
48,000Hz, 16bit, uncompressed wav, mono channel;
Recording environment
professional recording studio;
Recording content
customer service text, and the syllables, phonemes and tones are balanced;
Speaker
50 speakers totally, with 50% male and 50% female;
Device
microphone;
Language
Chinese/English mixed;
Annotation
word and Pinyin transcription, four-level prosodic boundary annotation;
Application scenarios
speech synthesis.
Sample Sample
  • Audio

    164002fgf只能#1给您#1核实#3,我们#1这side#1不能#1提供#4。fgfzhi3 neng2 gei3 nin2 he2 shi2 wo3 men5 zhe4 / S AY1 D / bu4 neng2 ti2 gong1

  • Audio

    200012fgf这个的话#1您就#1关注#1您的#1APP嘛#4。fgfzhe4 ge5 de5 hua4 nin2 jiu4 guan1 zhu4 nin2 de5 / AH0 . P IY1 . P IY1 / ma5

  • Audio

    172005fgf大车#1很少#1跑的#1路段#3,是#2five折#1优惠#4。fgfda4 che1 hen2 shao2 pao3 de5 lu4 duan4 shi4 F AY1 V zhe2 you1 hui4

  • Audio

    172010fgfYou#1不要#1选择#1那个#1信什么#3,就是#1需要#1保证金的#1那个#4。fgfY UW1 / bu2 yao4 xuan3 ze2 na4 ge5 xin4 shen2 me5 jiu4 shi4 xu1 yao4 bao3 zheng4 jin1 de5 na4 ge5

  • Audio

    086001fgfYou是在#1朝阳区#1附近的吗#4?fgfY UW1 / shi4 zai4 chao2 yang2 qu1 fu4 jin4 de5 ma5

Recommended DatasetsRecommended Dataset
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus

4 People - Northeastern dialect Average Tone Speech Synthesis Corpus. It is recorded by Northeast native. About 40% of the corpus contains words unique to Northeast China, the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Average Tone General Northeast Dialect
2 People - Japanese Average Tone Speech Synthesis Corpus

2 People - Japanese Average Tone Speech Synthesis Corpus. It is recorded by rn native Japan, with authentic accent. Contains news and colloquial style general corpus,the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Japanese Average Tone
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female

10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female. It is recorded by Chaozhou-Shantou Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Female General Chaozhou Dialect
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus

2 People - Mexican Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Mexican, with authentic accent, Covering both customer service and general styles. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Mexican Spanish Average Tone
2 People - Spanish Average Tone Speech Synthesis Corpus

2 People - Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Spaniard, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Spanish Average Tone
2 People - New Zealand English Average Tone Speech Synthesis Corpus

2 People - New Zealand English Average Tone Speech Synthesis Corpus. It is recorded by rn native New Zealanders, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS New Zealand English Average Tone
20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female

20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female. It is recorded by Chengdu Sichuan Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Female General Sichuan Dialect
10 People - British English Average Tone Speech Synthesis Corpus

10 People - British English Average Tone Speech Synthesis Corpus. It is recorded by British English native speakers, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS British English Average Tone
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

d4d9a0a1-e180-4956-9fd2-d339bd3023ed

811fff20-a478-4852-8986-a286e211e69b