12.6 Hours - Chinese Mandarin Synthesis Corpus-Female, Customer Service, Conversational Speech

tts
conversational speech
female
customer service

12.6 Hours - Chinese Mandarin Synthesis Corpus-Female, Customer Service, Conversational Speech. It is recorded by native Chinese speakers with sweet voices. A professional phonetician participates in the annotation. It closely matches the research and development needs of speech synthesis.

Paid Datasets
This is a paid dataset for commercial use, research purposes, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Format: 48,000 Hz, 24-bit, uncompressed WAV, mono channel
Recording environment: professional recording studio
Recording content: natural dialogue in simulated scenes
Speaker: female, 20-30 years old, sweet voice
Device: microphone
Language: Mandarin
Annotation: word and Pinyin transcription, prosodic boundary annotation
Application scenarios: speech synthesis
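As a rough illustration of how the audio format above could be checked on a delivered file, here is a minimal sketch using Python's standard `wave` module. The function names and the `EXPECTED` dictionary are hypothetical and are not part of the dataset. It assumes plain PCM WAV headers; files with WAVE_FORMAT_EXTENSIBLE headers may need Python 3.12 or later to open.

```python
import wave

# Expected header values, taken from the specification list
# (48,000 Hz, 24-bit = 3 bytes per sample, mono).
EXPECTED = {"sample_rate": 48000, "sample_width_bytes": 3, "channels": 1}


def read_wav_format(path):
    """Return the basic header fields of a PCM WAV file as a dict."""
    with wave.open(path, "rb") as wav:
        return {
            "sample_rate": wav.getframerate(),
            "sample_width_bytes": wav.getsampwidth(),
            "channels": wav.getnchannels(),
        }


def matches_spec(path):
    """Return True if the file header is 48 kHz, 24-bit, mono."""
    return read_wav_format(path) == EXPECTED
```

Calling `matches_spec("some_file.wav")` on a file from the corpus would be expected to return `True` if the file follows the listed format; the file name here is only a placeholder.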
Sample
  • Audio

    有#1二十#1块钱#1一个#1G#4
    you3 er4 shi2 kuai4 qian2 yi2 ge5 gl4

  • Audio

    诶#1您好#1很高兴#1为您#1服务#4
    ei4 nin2 hao3 hen3 gao1 xing4 wei4 nin2 fu2 wu4

  • Audio

    您#1现在#1使用的呢#1是#1十块钱#1两百兆的#4
    nin2 xian4 zai4 shi3 yong4 de5 ne5 shi4 shi2 kuai4 qian2 liang6 bai3 zhao4 de5

  • Audio

    哦#1就是#1想改个#1流量#1更多#1一些的#1是吧#4?
    o5 jiu4 shi4 xiang6 gai3 ge5 liu2 liang4 geng4 duo1 yi4 xie1 de5 shi4 ba5

  • Audio

    嗯#1您是说#1想变更#1套餐#1对吗#4
    ng5 nin2 shi4 shuo1 xiang3 bian4 geng1 tao4 can1 dui4 ma5
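Each sample pairs a text transcription carrying prosodic boundary marks (`#1`, `#4`) with a Pinyin transcription whose syllables end in a tone digit. The sketch below shows one possible way to split both parts into structured pairs. The function names are hypothetical, the input is assumed to be the text and the Pinyin already separated into two strings, and the meaning of each boundary level is not stated on this page, so the code only records the digit.

```python
import re

BOUNDARY = re.compile(r"#(\d)")          # prosodic boundary mark such as #1 or #4
SYLLABLE = re.compile(r"^([a-z]+)(\d)$")  # Pinyin syllable followed by a tone digit


def split_prosody(text):
    """Split annotated text into (chunk, boundary_level) pairs plus any trailing text."""
    # re.split with one capture group yields [chunk, level, chunk, level, ..., tail].
    parts = BOUNDARY.split(text)
    pairs = [(parts[i], int(parts[i + 1])) for i in range(0, len(parts) - 1, 2)]
    return pairs, parts[-1]


def parse_pinyin(line):
    """Split a Pinyin line into (syllable, tone) pairs; tone is None if no digit is found."""
    result = []
    for token in line.split():
        match = SYLLABLE.match(token)
        if match:
            result.append((match.group(1), int(match.group(2))))
        else:
            result.append((token, None))
    return result
```

For example, `split_prosody("套餐#1对吗#4")` returns `([("套餐", 1), ("对吗", 4)], "")`, and `parse_pinyin("tao4 can1")` returns `[("tao", 4), ("can", 1)]`.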

Recommended Datasets
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus

4 People - Northeastern dialect Average Tone Speech Synthesis Corpus. It is recorded by native speakers from Northeast China. About 40% of the corpus contains words unique to Northeast China, and the phonemes and tones are balanced. A professional phonetician participates in the annotation. It closely matches the research and development needs of speech synthesis.

Synthesis Corpus, TTS, Average Tone, General, Northeast Dialect
2 People - Japanese Average Tone Speech Synthesis Corpus

2 People - Japanese Average Tone Speech Synthesis Corpus. It is recorded by native Japanese speakers with authentic accents. It contains a general corpus in news and colloquial styles, and the phoneme coverage is balanced. A professional phonetician participates in the annotation. It closely matches the research and development needs of speech synthesis.

TTS, Japanese, Average Tone
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female

10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female. It is recorded in the Chaozhou-Shantou pronunciation. The phonemes and tones are balanced. A professional phonetician participates in the annotation. It closely matches the research and development needs of speech synthesis.

Synthesis Corpus, TTS, Female, General, Chaozhou Dialect
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus

2 People - Mexican Spanish Average Tone Speech Synthesis Corpus. It is recorded by native Mexican speakers with authentic accents, covering both customer service and general styles. The phoneme coverage is balanced. A professional phonetician participates in the annotation. It closely matches the research and development needs of speech synthesis.

TTS, Mexican Spanish, Average Tone
2 People - Spanish Average Tone Speech Synthesis Corpus

2 People - Spanish Average Tone Speech Synthesis Corpus. It is recorded by native speakers from Spain with authentic accents. The phoneme coverage is balanced. A professional phonetician participates in the annotation. It closely matches the research and development needs of speech synthesis.

TTS, Spanish, Average Tone
2 People - New Zealand English Average Tone Speech Synthesis Corpus

2 People - New Zealand English Average Tone Speech Synthesis Corpus. It is recorded by native New Zealanders with authentic accents. The phoneme coverage is balanced. A professional phonetician participates in the annotation. It closely matches the research and development needs of speech synthesis.

TTS, New Zealand English, Average Tone
20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female

20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female. It is recorded in the Chengdu, Sichuan pronunciation. The phonemes and tones are balanced. A professional phonetician participates in the annotation. It closely matches the research and development needs of speech synthesis.

Synthesis Corpus, TTS, Female, General, Sichuan Dialect
10 People - British English Average Tone Speech Synthesis Corpus

10 People - British English Average Tone Speech Synthesis Corpus. It is recorded by native British English speakers with authentic accents. The phoneme coverage is balanced. A professional phonetician participates in the annotation. It closely matches the research and development needs of speech synthesis.

TTS, British English, Average Tone