en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

12 Hours - Chinese Mandarin Synthesis Corpus-Female, Entertainment anchor Style, Multi-emotional

Chinese
Emotional
Multi-emotional
tts
Synthesis
Corpus

12 Hours - Chinese Mandarin Entertainment anchor Style Multi-emotional Synthesis Corpus. It is recorded by Chinese native speaker. six emotional text+modal particles, phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
48,000Hz, 24bit, uncompressed wav, mono channel
Recording environment
professional recording studio
Recording content
seven emotions (happiness, anger, sadness, surprise, fear, disgust)+sentences with filler word
Speaker
professional CharacterVoice; Role: An 18-year-old girl who works as an entertainment anchor and enjoys singing and dancing
Device
microphone
Language
Mandarin
Annotation
word and pinyin transcription, prosodic boundary annotation, phoneme boundary annotation
The amount of data
The amount of neutral data is not less than 1.6 hours; the amount of data with filler word is not less than 0.4 hours; and the remaining six types of emotional data is not less than 1.67 hours each
Sample Sample
  • Audio

    希望#1能够#1呼吸#1新鲜#1空气#3而不是#1被污染#1物质#1包裹着#4。xi1 wang4 neng2 gou4 hu1 xi1 xin1 xian1 kong1 qi4 er2 bu2 shi4 bei4 wu1 ran3 wu4 zhi4 bao1 guo3 zhe5

  • Audio

    请不要#1太过分#3,我是#1有#1边界的#4。qing3 bu2 yao4 tai4 guo4 fen4 wo3 shi4 you3 bian1 jie4 de5

  • Audio

    我#1找不到#1任何#1颜色#1和#1乐趣#4。wo6 zhao3 bu2 dao4 ren4 he2 yan2 se4 he2 le4 qu4

  • Audio

    跟着#1我的#1节奏#2一起#1舞动吧#4!gen1 zhe5 wo3 de5 jie2 zou4 yi4 qi6 wu3 dong4 ba5

  • Audio

    仿佛有#1一只手#3正从#1我的#1后背#1伸出来#4。fang3 fu2 you3 yi4 zhi1 shou3 zheng4 cong2 wo3 de5 hou4 bei4 shen1 chu1 lai5

Recommended DatasetsRecommended Dataset
26.1 Hours - Chinese Mandarin Synthesis Corpus-Female, Customer Service

26.1 Hours - Chinese Mandarin Synthesis Corpus-Female, Customer Service, It is recorded by Chinese native speakers, with lively and frindly voice. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Mandarin Female Customer Service
6.78 Hours - Chinese Mandarin Speech Synthesis Corpus-Female Imitating Children

Female audio data of adults imitating children, 6599 sentences in total and 6.78 hours. It is recorded by Chinese native speakers, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Chinese Children
19.46 Hours - American English Speech Synthesis Corpus-Female

Female audio data of American English,. It is recorded by American English native speaker, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS American English Female
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus

4 People - Northeastern dialect Average Tone Speech Synthesis Corpus. It is recorded by Northeast native. About 40% of the corpus contains words unique to Northeast China, the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Average Tone General Northeast Dialect
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female

10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female. It is recorded by Chaozhou-Shantou Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Female General Chaozhou Dialect
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus

2 People - Mexican Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Mexican, with authentic accent, Covering both customer service and general styles. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Mexican Spanish Average Tone
2 People - Spanish Average Tone Speech Synthesis Corpus

2 People - Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Spaniard, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Spanish Average Tone
2 People - New Zealand English Average Tone Speech Synthesis Corpus

2 People - New Zealand English Average Tone Speech Synthesis Corpus. It is recorded by rn native New Zealanders, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS New Zealand English Average Tone
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

3bfb1903-600f-4e20-9b94-7e1a7d09ca99

ae110735-ac8d-4562-b6e0-272d906ad80a