en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

20 People - Chinese Mandarin Multi-emotional Synthesis Corpus

Chinese
Emotional
Multi-emotional
tts
Synthesis
Corpus

20 People - Chinese Mandarin Multi-emotional Synthesis Corpus. It is recorded by Chinese native speaker, covering different ages and genders. seven emotional texts, are all from novels and the syllables, phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
48,000Hz, 24bit, uncompressed wav, mono channel
Recording environment
professional recording studio
Recording content
seven emotions (happiness, anger, sadness, surprise, fear, disgust, neutral) ;texts are all from novels
Speaker
20 persons, different age groups and genders
Device
microphone
Language
Mandarin
Annotation
word and pinyin transcription, prosodic boundary annotation
Application scenarios
speech synthesis
The amount of data
The amount of data for per person is 140 minutes, each emotion is 20 minutes
Sample Sample
  • Audio

    我#1很好奇#3,是#1哪个#1男子#2竟能#1得到#1这等#1奇女子的#1芳心#4。wo6 hen3 hao4 qi2 shi4 na3 ge5 nan2 zi3 jing4 neng2 de2 dao4 zhe4 deng3 qi2 nv6 zi3 de5 fang1 xin1

  • Audio

    这好哇#3,有#1视频#1有#1真相#3,我#1支持#3,我#1狂支持#4!zhe4 hao3 wa5 you3 shi4 pin2 you3 zhen1 xiang4 wo3 zhi1 chi2 wo3 kuang2 zhi1 chi2

  • Audio

    当初#3,满侯府的人#1都#1欺辱你#3。只有#1我待#1二郎#3是#1真心#1真意的#4。dang1 chu1 man3 hou2 fu3 de5 ren2 dou1 qi1 ru6 ni3 zhi6 you6 wo3 dai4 er4 lang2 shi4 zhen1 xin1 zhen1 yi4 de5

  • Audio

    我#1只能#1出到#1一人#1一万#3,多一分#1都#1没有#3,你们#1爱找谁#1找谁#4。wo6 zhi3 neng2 chu1 dao4 yi4 ren2 yi2 wan4 duo1 yi4 fen1 dou1 mei2 you3 ni3 men5 ai4 zhao3 shei2 zhao3 shei2

Recommended DatasetsRecommended Dataset
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus

4 People - Northeastern dialect Average Tone Speech Synthesis Corpus. It is recorded by Northeast native. About 40% of the corpus contains words unique to Northeast China, the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Average Tone General Northeast Dialect
2 People - Japanese Average Tone Speech Synthesis Corpus

2 People - Japanese Average Tone Speech Synthesis Corpus. It is recorded by rn native Japan, with authentic accent. Contains news and colloquial style general corpus,the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Japanese Average Tone
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female

10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female. It is recorded by Chaozhou-Shantou Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Female General Chaozhou Dialect
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus

2 People - Mexican Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Mexican, with authentic accent, Covering both customer service and general styles. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Mexican Spanish Average Tone
2 People - Spanish Average Tone Speech Synthesis Corpus

2 People - Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Spaniard, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Spanish Average Tone
2 People - New Zealand English Average Tone Speech Synthesis Corpus

2 People - New Zealand English Average Tone Speech Synthesis Corpus. It is recorded by rn native New Zealanders, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS New Zealand English Average Tone
20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female

20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female. It is recorded by Chengdu Sichuan Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Female General Sichuan Dialect
10 People - British English Average Tone Speech Synthesis Corpus

10 People - British English Average Tone Speech Synthesis Corpus. It is recorded by British English native speakers, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS British English Average Tone
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

cfab4573-0361-4958-b9e0-18aef4edc63c

f339ebae-0856-4556-a923-6b1d43ffb905