20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female

Synthesis Corpus TTS Female General Sichuan Dialect

A 20-hour Sichuan dialect speech synthesis corpus recorded by a female speaker with Chengdu (Sichuan) pronunciation. The phonemes and tones are balanced, and professional phoneticians took part in the annotation. The corpus is well suited to speech synthesis research and development.

Paid Datasets
This is a paid dataset for commercial use, research, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Format: 48,000 Hz, 24-bit, uncompressed WAV, mono channel
Recording environment: professional recording studio
Recording content: general corpus
Speaker: professional voice actor, female, 20-30 years old
Device: microphone
Language: Sichuan dialect
Annotation: word and phoneme transcription, prosodic boundary annotation
Application scenarios: speech synthesis
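
As a quick sanity check on delivered audio, the Format entry above can be compared against a file's WAV header. The following is a minimal sketch using Python's standard `wave` module; the function name `check_wav_format` and the `EXPECTED` table are hypothetical and not part of the dataset. Note that 24-bit files stored with an extensible WAV header may be rejected by `wave` on Python versions before 3.12.

```python
import wave

# Expected header values, taken from the Format entry in the specifications.
EXPECTED = {
    "sample_rate_hz": 48_000,
    "sample_width_bytes": 3,  # 24 bit = 3 bytes per sample
    "channels": 1,            # mono
}


def check_wav_format(path):
    """Return a list of mismatches; an empty list means the header matches."""
    with wave.open(path, "rb") as wav:
        actual = {
            "sample_rate_hz": wav.getframerate(),
            "sample_width_bytes": wav.getsampwidth(),
            "channels": wav.getnchannels(),
        }
    return [
        f"{key}: expected {want}, found {actual[key]}"
        for key, want in EXPECTED.items()
        if actual[key] != want
    ]
```

Calling `check_wav_format("some_file.wav")` on each file (the path here is a placeholder) gives a list that can be logged or used to reject files that were resampled or converted on the way.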
Sample
  • Audio

    哎呀#3,桂花#1超级#1漂亮嘞#4!
    ai1 ya4 gui3 hua1 cao2 ji4 piao3 liang1 lei2

  • Audio

    嘿嘿#3,那我们#2又能#1愉快嘞#1玩耍啦#4!
    hei1 hei1 la3 ngo1 men1 you3 len1 yu1 kuai3 lei1 wan1 sua1 la1

  • Audio

    嘿#3,你在#1那儿#2做啥子#4。
    hei4 ni1 zai4 lar2 zu4 sa3 zi4

  • Audio

    因为#1他#1张嘴后#2牙齿#1正好#1与#1身后#1储物箱#1上嘞#1图案#1完美#1重合#4。
    yin2 wei1 ta1 zang2 zui1 hou3 ya1 ci2 zen3 hao1 yu1 sen2 hou4 cu1 wu4 xiang2 sang3 lei1 tu4 an4 wan3 mei1 cong4 ho4

  • Audio

    陀螺嘞#1大小#2要看#1木轴嘞#1粗细#4。
    to1 lo1 lei1 da3 xiao4 yao2 kan4 mu1 zou1 lei1 cu2 xi3
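
Each sample above pairs annotated text with a syllable transcription. A small parser can separate the two, as sketched below. This is only an illustration under assumptions: that each `#N` marker is a prosodic boundary label placed after a word or phrase, that each transcription token is a romanized syllable ending in a single tone digit, and that the transcription starts at the first ASCII letter. The names `ParsedSample` and `parse_sample` are hypothetical, and the dataset's actual delivery format may differ.

```python
import re
from dataclasses import dataclass


@dataclass
class ParsedSample:
    text: str         # sentence with the #N markers removed
    boundaries: list  # boundary labels in order of appearance, e.g. [3, 1, 1, 4]
    syllables: list   # (syllable, tone) pairs, e.g. [("ai", 1), ...]


def parse_sample(line):
    """Split one sample into plain text, boundary labels, and syllables."""
    # Assumption: the transcription begins at the first ASCII letter.
    start = re.search(r"[A-Za-z]", line)
    if start is None:
        raise ValueError("no transcription found in sample")
    annotated = line[:start.start()].strip()
    tokens = line[start.start():].split()

    boundaries = [int(level) for level in re.findall(r"#(\d)", annotated)]
    text = re.sub(r"#\d", "", annotated)

    syllables = []
    for token in tokens:
        parts = re.fullmatch(r"([a-z]+)(\d)", token)
        if parts is None:
            raise ValueError(f"unexpected transcription token: {token!r}")
        syllables.append((parts.group(1), int(parts.group(2))))
    return ParsedSample(text, boundaries, syllables)
```

The function accepts the annotated text and the transcription joined into one string, with or without whitespace between them. The number of syllables is not guaranteed to equal the number of characters; for instance, 那儿 is transcribed as the single syllable `lar2` in the third sample.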

Recommended Datasets
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus

A Northeastern dialect average-tone speech synthesis corpus recorded by four native speakers from Northeast China. About 40% of the corpus contains words unique to Northeast China, and the phonemes and tones are balanced. Professional phoneticians took part in the annotation. The corpus is well suited to speech synthesis research and development.

Synthesis Corpus TTS Average Tone General Northeast Dialect
2 People - Japanese Average Tone Speech Synthesis Corpus

A Japanese average-tone speech synthesis corpus recorded by two native Japanese speakers with authentic accents. It contains a general corpus in news and colloquial styles, with balanced phoneme coverage. Professional phoneticians took part in the annotation. The corpus is well suited to speech synthesis research and development.

TTS Japanese Average Tone
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female

A 10-hour Chaozhou dialect speech synthesis corpus recorded by a female speaker with Chaozhou-Shantou pronunciation. The phonemes and tones are balanced, and professional phoneticians took part in the annotation. The corpus is well suited to speech synthesis research and development.

Synthesis Corpus TTS Female General Chaozhou Dialect
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus

A Mexican Spanish average-tone speech synthesis corpus recorded by two native Mexican speakers with authentic accents, covering both customer service and general styles. The phoneme coverage is balanced, and professional phoneticians took part in the annotation. The corpus is well suited to speech synthesis research and development.

TTS Mexican Spanish Average Tone
2 People - Spanish Average Tone Speech Synthesis Corpus

A Spanish average-tone speech synthesis corpus recorded by two native speakers from Spain with authentic accents. The phoneme coverage is balanced, and professional phoneticians took part in the annotation. The corpus is well suited to speech synthesis research and development.

TTS Spanish Average Tone
2 People - New Zealand English Average Tone Speech Synthesis Corpus

A New Zealand English average-tone speech synthesis corpus recorded by two native New Zealand speakers with authentic accents. The phoneme coverage is balanced, and professional phoneticians took part in the annotation. The corpus is well suited to speech synthesis research and development.

TTS New Zealand English Average Tone
10 People - British English Average Tone Speech Synthesis Corpus

A British English average-tone speech synthesis corpus recorded by ten native British English speakers with authentic accents. The phoneme coverage is balanced, and professional phoneticians took part in the annotation. The corpus is well suited to speech synthesis research and development.

TTS British English Average Tone
12 Hours - Chinese Mandarin Synthesis Corpus - Female, Entertainment Anchor Style, Multi-emotional

A 12-hour Chinese Mandarin multi-emotional synthesis corpus in an entertainment anchor style, recorded by a native Chinese speaker. It covers text in six emotions plus modal particles, and the phonemes and tones are balanced. Professional phoneticians took part in the annotation. The corpus is well suited to speech synthesis research and development.

Chinese Emotional Multi-emotional TTS Synthesis Corpus