[{"@type":"PropertyValue","name":"Format","value":"48,000Hz, 24bit, uncompressed wav, mono channel;"},{"@type":"PropertyValue","name":"Recording environment","value":"professional recording studio;"},{"@type":"PropertyValue","name":"Recording content","value":"customer service and general;"},{"@type":"PropertyValue","name":"Speaker","value":"new zealanders, 1 male and 1 female;"},{"@type":"PropertyValue","name":"Annotation","value":"word and phoneme transcription, four-level prosodic boundary annotation;"},{"@type":"PropertyValue","name":"Device","value":"microphone;"},{"@type":"PropertyValue","name":"Language","value":"New Zealand English;"},{"@type":"PropertyValue","name":"Application scenarios","value":"speech synthesis."}]
{"id":1350,"datatype":"1","titleimg":"/shujutang/static/image/index/datatang_yuyin_default.webp","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"2 People - New Zealand English Average Tone Speech Synthesis Corpus","datazy":[{"title":"Format","value":"48,000Hz, 24bit, uncompressed wav, mono channel;"},{"title":"Recording environment","value":"professional recording studio;"},{"title":"Recording content","value":"customer service and general;"},{"title":"Speaker","value":"new zealanders, 1 male and 1 female;"},{"title":"Annotation","value":"word and phoneme transcription, four-level prosodic boundary annotation;"},{"title":"Device","value":"microphone;"},{"title":"Language","value":"New Zealand English;"},{"title":"Application scenarios","value":"speech synthesis."}],"datatag":"English,Tts,New Zealand English,Average Tone","technologydoc":null,"downurl":null,"datainfo":"","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY231106001_demo1706781602066/APY231106001_demo/000150.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=JMwrHHw7ZkncUcfJ2DDgEhAALmk%3D","/data/apps/damp/temp/ziptemp/APY231106001_demo1706781602066/APY231106001_demo/000150.wav","And poor Honey% was always- a man crazy fool/ with no more sense than- a guinea hen%.AX0 N D / P UH1 R / HH AH1 . N IY0 / W AA1 Z / AO1 L . W EY2 Z / AX0 / M AE1 N / K R EY1 . Z IY0 / F UW1 L / W IH1 DH / N OW13 / M AO1 / S IH1 N S / DH AE1 N / AX0 / G IH1 . N IY0 / HH EH1 N"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY231106001_demo1706781602066/APY231106001_demo/000500.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=AeoIyVaUp4KIN1meq7Nx9SwnwSY%3D","/data/apps/damp/temp/ziptemp/APY231106001_demo1706781602066/APY231106001_demo/000500.wav","But- he was not there% and she could not bring herself to ask for him%.B AH1 T / HH IY1 / W AA1 Z / N AA1 T3 / DH IH1 R / AX0 N D / SH IY1 / K UH1 D / N AA1 T / B R IH1 NG / HH AX0 . S EH1 L F / T UW1 / AA1 S K / F AO1 / HH IH1 M"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY231106001_demo1706781602066/APY231106001_demo/000520.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=oQgXhHvYTe0g4qND%2FOiXOmr2XX8%3D","/data/apps/damp/temp/ziptemp/APY231106001_demo1706781602066/APY231106001_demo/000520.wav","On the earth% if I fall from- a tree I will fall to the ground%.AA1 N / DH AX0 / ER1 TH / IH1 F / AY1 / F AO1 L3 / F R AH1 M / AX0 / T R IY1 / AY1 / W IH1 L / F AO1 L / T UW1 / DH AX0 / G R AW1 N D"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY231106001_demo1706781602066/APY231106001_demo/000203.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=brEJgqpVGTxzKw5dD0eR6Swi0H4%3D","/data/apps/damp/temp/ziptemp/APY231106001_demo1706781602066/APY231106001_demo/000203.wav","You have- applied for twelve installments/, returned% eight- installments/, and there- are still four- installments left%.Y UW1 / HH AE1 V / AX0 . P L AY1 D / F AO1 / T W EY1 L V / AX0 N . S T AO1 L . M AX0 N T S / R IH0 . T ER1 N D / EY1 T / AX0 N . S T AO1 L . M AX0 N T S / AE0 N D / DH EY1 R / AA1 / S T IH1 L / F AO1 R / AX0 N . S T AO1 L . M AX0 N T S / L EY1 F T3"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY231106001_demo1706781602066/APY231106001_demo/000134.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=AYur%2B88bJC9mwFYB%2FzcWfiH%2B1is%3D","/data/apps/damp/temp/ziptemp/APY231106001_demo1706781602066/APY231106001_demo/000134.wav","Send- us the bank statement% and we'll verify with the bank%!S EY1 N D / AH1 S / DH AX0 / B AE1 NG K / S T EY1 T . M AX0 N T / AE0 N D / W IY1 L3 / V EY1 . R AX0 . F AY2 / W AX1 DH / DH AX0 / B AE1 NG K"]],"officialSummary":"2 People - New Zealand English Average Tone Speech Synthesis Corpus. It is recorded by rn native New Zealanders, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.","dataexampl":"","datakeyword":["TTS","New Zealand English","Average Tone"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Voice Type,Language","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechSyn","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no"}
2 People - New Zealand English Average Tone Speech Synthesis Corpus
TTS
New Zealand English
Average Tone
2 People - New Zealand English Average Tone Speech Synthesis Corpus. It is recorded by rn native New Zealanders, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
48,000Hz, 24bit, uncompressed wav, mono channel;
Recording environment
professional recording studio;
Recording content
customer service and general;
Speaker
new zealanders, 1 male and 1 female;
Annotation
word and phoneme transcription, four-level prosodic boundary annotation;
Device
microphone;
Language
New Zealand English;
Application scenarios
speech synthesis.
Sample
Audio
And poor Honey% was always- a man crazy fool/ with no more sense than- a guinea hen%.AX0 N D / P UH1 R / HH AH1 . N IY0 / W AA1 Z / AO1 L . W EY2 Z / AX0 / M AE1 N / K R EY1 . Z IY0 / F UW1 L / W IH1 DH / N OW13 / M AO1 / S IH1 N S / DH AE1 N / AX0 / G IH1 . N IY0 / HH EH1 N
Audio
But- he was not there% and she could not bring herself to ask for him%.B AH1 T / HH IY1 / W AA1 Z / N AA1 T3 / DH IH1 R / AX0 N D / SH IY1 / K UH1 D / N AA1 T / B R IH1 NG / HH AX0 . S EH1 L F / T UW1 / AA1 S K / F AO1 / HH IH1 M
Audio
On the earth% if I fall from- a tree I will fall to the ground%.AA1 N / DH AX0 / ER1 TH / IH1 F / AY1 / F AO1 L3 / F R AH1 M / AX0 / T R IY1 / AY1 / W IH1 L / F AO1 L / T UW1 / DH AX0 / G R AW1 N D
Audio
You have- applied for twelve installments/, returned% eight- installments/, and there- are still four- installments left%.Y UW1 / HH AE1 V / AX0 . P L AY1 D / F AO1 / T W EY1 L V / AX0 N . S T AO1 L . M AX0 N T S / R IH0 . T ER1 N D / EY1 T / AX0 N . S T AO1 L . M AX0 N T S / AE0 N D / DH EY1 R / AA1 / S T IH1 L / F AO1 R / AX0 N . S T AO1 L . M AX0 N T S / L EY1 F T3
Audio
Send- us the bank statement% and we'll verify with the bank%!S EY1 N D / AH1 S / DH AX0 / B AE1 NG K / S T EY1 T . M AX0 N T / AE0 N D / W IY1 L3 / V EY1 . R AX0 . F AY2 / W AX1 DH / DH AX0 / B AE1 NG K
Recommended Dataset
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus. It is recorded by Northeast native. About 40% of the corpus contains words unique to Northeast China, the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
2 People - Japanese Average Tone Speech Synthesis Corpus
2 People - Japanese Average Tone Speech Synthesis Corpus. It is recorded by rn native Japan, with authentic accent. Contains news and colloquial style general corpus,the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSJapaneseAverage Tone
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female. It is recorded by Chaozhou-Shantou Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Synthesis CorpusTTSFemaleGeneralChaozhouDialect
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Mexican, with authentic accent, Covering both customer service and general styles. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSMexicanSpanishAverage Tone
2 People - Spanish Average Tone Speech Synthesis Corpus
2 People - Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Spaniard, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSSpanishAverage Tone
20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female
20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female. It is recorded by Chengdu Sichuan Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Synthesis CorpusTTSFemaleGeneralSichuanDialect
10 People - British English Average Tone Speech Synthesis Corpus
10 People - British English Average Tone Speech Synthesis Corpus. It is recorded by British English native speakers, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSBritish EnglishAverage Tone
12 Hours - Chinese Mandarin Synthesis Corpus-Female, Entertainment anchor Style, Multi-emotional
12 Hours - Chinese Mandarin Entertainment anchor Style Multi-emotional Synthesis Corpus. It is recorded by Chinese native speaker. six emotional text+modal particles, phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.