{"id":1139,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY211105001.png?Expires=2007353701&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=xI/sAoNk3zPW/XxhdVtDGbch4uU%3D","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"100 People - Chinese Mandarin Average Tone Speech Synthesis Corpus, General","datazy":[{"title":"Format","value":"48,000Hz, 16bit, uncompressed wav, mono channel;"},{"title":"Recording environment","value":"professional recording studio;"},{"title":"Recording content","value":"news, dialogue, audio books, poetry, advertising, news broadcasting, entertainment;"},{"title":"Speaker","value":"100 speakers totally, covering different ages and genders;"},{"title":"Device","value":"microphone;"},{"title":"Language","value":"Mandarin, English;"},{"title":"Annotation","value":"word and phoneme transcription, prosodic boundary annotation, phoneme boundary annotation;"},{"title":"Application scenarios","value":"speech synthesis."}],"datatag":"Synthesis Corpus,TTS,Mandarin,Mixed Speech with Chinese & English,Chinese,English,Average Tone Speech","technologydoc":null,"downurl":null,"datainfo":"","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY211105001_demo1706695203268/APY211105001_demo/007004.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=higsau8hRcLRggziJ7Rv27059nE%3D","/data/apps/damp/temp/ziptemp/APY211105001_demo1706695203268/APY211105001_demo/007004.wav","我到#1大连了#3睡啦#2有时间#1打电话哇#4wo3 dao4 da4 lian2 le5 shui4 la5 you3 shi2 jian1 da3 dian4 hua4 wa5"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY211105001_demo1706695203268/APY211105001_demo/005010.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=7VcmbkAPVXoAFr42f%2BooChmGm7E%3D","/data/apps/damp/temp/ziptemp/APY211105001_demo1706695203268/APY211105001_demo/005010.wav","庆祝#1一下#1JJ Cross#1七周年#4。qing4 zhu4 yi2 xia4 / JH EY1 . JH EY1 / K R AO1 S / qi1 zhou1 nian2"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY211105001_demo1706695203268/APY211105001_demo/009010.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=Vhwg7h5LaAeKka6P1SAhdt%2FSQfc%3D","/data/apps/damp/temp/ziptemp/APY211105001_demo1706695203268/APY211105001_demo/009010.wav","Bone Collector#3与#1Professor#3联手#1时的#1十佳球#4。B OY1 / K OW0 . L EH1 K . T ER0 / yu3 / P R AX0 . F AY1 . S ER0 / lian2 shou3 shi2 de5 shi2 jia1 qiu2"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY211105001_demo1706695203268/APY211105001_demo/002009.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=%2Fj0UCg5PvlHtVzvJZgRg7K11Tgk%3D","/data/apps/damp/temp/ziptemp/APY211105001_demo1706695203268/APY211105001_demo/002009.wav","How did your DNA/ get on that cord%?HH AW1 / D IH1 D / Y AO1 R / D IY1 . EH1 N . EY1 / G EH1 T / AO1 N / DH AE1 T / K AO1 R D"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY211105001_demo1706695203268/APY211105001_demo/001002.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=%2Bp3KD87wCanshj4MjQgFBlPt4cc%3D","/data/apps/damp/temp/ziptemp/APY211105001_demo1706695203268/APY211105001_demo/001002.wav","好山#1好水#1看株洲#3,养生#1养胃#1品红茶#4。hao3 shan1 hao6 shui3 kan4 zhu1 zhou1 yang3 sheng1 yang3 wei4 pin3 hong2 cha2"]],"officialSummary":"100 People - Chinese Mandarin Average Tone Speech Synthesis Corpus, General. It is recorded by Chinese native speaker. It covers news, dialogue, audio books, poetry, advertising, news broadcasting, entertainment; and the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.","dataexampl":"","datakeyword":["Synthesis Corpus","TTS","Female","General","Male"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Voice Type,Language","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechSyn","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no"}
100 People - Chinese Mandarin Average Tone Speech Synthesis Corpus, General
Synthesis Corpus
TTS
Female
General
Male
100 People - Chinese Mandarin Average Tone Speech Synthesis Corpus, General. It is recorded by Chinese native speaker. It covers news, dialogue, audio books, poetry, advertising, news broadcasting, entertainment; and the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus. It is recorded by Northeast native. About 40% of the corpus contains words unique to Northeast China, the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
2 People - Japanese Average Tone Speech Synthesis Corpus
2 People - Japanese Average Tone Speech Synthesis Corpus. It is recorded by rn native Japan, with authentic accent. Contains news and colloquial style general corpus,the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSJapaneseAverage Tone
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female. It is recorded by Chaozhou-Shantou Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Synthesis CorpusTTSFemaleGeneralChaozhouDialect
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Mexican, with authentic accent, Covering both customer service and general styles. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSMexicanSpanishAverage Tone
2 People - Spanish Average Tone Speech Synthesis Corpus
2 People - Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Spaniard, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSSpanishAverage Tone
2 People - New Zealand English Average Tone Speech Synthesis Corpus
2 People - New Zealand English Average Tone Speech Synthesis Corpus. It is recorded by rn native New Zealanders, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSNew Zealand EnglishAverage Tone
20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female
20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female. It is recorded by Chengdu Sichuan Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Synthesis CorpusTTSFemaleGeneralSichuanDialect
10 People - British English Average Tone Speech Synthesis Corpus
10 People - British English Average Tone Speech Synthesis Corpus. It is recorded by British English native speakers, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.