[{"@type":"PropertyValue","name":"Data content","value":"prosodic annotation for 200,955 selected Chinese sentences"},{"@type":"PropertyValue","name":"Data scale","value":"200,955 sentences"},{"@type":"PropertyValue","name":"Data source","value":"all the text comes from the news and human conversation"},{"@type":"PropertyValue","name":"Annotation","value":"4 prosodic hierarchies annotating"},{"@type":"PropertyValue","name":"Language","value":"Chinese"},{"@type":"PropertyValue","name":"Application scenarios","value":"speech synthesis"},{"@type":"PropertyValue","name":"Accuracy","value":"not lower than 99%"}]
{"id":1027,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY190717001.png?Expires=2007353669&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=XHs2rEol0yAyYie7YMzsRCsOBOU%3D","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"200,955 Sentences - Mandarin Prosodic Corpus Data","datazy":[{"title":"Data content","value":"prosodic annotation for 200,955 selected Chinese sentences"},{"title":"Data scale","value":"200,955 sentences"},{"title":"Data source","value":"all the text comes from the news and human conversation"},{"title":"Annotation","value":"4 prosodic hierarchies annotating"},{"title":"Language","value":"Chinese"},{"title":"Application scenarios","value":"speech synthesis"},{"title":"Accuracy","value":"not lower than 99%"}],"datatag":"Chinese,Prosodic Annotation,Speech Synthesis,Front-end Training Set","technologydoc":null,"downurl":null,"datainfo":"4 prosodic hierarchies annotating for the 200,000 carefully selected Chinese texts which involve news and colloquial sentences. The sentence length is appropriate with diversified sentence patterns. This can be used as a TTS front-end prosody prediction training data set.","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":["jpg","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY1907170011695808958133/APY190717001/%7B5EF6B86A-F494-41E9-AF8E-B7A987AF6785%7D_20190719090717.jpg?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=MMxI6zvSR2rzgvONlSXKnkRWOJg%3D","/data/apps/damp/temp/ziptemp/APY1907170011695808958133/APY190717001/{5EF6B86A-F494-41E9-AF8E-B7A987AF6785}_20190719090717.jpg",""],"officialSummary":"4 prosodic hierarchies annotating for the 200000 carefully selected Chinese texts which involve news and colloquial sentences. The sentence length is appropriate with diversified sentence patterns. This can be used as a TTS front-end prosody prediction training data set.","dataexampl":"","datakeyword":["Prosodic annotation of Chinese text"," prosodic corpus of Chinese text"," news prosodic annotation"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Voice Type,Language","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechSyn","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"yes"}
4 prosodic hierarchies annotating for the 200000 carefully selected Chinese texts which involve news and colloquial sentences. The sentence length is appropriate with diversified sentence patterns. This can be used as a TTS front-end prosody prediction training data set.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Data content
prosodic annotation for 200,955 selected Chinese sentences
Data scale
200,955 sentences
Data source
all the text comes from the news and human conversation
Annotation
4 prosodic hierarchies annotating
Language
Chinese
Application scenarios
speech synthesis
Accuracy
not lower than 99%
Sample
Recommended Dataset
200,475 Sentences - Chinese Text Normalization Data
200,475 Sentences - Chinese Text Normalization Data. Annotate the special symbols and Arabic numerals in the sentences as Chinese characters.
TN data text regularized data speech synthesis data speech synthesis data set speech synthesis data
319,977 Sentences - Mandarin Polyphone Corpus Data
The Mandarin Polyphone Corpus Data is designed for polyphone disambiguation. It includes 603 common Mandarin pinyin pronunciations, There are differences in the number of phonetic corpora according to the number of phrases in a single word.
Chinese Polysyllabic Corpus Chinese polyphone corpusChinese corpus