[{"@type":"PropertyValue","name":"Format","value":"48,000Hz, 24bit, uncompressed wav, mono channel;"},{"@type":"PropertyValue","name":"Recording environment","value":"Recording studio"},{"@type":"PropertyValue","name":"Recording content","value":"Provide a list of 36 topics, speakers choose one and start a spontaneous dialogue;"},{"@type":"PropertyValue","name":"Speaker","value":"370 people in total,18~60 years old"},{"@type":"PropertyValue","name":"Annotation","value":"14 kinds of paralanguage annotation; text transcription; speaker ID; special symbol"},{"@type":"PropertyValue","name":"Device","value":"Microphone;"},{"@type":"PropertyValue","name":"Language","value":"Mandarin Chinese;"}]
{"id":1589,"datatype":"1","titleimg":"https://www.nexdata.ai/shujutang/static/image/index/datatang_yuyin_default.webp","type1":"165","type1str":null,"type2":"219","type2str":null,"dataname":"Mandarin Chinese Spontaneous Dialogue Paralanguage Annotated Speech Synthesis Corpus","datazy":[{"title":"Format","content":"48,000Hz, 24bit, uncompressed wav, mono channel;"},{"title":"Recording environment","content":"Recording studio"},{"title":"Recording content","content":"Provide a list of 36 topics, speakers choose one and start a spontaneous dialogue;"},{"title":"Speaker","content":"370 people in total,18~60 years old"},{"title":"Annotation","content":"14 kinds of paralanguage annotation; text transcription; speaker ID; special symbol"},{"title":"Device","content":"Microphone;"},{"title":"Language","content":"Mandarin Chinese;"}],"datatag":"Mandarin Chinese,TTS,Spontaneous Dialogue,Paralanguage","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":null,"samplePresentation":[{"name":"demo1.wav","url":"https://storage-product.datatang.com/damp/product/sample_presentation/20250723135855/demo1.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=eLT7%2BXlaLQaLPmud41SEF6O3V2c%3D","intro":"<V>她<S/>特</S>别喜欢<F/>就是</F>小蛋糕,我们有时候也叫她蛋糕妹,<V>因为她<S/>每</S>一天都要吃。","size":1333100,"progress":100,"type":"mp3"},{"name":"demo2.wav","url":"https://storage-product.datatang.com/damp/product/sample_presentation/20250723135855/demo2.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=eun9phcHpAcfM2p6gHwsBH7dxsQ%3D","intro":"<V>大窑这个饮料<M/>啊</M>还是比较好喝的,推荐去尝试一下。","size":803180,"progress":100,"type":"mp3"},{"name":"demo3.wav","url":"https://storage-product.datatang.com/damp/product/sample_presentation/20250723135855/demo3.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=MDbcQFHuwewUcsW1li%2BRqlT5gS0%3D","intro":"<V>再加上她的男朋友,<V><F/>然后</F>每次他们一吵架,她男朋友就给她买那个小蛋糕去哄她。","size":1421036,"progress":100,"type":"mp3"}],"officialSummary":"Mandarin Chinese Spontaneous Dialogue Paralanguage Annotated Speech Synthesis Corpus, recorded by 370 Chinese native speakers, natural conversation style. Professional phonetician annotationed 14 kinds of paralanguages, transcriptions, speakers, and so on, precisely matches with the research and development needs of the speech synthesis.","dataexampl":null,"datakeyword":["Mandarin Chinese","TTS","Spontaneous Dialogue","Paralanguage"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Voice Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechSyn","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}
Mandarin Chinese Spontaneous Dialogue Paralanguage Annotated Speech Synthesis Corpus
Mandarin Chinese
TTS
Spontaneous Dialogue
Paralanguage
Mandarin Chinese Spontaneous Dialogue Paralanguage Annotated Speech Synthesis Corpus, recorded by 370 Chinese native speakers, natural conversation style. Professional phonetician annotationed 14 kinds of paralanguages, transcriptions, speakers, and so on, precisely matches with the research and development needs of the speech synthesis.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
48,000Hz, 24bit, uncompressed wav, mono channel;
Recording environment
Recording studio
Recording content
Provide a list of 36 topics, speakers choose one and start a spontaneous dialogue;
Speaker
370 people in total,18~60 years old
Annotation
14 kinds of paralanguage annotation; text transcription; speaker ID; special symbol