[{"@type":"PropertyValue","name":"Format","value":"48,000Hz, 24bit, uncompressed wav, mono channel;"},{"@type":"PropertyValue","name":"Recording environment","value":"Recording studio"},{"@type":"PropertyValue","name":"Recording content","value":"Provide a list of 36 topics, speakers choose one and start a spontaneous dialogue;"},{"@type":"PropertyValue","name":"Speaker","value":"370 people in total,18~60 years old"},{"@type":"PropertyValue","name":"Annotation","value":"14 kinds of paralanguage annotation; text transcription; speaker ID; special symbol"},{"@type":"PropertyValue","name":"Device","value":"Microphone;"},{"@type":"PropertyValue","name":"Language","value":"Mandarin Chinese;"}]
{"id":1589,"datatype":"1","titleimg":"https://www.nexdata.ai/shujutang/static/image/index/datatang_yuyin_default.webp","type1":"165","type1str":null,"type2":"219","type2str":null,"dataname":"Mandarin Chinese Speech Synthesis Dataset – 370 Speakers, 200 Hours","datazy":[{"title":"Format","content":"48,000Hz, 24bit, uncompressed wav, mono channel;"},{"title":"Recording environment","content":"Recording studio"},{"title":"Recording content","content":"Provide a list of 36 topics, speakers choose one and start a spontaneous dialogue;"},{"title":"Speaker","content":"370 people in total,18~60 years old"},{"title":"Annotation","content":"14 kinds of paralanguage annotation; text transcription; speaker ID; special symbol"},{"title":"Device","content":"Microphone;"},{"title":"Language","content":"Mandarin Chinese;"}],"datatag":"Mandarin Chinese,TTS,Spontaneous Dialogue,Paralanguage","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":null,"samplePresentation":[{"name":"demo1.wav","url":"https://storage-product.datatang.com/damp/product/sample_presentation/20250723135855/demo1.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=eLT7%2BXlaLQaLPmud41SEF6O3V2c%3D","intro":"<V>她<S/>特</S>别喜欢<F/>就是</F>小蛋糕,我们有时候也叫她蛋糕妹,<V>因为她<S/>每</S>一天都要吃。","size":1333100,"progress":100,"type":"mp3"},{"name":"demo2.wav","url":"https://storage-product.datatang.com/damp/product/sample_presentation/20250723135855/demo2.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=eun9phcHpAcfM2p6gHwsBH7dxsQ%3D","intro":"<V>大窑这个饮料<M/>啊</M>还是比较好喝的,推荐去尝试一下。","size":803180,"progress":100,"type":"mp3"},{"name":"demo3.wav","url":"https://storage-product.datatang.com/damp/product/sample_presentation/20250723135855/demo3.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=MDbcQFHuwewUcsW1li%2BRqlT5gS0%3D","intro":"<V>再加上她的男朋友,<V><F/>然后</F>每次他们一吵架,她男朋友就给她买那个小蛋糕去哄她。","size":1421036,"progress":100,"type":"mp3"}],"officialSummary":"This dataset is recorded by 370 Chinese native speakers and 200 hours of natural conversation audio. Professional phonetician annotationed 14 kinds of paralanguages, full transcriptions, and speaker metadata. Precisely matches with the research and development needs of speech synthesis, dialogue TTS, and natural language modeling research.","dataexampl":null,"datakeyword":["Chinese paralanguage dataset","spontaneous dialogue dataset","Chinese conversational speech corpus","Mandarin speech synthesis corpus","Chinese speech synthesis dataset"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Voice Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechSyn","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}
Mandarin Chinese Speech Synthesis Dataset – 370 Speakers, 200 Hours
Chinese paralanguage dataset
spontaneous dialogue dataset
Chinese conversational speech corpus
Mandarin speech synthesis corpus
Chinese speech synthesis dataset
This dataset is recorded by 370 Chinese native speakers and 200 hours of natural conversation audio. Professional phonetician annotationed 14 kinds of paralanguages, full transcriptions, and speaker metadata. Precisely matches with the research and development needs of speech synthesis, dialogue TTS, and natural language modeling research.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
48,000Hz, 24bit, uncompressed wav, mono channel;
Recording environment
Recording studio
Recording content
Provide a list of 36 topics, speakers choose one and start a spontaneous dialogue;
Speaker
370 people in total,18~60 years old
Annotation
14 kinds of paralanguage annotation; text transcription; speaker ID; special symbol