[{"@type":"PropertyValue","name":"Format","value":"48kHz, 24 bit, wav, mono channel"},{"@type":"PropertyValue","name":"Recording condition","value":"Recording studio"},{"@type":"PropertyValue","name":"Content category","value":"Spontaneous dialogue in given topics"},{"@type":"PropertyValue","name":"Speaker","value":"294 people (Non-Professional Voice Actors) in total, gender balanced (144 females and 150 males), 18~60 years old;"},{"@type":"PropertyValue","name":"Features of annotation","value":"16 kinds of paralanguage annotation; text transcription; speaker ID, special symbol;"},{"@type":"PropertyValue","name":"Recording device","value":"Microphone"},{"@type":"PropertyValue","name":"Language","value":"Mandarin Chinese"},{"@type":"PropertyValue","name":"Country","value":"China(CHN)"},{"@type":"PropertyValue","name":"Language(Region) Code","value":"zh-CN"},{"@type":"PropertyValue","name":"Accuracy","value":"Character Accuracy Rate 99%"}]
{"id":1620,"datatype":"1","titleimg":"https://www.nexdata.ai/shujutang/static/image/index/datatang_yuyin_default.webp","type1":"165","type1str":null,"type2":"219","type2str":null,"dataname":"Mandarin Chinese Seperated Track Spontaneous Dialogue Paralanguage Annotated Speech Synthesis Corpus","datazy":[{"isCheckLength":true,"title":"Format","content":"48kHz, 24 bit, wav, mono channel"},{"isCheckLength":true,"title":"Recording condition","content":"Recording studio"},{"isCheckLength":true,"title":"Content category","content":"Spontaneous dialogue in given topics"},{"isCheckLength":true,"title":"Speaker","content":"294 people (Non-Professional Voice Actors) in total, gender balanced (144 females and 150 males), 18~60 years old;"},{"isCheckLength":true,"title":"Features of annotation","content":"16 kinds of paralanguage annotation; text transcription; speaker ID, special symbol;"},{"isCheckLength":true,"title":"Recording device","content":"Microphone"},{"isCheckLength":true,"title":"Language","content":"Mandarin Chinese"},{"isCheckLength":true,"title":"Country","content":"China(CHN)"},{"isCheckLength":true,"title":"Language(Region) Code","content":"zh-CN"},{"isCheckLength":true,"title":"Accuracy","content":"Character Accuracy Rate 99%"}],"datatag":"M Chinese,Spontaneous Dialogue,Seperated track,Conversation,48khz","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":null,"samplePresentation":[{"name":"0315_1_001_intervals [16].wav","intro":"有的有的,<P>它那种枪战类型的游戏<M/>呢</M>,考的就是肌肉的反应能力和思维的敏捷能力。","size":1119688,"progress":100,"type":"mp3"},{"name":"0315_2_002_intervals [190].wav","intro":"那<D/>你</D>如<D/>果</D>要介绍<P>是比方有朋友找你,你会推荐他去吃这个<M/>吗</M>?","size":849702,"progress":100,"type":"mp3"},{"name":"0310_1_002_intervals [18].wav","intro":"<V>他现在已经透了一些花絮出来了,我看见<R/>抖音抖音</R>上面已经有了。","size":838228,"progress":100,"type":"mp3"}],"officialSummary":"Mandarin Chinese Seperated Track Spontaneous Dialogue Paralanguage Annotated Speech Synthesis Corpus, with a free dialogue style. Given a topic, the speaker can express themselves, and in each conversation, each person's audio is stored in their own separate WAV file. Professional linguists have annotated 16 types of paralanguage annotations, text annotations, timestamps, and other information to accurately match the research and development needs of speech synthesis.","dataexampl":null,"datakeyword":["M Chinese","Spontaneous Dialogue","Seperated track","Conversation","48khz"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Voice Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechSyn","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}
Mandarin Chinese Seperated Track Spontaneous Dialogue Paralanguage Annotated Speech Synthesis Corpus
M Chinese
Spontaneous Dialogue
Seperated track
Conversation
48khz
Mandarin Chinese Seperated Track Spontaneous Dialogue Paralanguage Annotated Speech Synthesis Corpus, with a free dialogue style. Given a topic, the speaker can express themselves, and in each conversation, each person's audio is stored in their own separate WAV file. Professional linguists have annotated 16 types of paralanguage annotations, text annotations, timestamps, and other information to accurately match the research and development needs of speech synthesis.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
48kHz, 24 bit, wav, mono channel
Recording condition
Recording studio
Content category
Spontaneous dialogue in given topics
Speaker
294 people (Non-Professional Voice Actors) in total, gender balanced (144 females and 150 males), 18~60 years old;
Features of annotation
16 kinds of paralanguage annotation; text transcription; speaker ID, special symbol;