[{"@type":"PropertyValue","name":"Format","value":"44.1kHz, 16bit, wav, dual channel."},{"@type":"PropertyValue","name":"Recording environment","value":"Mixed"},{"@type":"PropertyValue","name":"Recording content","value":"lectures on science and technology, training, publicity, etc."},{"@type":"PropertyValue","name":"Device","value":"AU Center Console Mixer"},{"@type":"PropertyValue","name":"Country","value":"China(CHN)"},{"@type":"PropertyValue","name":"Language","value":"Mandarin"},{"@type":"PropertyValue","name":"Features of annotation","value":"annotating for the transcription text, speaker identification and gender"},{"@type":"PropertyValue","name":"Accuracy Rate","value":"Sentence Accuracy Rate(SAR) 97%"}]
{"id":1066,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY200229003.png?Expires=2007353680&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=lNR7zzSwafiD7FPaXwDbT9Yicy0%3D","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"1,722 Hours - Mandarin(China) Near-field Conference speech dataset","datazy":[{"title":"Format","value":"44.1kHz, 16bit, wav, dual channel."},{"title":"Recording environment","value":"Mixed"},{"title":"Recording content","value":"lectures on science and technology, training, publicity, etc."},{"title":"Device","value":"AU Center Console Mixer"},{"title":"Country","value":"China(CHN)"},{"title":"Language","value":"Mandarin"},{"title":"Features of annotation","value":"annotating for the transcription text, speaker identification and gender"},{"title":"Accuracy Rate","value":"Sentence Accuracy Rate(SAR) 97%"}],"datatag":"Mandarin,Speech,Near field,Center Console","technologydoc":null,"downurl":null,"datainfo":"","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/01.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=5iK%2BRv2bdt%2F0%2FEO4vDSi40%2FO19w%3D","/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/01.wav","我觉得呃我觉得我工作的一直非常的开心为什么开心呢因为其实有三点"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/05.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=NHJo3x6EmLvBWCXChY8UIyQ0Onk%3D","/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/05.wav","但是我们每家我们都有我们自己的特点我们每家都能够呃做出自己的亮点我们都可以去互相的学习"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/04.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=%2BSfn2DtikBBVHVnxacyfXPsmmTI%3D","/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/04.wav","因为不管我们行业发展到什么样的情况我们的企业在什么样的阶段我们的学习培训工作在什么样的阶段"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/02.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=2LwBd3QdtDvZgZ%2BeW8X%2BrxcePsU%3D","/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/02.wav","我第一个是什么呢就是我以为我觉得这样的一个行业"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/03.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=KEvdUIKlaAHvv6RmhJuC1fW1mNw%3D","/data/apps/damp/temp/ziptemp/APY200229003_demo1712743251148/APY200229003_demo/03.wav","是一个百花齐放嗯这个百家争鸣的行业"]],"officialSummary":"Mandarin(China) Near-field Conference speech dataset, collected the output by AU central console mixer in real speech scenes. It has a natural pronunciation without environmental noise almost, covers a variety of topics. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":"","datakeyword":["Mandarin speech dataset"," Mandarin Near-field Conference dataset"," Mandarin speech data"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Data Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no"}
Mandarin(China) Near-field Conference speech dataset, collected the output by AU central console mixer in real speech scenes. It has a natural pronunciation without environmental noise almost, covers a variety of topics. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
44.1kHz, 16bit, wav, dual channel.
Recording environment
Mixed
Recording content
lectures on science and technology, training, publicity, etc.
Device
AU Center Console Mixer
Country
China(CHN)
Language
Mandarin
Features of annotation
annotating for the transcription text, speaker identification and gender
Accuracy Rate
Sentence Accuracy Rate(SAR) 97%
Sample
Audio
我觉得呃我觉得我工作的一直非常的开心为什么开心呢因为其实有三点
Audio
但是我们每家我们都有我们自己的特点我们每家都能够呃做出自己的亮点我们都可以去互相的学习
Audio
因为不管我们行业发展到什么样的情况我们的企业在什么样的阶段我们的学习培训工作在什么样的阶段
Audio
我第一个是什么呢就是我以为我觉得这样的一个行业
Audio
是一个百花齐放嗯这个百家争鸣的行业
Recommended Dataset
1,003 People - Emotional Video Data
Emotional Video Data,including multiple races, multiple indoor scenes, multiple age groups, multiple languages, multiple emotions (11 types of facial emotions, 15 types of inner emotions). For each sentence in each video, annotated emotion types (including facial emotions and inner emotions), start & end timestamp, text transcription.This dataset can be used for tasks such as emotion recognition and sentiment analysis, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Emotional video multiple races multiple indoor scenes multiple age groups multiple languages multiple emotions11 types of facial emotions 15 types of inner emotionsfeelingpassionsentimentexcitementsensationaffectionintensityardorsensibilityfervorvehemenceloveresponsewarmthzealemotionalpathosagitationspiritaffectivityenthusiasmfervencyperturbationimpressionsympathysadnesssentimentalitythrilleagernesspoignancycommotionemotionalismfiresoulanimationemotionalityresponsivenesssensitivenessdisturbanceenergyexpressiontendernesspitifulnessplaintivenesspoignancesentimentsvibesangerempathyfeelings
English(the United States) Emotion Scripted Monologue Microphone speech dataset, collected from monologue based on given scripts, covering 10 types of emotional scripts,such as anger, happiness, sadness, etc., matches real-world scenario. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(20 American native speakers), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
English emotional audio data captured by microphone emotional audio detection data English emotional audio data