[{"@type":"PropertyValue","name":"Format","value":"16kHz, 16bit, uncompressed wav, mono channel;"},{"@type":"PropertyValue","name":"Recording condition","value":"Low background noise (indoor), without echo;"},{"@type":"PropertyValue","name":"Content category","value":"100,000 common expressions;"},{"@type":"PropertyValue","name":"Recording device","value":"Android smartphone;"},{"@type":"PropertyValue","name":"Speaker","value":"3,691 Chinese, 34% male and 66% female;"},{"@type":"PropertyValue","name":"Country","value":"China(CHN);"},{"@type":"PropertyValue","name":"Language","value":"English;"},{"@type":"PropertyValue","name":"Features of annotation","value":"Transcription text;"},{"@type":"PropertyValue","name":"Accuracy Rate","value":"Sentence Accuracy Rate (SAR) 95%"}]
{"id":32,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY161101010.png?Expires=2007353613&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=ejxTH3sHzrMUuO7EXVq1AbfSDE8%3D","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"593 Hours - English(China) Scripted Monologue Smartphone speech dataset","datazy":[{"title":"Format","value":"16kHz, 16bit, uncompressed wav, mono channel;"},{"title":"Recording condition","value":"Low background noise (indoor), without echo;"},{"title":"Content category","value":"100,000 common expressions;"},{"title":"Recording device","value":"Android smartphone;"},{"title":"Speaker","value":"3,691 Chinese, 34% male and 66% female;"},{"title":"Country","value":"China(CHN);"},{"title":"Language","value":"English;"},{"title":"Features of annotation","value":"Transcription text;"},{"title":"Accuracy Rate","value":"Sentence Accuracy Rate (SAR) 95%"}],"datatag":"English,China,Reading,Scripted Monologue","technologydoc":null,"downurl":null,"datainfo":"Participate: Thousands of Chinese pepole, covers most of dialect district in China. Chinese speaking English accent. Recording text: commonly used English sentence, extensive content, extensive fields, balanced phoneme. This data can be applied to improve speech recognition system, the recognition of Chinese speaking English.","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":["3,691 people","16kHz, 16bit, wav","100,000 commonly used English sentences"],"samplePresentation":[["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101010_demo1730455200155/apy161101010/T0055G0003S0009.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=tc%2FKLs547VSMHPkU93mJE8CAa9Q%3D","/data/apps/damp/temp/ziptemp/APY161101010_demo1730455200155/apy161101010/T0055G0003S0009.wav","Her cheeks had fallen in,making her look old."],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101010_demo1730455200155/apy161101010/T0055G0030S0021.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=OWNF%2B%2FeZGj4Z%2F5mXRD8bE4%2BcI2A%3D","/data/apps/damp/temp/ziptemp/APY161101010_demo1730455200155/apy161101010/T0055G0030S0021.wav","No milk. I'm slimming."],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101010_demo1730455200155/apy161101010/T0055G0002S0001.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=iYuQSVoDOc5ns646UzTe%2FR1wdUM%3D","/data/apps/damp/temp/ziptemp/APY161101010_demo1730455200155/apy161101010/T0055G0002S0001.wav","We're focused on small things: Do I have my pierce?"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101010_demo1730455200155/apy161101010/T0055G0007S0001.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=%2BPilGZfeDKj3sFWifrnfrD%2Fcbv0%3D","/data/apps/damp/temp/ziptemp/APY161101010_demo1730455200155/apy161101010/T0055G0007S0001.wav","No one could know why he did like that."],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101010_demo1730455200155/apy161101010/T0055G0023S0004.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=qttZXeX53WG1JjfRnOhm2s7zKKM%3D","/data/apps/damp/temp/ziptemp/APY161101010_demo1730455200155/apy161101010/T0055G0023S0004.wav","The bark scaled off the tree."]],"officialSummary":"English(China) Scripted Monologue Smartphone speech dataset, collected from monologue based on given scripts, covering 100,000 common expressions. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(3,691 Chinese, covering domestic dialect zones like Jiangsu, Shandong, Beijing, He'nan, and meets the specific accents of Chinese speaking English), geographicly speaking, enhancing model performance in real and complex tasks.Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":"","datakeyword":["Chinese English speech data","Chinese English speech dataset","speech dataset","speech data","English speech dataset","English speech data","Chinese speech dataset","Chinese speech data","audio data","audio dataset"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Data Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no"}
English(China) Scripted Monologue Smartphone speech dataset, collected from monologue based on given scripts, covering 100,000 common expressions. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(3,691 Chinese, covering domestic dialect zones like Jiangsu, Shandong, Beijing, He'nan, and meets the specific accents of Chinese speaking English), geographicly speaking, enhancing model performance in real and complex tasks.Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
16kHz, 16bit, uncompressed wav, mono channel;
Recording condition
Low background noise (indoor), without echo;
Content category
100,000 common expressions;
Recording device
Android smartphone;
Speaker
3,691 Chinese, 34% male and 66% female;
Country
China(CHN);
Language
English;
Features of annotation
Transcription text;
Accuracy Rate
Sentence Accuracy Rate (SAR) 95%
Sample
Audio
Her cheeks had fallen in,making her look old.
Audio
No milk. I'm slimming.
Audio
We're focused on small things: Do I have my pierce?