Format: 16kHz, 16bit, uncompressed wav, mono channel
Recording condition: Low background noise (indoor), without echo
Content category: Informal expressions
Recording device: Android Smartphone, iPhone
Speaker: 2,508 people, 47% male and 53% female
Country: China (CHN)
Language: Uyghur
Features of annotation: Transcription text
Accuracy Rate: Sentence Accuracy Rate (SAR) 95%
738 Hours - Uyghur(China) Scripted Monologue Smartphone speech dataset
Tags: Uyghur, China, Smartphone, Reading, Scripted Monologue
Dataset information: 2,058 speakers, all from the Uyghur area, with a balanced gender ratio. The recorded content is spoken Uyghur sentences. All sentences were manually transcribed, with noise labels annotated.
Key information: 2,508 people; 16kHz, 16bit, wav; 300,000 colloquial Uyghur sentences
Sample recordings and transcriptions:
T0064G1244S0002.wav: باغرى بەكمۇ ئىسسىق جاي
T0001G0190S0001.wav: تاكسى بىلەن بېرىپ كېلىشكە قانچىلىك پۇل كېتىدۇ ؟
T0064G1290S0003.wav: ۇنىڭ قوللىرى قىچىشىپ كەتكەنىدى .
T0001G0351S0004.wav: ىممىتى بار ئىنكاس .
T0001G0334S0018.wav: ەن نىگارىمنىڭ ئايدەك ھۆسنىگە،
Keywords: Uyghur speech data; Uyghur speech data collected on mobile phones
Uyghur(China) Scripted Monologue Smartphone speech dataset, collected from monologues based on given scripts, covering 300,000 Uyghur local expressions. Transcribed with text content and other attributes. Our dataset was collected from a large and geographically diverse pool of speakers (2,058 Uyghur speakers), helping to enhance model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring that user privacy and legal rights are maintained throughout data collection, storage, and usage. Our datasets are all GDPR, CCPA, and PIPL compliant.
This is a paid dataset for commercial use, research purposes, and more. Licensed, ready-made datasets help jump-start AI projects.
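The listing states that recordings are 16 kHz, 16-bit, mono, uncompressed WAV files. As a minimal sketch, assuming the files are delivered as standard PCM WAV, the check below reads each file header with Python's standard `wave` module and reports any mismatch against those values. The function names `check_wav_format` and `duration_seconds` are hypothetical helpers written for this illustration and are not part of any vendor tooling.

```python
import wave

# Expected values taken from the "Format" field of this listing.
EXPECTED_SAMPLE_RATE_HZ = 16_000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16-bit samples
EXPECTED_CHANNELS = 1            # mono


def check_wav_format(path):
    """Return a list of mismatches between the WAV header and the listed format.

    An empty list means the header matches 16 kHz, 16-bit, mono, uncompressed.
    """
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != EXPECTED_SAMPLE_RATE_HZ:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH_BYTES:
            problems.append(f"sample width is {8 * wav.getsampwidth()} bit")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"channel count is {wav.getnchannels()}")
        if wav.getcomptype() != "NONE":
            problems.append(f"compression type is {wav.getcomptype()}")
    return problems


def duration_seconds(path):
    """Return the duration of a WAV file in seconds, computed from its header."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()
```

Running `check_wav_format` over every delivered file and summing `duration_seconds` gives a quick sanity check of a delivery against the listed format and total hours before any training pipeline is built on it.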
Mandarin Chinese(China) Heavy Accent Scripted Monologue Smartphone speech dataset, collected from monologues based on given scripts, covering generic domain, human-machine interaction, smart home command and control, in-car command and control, numbers and other domains. Transcribed with text content and other attributes. Our dataset was collected from a large and geographically diverse pool of speakers (2,444 people in total, mainly from southern China, with some from northern China), helping to enhance model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring that user privacy and legal rights are maintained throughout data collection, storage, and usage. Our datasets are all GDPR, CCPA, and PIPL compliant.
English(China) Scripted Monologue Smartphone speech dataset, collected from monologues based on given scripts, covering generic domain, informal English, human-machine interaction and other domains. Transcribed with text content and other attributes. Our dataset was collected from a large and geographically diverse pool of speakers (1,279 people in total, covering 7 dialect regions across China), helping to enhance model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring that user privacy and legal rights are maintained throughout data collection, storage, and usage. Our datasets are all GDPR, CCPA, and PIPL compliant.
English(the United States) Scripted Monologue Smartphone speech dataset, collected from monologues based on given scripts, covering generic domain, human-machine interaction, smart home command and in-car command, numbers and other domains. Transcribed with text content and other attributes. Our dataset was collected from a large and geographically diverse pool of speakers (1,842 Americans in total), helping to enhance model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring that user privacy and legal rights are maintained throughout data collection, storage, and usage. Our datasets are all GDPR, CCPA, and PIPL compliant.
Malay(Malaysia) Scripted Monologue Smartphone speech dataset, collected from monologues based on given scripts, covering generic domain, human-machine interaction, smart home command and in-car command, news and other domains. Transcribed with text content and other attributes. Our dataset was collected from a large and geographically diverse pool of speakers (675 people in total), helping to enhance model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring that user privacy and legal rights are maintained throughout data collection, storage, and usage. Our datasets are all GDPR, CCPA, and PIPL compliant.
Indonesian(Indonesia) Scripted Monologue Smartphone speech dataset, collected from monologues based on given scripts, covering generic domain, human-machine interaction, smart home command and in-car command, news and other domains. Transcribed with text content and other attributes. Our dataset was collected from a large and geographically diverse pool of speakers (1,285 people in total), helping to enhance model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring that user privacy and legal rights are maintained throughout data collection, storage, and usage. Our datasets are all GDPR, CCPA, and PIPL compliant.
English(Spain) Scripted Monologue Smartphone speech dataset, collected from monologues based on given scripts, covering generic domain, human-machine interaction, smart home command and in-car command, numbers and other domains. Transcribed with text content and other attributes. Our dataset was collected from a large and geographically diverse pool of speakers (891 people in total), helping to enhance model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring that user privacy and legal rights are maintained throughout data collection, storage, and usage. Our datasets are all GDPR, CCPA, and PIPL compliant.
English(France) Scripted Monologue Smartphone speech dataset, collected from monologues based on given scripts, covering generic domain, human-machine interaction, smart home command and in-car command, numbers and other domains. Transcribed with text content and other attributes. Our dataset was collected from a large and geographically diverse pool of speakers (1,089 people in total), helping to enhance model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring that user privacy and legal rights are maintained throughout data collection, storage, and usage. Our datasets are all GDPR, CCPA, and PIPL compliant.
English(Germany) Scripted Monologue Smartphone speech dataset, collected from monologues based on given scripts, covering generic domain, human-machine interaction, smart home command and in-car command, numbers and other domains. Transcribed with text content and other attributes. Our dataset was collected from a large and geographically diverse pool of speakers (1,162 people in total), helping to enhance model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring that user privacy and legal rights are maintained throughout data collection, storage, and usage. Our datasets are all GDPR, CCPA, and PIPL compliant.