[{"@type":"PropertyValue","name":"Format","value":"16kHz, 16bit, uncompressed wav, mono channel;"},{"@type":"PropertyValue","name":"Content category","value":"Economy, entertainment, news, informal language, numbers, alphabet;"},{"@type":"PropertyValue","name":"Recording condition","value":"Low background noise(indoor), without echo;"},{"@type":"PropertyValue","name":"Recording device","value":"Android smartphone:iPhone = 3:1;"},{"@type":"PropertyValue","name":"Speaker","value":"496 Indonesian; 44% male and 56% female;"},{"@type":"PropertyValue","name":"Country","value":"Indonesia(IDN);"},{"@type":"PropertyValue","name":"Language(Region) Code","value":"id-ID;"},{"@type":"PropertyValue","name":"Language","value":"Indonesian;"},{"@type":"PropertyValue","name":"Features of annotation","value":"Transcription text, timestamp, 5 noise symbols, special identifiers;"},{"@type":"PropertyValue","name":"Accuracy Rate","value":"Sentence Accuracy Rate (SAR) 95%(noise symbols and other identifiers are excluded)"}]
{"id":71,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY161101030_R.png?Expires=2007353624&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=3NmUgYVJu7xXFXnbs4/ZBCpflU8%3D","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"359 Hours - Indonesian(Indonesia) Scripted Monologue Smartphone speech dataset","datazy":[{"title":"Format","value":"16kHz, 16bit, uncompressed wav, mono channel;"},{"title":"Content category","value":"Economy, entertainment, news, informal language, numbers, alphabet;"},{"title":"Recording condition","value":"Low background noise(indoor), without echo;"},{"title":"Recording device","value":"Android smartphone:iPhone = 3:1;"},{"title":"Speaker","value":"496 Indonesian; 44% male and 56% female;"},{"title":"Country","value":"Indonesia(IDN);"},{"title":"Language(Region) Code","value":"id-ID;"},{"title":"Language","value":"Indonesian;"},{"title":"Features of annotation","value":"Transcription text, timestamp, 5 noise symbols, special identifiers;"},{"title":"Accuracy Rate","value":"Sentence Accuracy Rate (SAR) 95%(noise symbols and other identifiers are excluded)"}],"datatag":"Indonesian,Indonesia,Smartphone,Reading,Scripted Monologue","technologydoc":null,"downurl":null,"datainfo":"496 speakers, native Indonesian. quiet recording environment. rich recording conent, covers economy , entertainment, news and spoken Indonesian. All the text are manually transcribed with high accuracy.","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":["496 people","359 hours","400 sentences for each person"],"samplePresentation":[["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101030_R_demo1695808864567/APY161101030_R_demo/T0170G0165S0010.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=VRg8On0EzDkSesGJi%2Bx8mhBqaRg%3D","/data/apps/damp/temp/ziptemp/APY161101030_R_demo1695808864567/APY161101030_R_demo/T0170G0165S0010.wav","Padahal potensi laut Indonesia sangat luar biasa."],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101030_R_demo1695808864567/APY161101030_R_demo/T0163G0002S0001.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=jMZrbryXq%2BT5SfHPqx90ELJwdYU%3D","/data/apps/damp/temp/ziptemp/APY161101030_R_demo1695808864567/APY161101030_R_demo/T0163G0002S0001.wav","Ternyata[[lipsmack]],warga menyakini sudah ada rencana pemerintah sejak lama untuk menata Kampung Pulo"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101030_R_demo1695808864567/APY161101030_R_demo/T0174G0367S0004.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=Pym0g8bviJ6TnTaJw830Yprml1g%3D","/data/apps/damp/temp/ziptemp/APY161101030_R_demo1695808864567/APY161101030_R_demo/T0174G0367S0004.wav","Sebab,Indonesia hanya lebih tinggi dari India,Pakistan dan Filipina."],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101030_R_demo1695808864567/APY161101030_R_demo/T0174G0367S0033.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=l1VyUlalwcZhBVNOer%2BzCbWllBQ%3D","/data/apps/damp/temp/ziptemp/APY161101030_R_demo1695808864567/APY161101030_R_demo/T0174G0367S0033.wav","Itu terlihat dari melambatnya ekonomi Indonesia sepanjang Triwulan satu dua ribu empat belas lalu."],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY161101030_R_demo1695808864567/APY161101030_R_demo/T0166G0019S0002.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=Xlfu6ftAFgu5tp8bVtVWPJMO9hA%3D","/data/apps/damp/temp/ziptemp/APY161101030_R_demo1695808864567/APY161101030_R_demo/T0166G0019S0002.wav","Demi Wisatawan[[lipsmack]],Menteri[[lipsmack]] Rizal Bebaskan Visa empat puluh tujuh Negara[[lipsmack]] Jawapos dot com"]],"officialSummary":"Indonesian(Indonesia) Scripted Monologue Smartphone speech dataset, collected from monologue based on given scripts, covering economy, entertainment, news, informal language, numbers, alphabet domains. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(496 speakers), geographicly speaking, enhancing model performance in real and complex tasks.nQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":"","datakeyword":["Indonesian data"," mobile phone collected voice data"," read voice"," Indonesian voice"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Data Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no"}
Indonesian(Indonesia) Scripted Monologue Smartphone speech dataset, collected from monologue based on given scripts, covering economy, entertainment, news, informal language, numbers, alphabet domains. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(496 speakers), geographicly speaking, enhancing model performance in real and complex tasks.nQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.