[{"@type":"PropertyValue","name":"Format","value":"16kHz,16bit,wav,mono channel"},{"@type":"PropertyValue","name":"Recording environment","value":"quiet indoor environment, normal environment(contains noise that does not affect recognition)"},{"@type":"PropertyValue","name":"Recording content","value":"Speakers will read and record based on the given texts, with each text containing at least 1 type of specified entity word: person, phone number, address, alphanumeric sequence, Email, product Model, product serial number, and money."},{"@type":"PropertyValue","name":"Country","value":"Saudi Arabia(SAU), Egypt(EGY), UAE(ARE)"},{"@type":"PropertyValue","name":"Language(Region) Code","value":"ar-SA,ar-EG,ar-AE"},{"@type":"PropertyValue","name":"Language","value":"Arabic"},{"@type":"PropertyValue","name":"Accuracy","value":"WAR(Word Accuracy Rate) 98% (Punctuation, tags and non-speech annotations are subjective, thus they are excluded from the accuracy statistics.)"},{"@type":"PropertyValue","name":"Device","value":"Android phone, iPhone"}]
{"id":1958,"datatype":"1","titleimg":"https://www.nexdata.ai/shujutang/static/image/index/datatang_yuyin_default.webp","type1":"165","type1str":null,"type2":"166","type2str":null,"dataname":"Arabic Entities Scripted Monologue Smartphone speech dataset","datazy":[{"title":"Format","content":"16kHz,16bit,wav,mono channel"},{"title":"Recording environment","content":"quiet indoor environment, normal environment(contains noise that does not affect recognition)"},{"title":"Recording content","content":"Speakers will read and record based on the given texts, with each text containing at least 1 type of specified entity word: person, phone number, address, alphanumeric sequence, Email, product Model, product serial number, and money."},{"title":"Country","content":"Saudi Arabia(SAU), Egypt(EGY), UAE(ARE)"},{"title":"Language(Region) Code","content":"ar-SA,ar-EG,ar-AE"},{"title":"Language","content":"Arabic"},{"title":"Accuracy","content":"WAR(Word Accuracy Rate) 98% (Punctuation, tags and non-speech annotations are subjective, thus they are excluded from the accuracy statistics.)"},{"title":"Device","content":"Android phone, iPhone"}],"datatag":"Saudi,Egypt,UAE,Arabic,Smartphone,Reading,Scripted Monologue","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":null,"samplePresentation":[{"name":"G00007T08P00204.wav","url":"https://storage-product.datatang.com/damp/product/instructions_zh/20260430102954/G00007T08P00204.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=xojB%2BUSUdwOs5XwG0G2oV4gGqRo%3D","intro":"هل يمكنني التحقق من أن الخصم كان [MONEY/]عشرة جنيه[/MONEY]، هل تم تطبيق العرض الترويجي بشكل صحيح؟\\nهل يمكنني التحقق من أن الخصم كان [MONEY/]10 جنيه[/MONEY]، هل تم تطبيق العرض الترويجي بشكل صحيح؟","size":198478,"progress":100,"type":"mp3"},{"name":"G00007T07P00031.wav","url":"https://storage-product.datatang.com/damp/product/instructions_zh/20260430102954/G00007T07P00031.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=D%2FL5wMfkhWvQqqbqJnYOUc%2FqHpM%3D","intro":"مرحباً، لاحظت لون غير طبيعي على [PROTYP/] شاشة إل جي كونيد ميني LED خمسة وستين بوصة [/PROTYP]، هل يمكن التحقق من ذلك أو تحديد موعد للفحص؟\\nمرحباً، لاحظت لون غير طبيعي على [PROTYP/]إل جي شاشة QNED MiniLED مقاس 65 بوصة[/PROTYP]، هل يمكن التحقق من ذلك أو تحديد موعد للفحص؟","size":464078,"progress":100,"type":"mp3"}],"officialSummary":"Arabic Entities Scripted Monologue Smartphone speech dataset, covers several domains, including person, phone number, address, alphanumeric sequence, Email, product Model, product serial number, and money entities, mirrors real-world interactions. Transcribed with text content, and other attributes. Our dataset was collected from extensive and diversify speakers, geographically speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":null,"datakeyword":["Saudi","Egypt","UAE","Arabic","Smartphone","Reading","Scripted Monologue"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Data Type,Language","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","dataShowType":"[{\"code\":\"0\",\"language\":\"ZH\"},{\"code\":\"1\",\"language\":\"ZH\"},{\"code\":\"2\",\"language\":\"EN\"},{\"code\":\"3\",\"language\":\"EN\"},{\"code\":\"4\",\"language\":\"JP\"}]","productNameEn":"112 hours - Arabic Entities Scripted Monologue Smartphone speech dataset","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}
Arabic Entities Scripted Monologue Smartphone speech dataset, covers several domains, including person, phone number, address, alphanumeric sequence, Email, product Model, product serial number, and money entities, mirrors real-world interactions. Transcribed with text content, and other attributes. Our dataset was collected from extensive and diversify speakers, geographically speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
16kHz,16bit,wav,mono channel
Recording environment
quiet indoor environment, normal environment(contains noise that does not affect recognition)
Recording content
Speakers will read and record based on the given texts, with each text containing at least 1 type of specified entity word: person, phone number, address, alphanumeric sequence, Email, product Model, product serial number, and money.
Country
Saudi Arabia(SAU), Egypt(EGY), UAE(ARE)
Language(Region) Code
ar-SA,ar-EG,ar-AE
Language
Arabic
Accuracy
WAR(Word Accuracy Rate) 98% (Punctuation, tags and non-speech annotations are subjective, thus they are excluded from the accuracy statistics.)
Device
Android phone, iPhone
Sample
Audio
هل يمكنني التحقق من أن الخصم كان [MONEY/]عشرة جنيه[/MONEY]، هل تم تطبيق العرض الترويجي بشكل صحيح؟\nهل يمكنني التحقق من أن الخصم كان [MONEY/]10 جنيه[/MONEY]، هل تم تطبيق العرض الترويجي بشكل صحيح؟
Audio
مرحباً، لاحظت لون غير طبيعي على [PROTYP/] شاشة إل جي كونيد ميني LED خمسة وستين بوصة [/PROTYP]، هل يمكن التحقق من ذلك أو تحديد موعد للفحص؟\nمرحباً، لاحظت لون غير طبيعي على [PROTYP/]إل جي شاشة QNED MiniLED مقاس 65 بوصة[/PROTYP]، هل يمكن التحقق من ذلك أو تحديد موعد للفحص؟