601 Hours Argentine Spanish Speech Dataset with Transcriptions for AI Training

Keywords: spanish asr dataset, spanish speech corpus, argentinian spanish dataset, argentina speech dataset, argentine spanish dataset
Tags: Spanish, Casual Conversation, ASR, Argentina

Specifications
Format: 16 kHz, 16-bit, WAV, mono channel
Recording condition: Low background noise
Country: Argentina (ARG), etc.
Language (region) code: es-AR, etc.
Language: Spanish (Argentina), etc.
Features of annotation: Transcription text, timestamp, speaker ID, gender, noise
Accuracy rate: Word Accuracy Rate (WAR) 98%

Sample transcriptions
000607_1.wav: Y siempre me arrepentí de la pregunta.
003190_14.wav: No [OVERLAP/] cuando [/OVERLAP] tenías más años. [N]
002201_10.wav: Eh veo el pelo de las chicas en la calle, y, y nada, me pongo a pensar, a esta le haría un botox. [N]
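
The listed format (16 kHz, 16-bit, mono WAV) and the bracketed tags in the sample transcriptions ([N], [OVERLAP/], [/OVERLAP]) suggest two simple checks before using the data for training. The following is a minimal Python sketch, not the vendor's tooling: the function names are hypothetical, and treating every bracketed uppercase token as an annotation tag is an assumption based only on the three samples shown here.

```python
import re
import wave

# Taken from the listing: 16 kHz sample rate, 16-bit samples, one channel.
EXPECTED_RATE_HZ = 16000
EXPECTED_SAMPLE_WIDTH_BYTES = 2  # 16 bits
EXPECTED_CHANNELS = 1

# Tags seen in the sample transcriptions: [N], [OVERLAP/], [/OVERLAP].
# Assumption: any bracketed uppercase token is an annotation tag.
TAG_PATTERN = re.compile(r"\[/?[A-Z]+/?\]")


def matches_listed_format(path):
    """Return True if the WAV header matches the listed format."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
        )


def strip_tags(transcript):
    """Remove annotation tags and collapse the whitespace left behind."""
    without_tags = TAG_PATTERN.sub(" ", transcript)
    return " ".join(without_tags.split())
```

For example, strip_tags("No [OVERLAP/] cuando [/OVERLAP] tenías más años. [N]") returns "No cuando tenías más años.". Note that wave.open raises wave.Error when a file is not an uncompressed WAV, so files in another encoding need a different reader.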
This Argentine Spanish speech dataset features real-world casual conversations and monologues that reflect authentic everyday interactions. It includes high-quality audio recordings with transcriptions, speaker IDs, gender information, and other relevant metadata. The data was collected from speakers with diverse geographical and background profiles, which helps models trained on it perform better in complex real-world tasks, and the dataset has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and usage. All our datasets comply with GDPR, CCPA, and PIPL.
This is a paid dataset for commercial use, research purposes, and more. Licensed ready-made datasets help jump-start AI projects.