[{"@type":"PropertyValue","name":"Format","value":"16k Hz, 16 bit, wav, mono channel"},{"@type":"PropertyValue","name":"Content category","value":"Covering various financial professional terminologies, primarily focuses on macroeconomics, microeconomics"},{"@type":"PropertyValue","name":"Recording condition","value":"Low background noise"},{"@type":"PropertyValue","name":"Country","value":"Spain(ESP)"},{"@type":"PropertyValue","name":"Language(Region) Code","value":"es-ES"},{"@type":"PropertyValue","name":"Language","value":"Spanish"},{"@type":"PropertyValue","name":"Features of annotation","value":"transcription text, timestamp, speaker identification, gender, noise, sensitive information"},{"@type":"PropertyValue","name":"Accuracy","value":"Word Accuracy Rate (WAR) at least 98%"}]
{"id":2050,"datatype":"1","titleimg":"https://www.nexdata.ai/shujutang/static/image/index/datatang_yuyin_default.webp","type1":"165","type1str":null,"type2":"166","type2str":null,"dataname":"300 Hours Spanish Financial Speech Dataset – Banking Audio for ASR, Voice AI and LLM Training","datazy":[{"title":"Format","content":"16k Hz, 16 bit, wav, mono channel"},{"title":"Content category","content":"Covering various financial professional terminologies, primarily focuses on macroeconomics, microeconomics"},{"title":"Recording condition","content":"Low background noise"},{"title":"Country","content":"Spain(ESP)"},{"title":"Language(Region) Code","content":"es-ES"},{"title":"Language","content":"Spanish"},{"title":"Features of annotation","content":"transcription text, timestamp, speaker identification, gender, noise, sensitive information"},{"title":"Accuracy","content":"Word Accuracy Rate (WAR) at least 98%"}],"datatag":"Spanish,Financial,Casual Conversation,Monologue","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":null,"samplePresentation":[{"name":"000002_7.wav","url":"https://storage-product.datatang.com/damp/product/sample_presentation/20260402170906/000002_7.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=GOpZhQNRqNhGZhuTqcAfAjKstnU%3D","intro":"¿Cómo declaramos las criptomonedas y las donancias que hemos tenido? [N]","size":143154,"progress":100,"type":"mp3"},{"name":"000003_13.wav","url":"https://storage-product.datatang.com/damp/product/sample_presentation/20260402170906/000003_13.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=lplOYPNLtWySDC09oDi3y9YzQVE%3D","intro":"Oye, pues soy empresario, hago esto, me dedico a esto, y entonces te dice cómo lo tienes que declarar,","size":155286,"progress":100,"type":"mp3"},{"name":"000004_19.wav","url":"https://storage-product.datatang.com/damp/product/sample_presentation/20260402170906/000004_19.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=2nW1jeEtU2YL78o97HvfsPk4lbc%3D","intro":"¿Cuál es el camino hacia esa libertad financiera? [N]","size":52214,"progress":100,"type":"mp3"}],"officialSummary":"This dataset encompasses a wide range of specialized financial terminology and authentically reflects real-world interactions. It includes transcripts, speaker IDs, gender information, and other attributes. Collected from a geographically and demographically diverse group of speakers, the dataset helps improve model performance in complex, real-world tasks and has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":null,"datakeyword":["spanish speech dataset","spanish financial speech dataset","spanish banking dataset","rag dataset for finance","financial conversational dataset"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Data Type,Language","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","dataShowType":"[{\"code\":\"0\",\"language\":\"ZH\"},{\"code\":\"1\",\"language\":\"ZH\"},{\"code\":\"2\",\"language\":\"EN\"},{\"code\":\"3\",\"language\":\"EN\"},{\"code\":\"4\",\"language\":\"JP\"}]","productNameEn":"300 Hours - Spanish(Spain) Financial Real-world Casual Conversation and Monologue speech dataset","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}
300 Hours Spanish Financial Speech Dataset – Banking Audio for ASR, Voice AI and LLM Training
spanish speech dataset
spanish financial speech dataset
spanish banking dataset
rag dataset for finance
financial conversational dataset
This dataset encompasses a wide range of specialized financial terminology and authentically reflects real-world interactions. It includes transcripts, speaker IDs, gender information, and other attributes. Collected from a geographically and demographically diverse group of speakers, the dataset helps improve model performance in complex, real-world tasks and has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
16k Hz, 16 bit, wav, mono channel
Content category
Covering various financial professional terminologies, primarily focuses on macroeconomics, microeconomics
Recording condition
Low background noise
Country
Spain(ESP)
Language(Region) Code
es-ES
Language
Spanish
Features of annotation
transcription text, timestamp, speaker identification, gender, noise, sensitive information
Accuracy
Word Accuracy Rate (WAR) at least 98%
Sample
Audio
¿Cómo declaramos las criptomonedas y las donancias que hemos tenido? [N]
Audio
Oye, pues soy empresario, hago esto, me dedico a esto, y entonces te dice cómo lo tienes que declarar,
Audio
¿Cuál es el camino hacia esa libertad financiera? [N]