217 Hours Spanish Financial Speech Dataset with Financial Entity Annotation
spanish financial speech dataset
spanish speech dataset
spanish banking dataset
conversational ai dataset
rag dataset for finance
This Spanish financial speech dataset covers a wide range of financial terminology, with a particular focus on macroeconomics and microeconomics. The dataset reflects authentic real-world interactions and includes high-quality audio recordings, transcriptions, speaker IDs, gender information, financial entity annotations, and other relevant metadata. The data was collected from speakers with diverse geographical and personal backgrounds, helping to enhance model performance in complex, real-world tasks. The dataset has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and use. Our datasets all comply with GDPR, CCPA, and PIPL.
This is a paid dataset for commercial use, research purposes, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Format: 16 kHz, 16-bit, WAV, mono
Content category: Financial terminology, with a primary focus on macroeconomics (market trends, financial policies, etc.) and microeconomics (individual enterprises, stocks, investment portfolios, etc.)
Recording condition: Low background noise
Country: Spain (ESP), Latin American countries
Language (region) code: es-ES, etc.
Language: Spanish
Features of annotation: transcription text, timestamps, speaker identification, gender, noise, PII redaction, entities, letter case
Accuracy: Word Accuracy Rate (WAR) of at least 98% (tags and entities are excluded from the accuracy statistics due to subjectivity)
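The format and accuracy specifications above can be spot-checked on delivered files. The following is a minimal Python sketch, not the vendor's tooling: the function names are hypothetical, and WAR is assumed here to mean 1 minus the word error rate (word-level edit distance divided by the number of reference words), a definition this page does not spell out.

```python
import wave


def matches_spec(path, rate=16000, sample_width_bytes=2, channels=1):
    """Return True if the WAV file at `path` is 16 kHz, 16-bit, mono."""
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == rate
            and wav.getsampwidth() == sample_width_bytes
            and wav.getnchannels() == channels
        )


def word_edit_distance(reference, hypothesis):
    """Levenshtein distance between two lists of words."""
    previous = list(range(len(hypothesis) + 1))
    for i, ref_word in enumerate(reference, start=1):
        current = [i]
        for j, hyp_word in enumerate(hypothesis, start=1):
            # Cost of matching or substituting the two current words.
            substitution = previous[j - 1] + (ref_word != hyp_word)
            current.append(min(previous[j] + 1, current[j - 1] + 1, substitution))
        previous = current
    return previous[-1]


def word_accuracy_rate(reference_text, hypothesis_text):
    """WAR, assumed to be 1 - (word edit distance / reference word count)."""
    reference = reference_text.split()
    hypothesis = hypothesis_text.split()
    if not reference:
        raise ValueError("reference transcript is empty")
    return 1.0 - word_edit_distance(reference, hypothesis) / len(reference)
```

For example, `word_accuracy_rate("toma control del futuro", "toma control de futuro")` counts one substitution over four reference words. Since tags and entities are excluded from the stated accuracy figure, they would need to be removed from both transcripts before a comparison like this.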
Sample
no podía yo ni siquiera concebir el impacto que Cracks iba a tener en mi vida y en la de tantas personas como tú.
en en no estar en control y apenas tomas control todo empieza a.[N]
Toma control, si tú quieres que el futuro sea diferente sal a construirlo, ¿no? Sa- sal a hacer que sea diferente.[N]
Una iniciativa superinteresante de la que os comentaremos más en un momento. Pero por ahora y para arrancar pongámonos en situación.[N]
Este vídeo ha sido posible gracias a Microwd. Con su apoyo, Microwd nos está permitiendo preparar en VisualEconomik una serie de vídeos sobre América Latina.[N]
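Several of the sample transcripts above end with an inline `[N]` marker. The sketch below assumes that `[N]` is the noise tag listed under the annotation features, which this page does not state explicitly, and shows one way to separate such tags from the spoken text before training or scoring. The function name `split_transcript` is hypothetical.

```python
import re

# Assumption: "[N]" marks a noise event in the transcript.
NOISE_TAG = re.compile(r"\[N\]")


def split_transcript(line):
    """Return (text with noise tags removed, number of noise tags found)."""
    tag_count = len(NOISE_TAG.findall(line))
    text = NOISE_TAG.sub(" ", line)
    # Collapse whitespace left behind by the removed tags.
    text = " ".join(text.split())
    return text, tag_count
```

Disfluencies such as repeated words or the cut-off "Sa-" are left untouched, since they appear to be part of the verbatim transcription rather than tags.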