217 Hours Spanish Financial Speech Dataset with Financial Entity Annotation

spanish financial speech dataset
spanish speech dataset
spanish banking dataset
conversational ai dataset
rag dataset for finance

This Spanish financial speech dataset covers a wide range of financial terminology, with a particular focus on macroeconomics and microeconomics. The dataset reflects authentic real-world interactions and includes high-quality audio recordings, transcriptions, speaker IDs, gender information, financial entity annotations, and other relevant metadata. The data was collected from speakers with diverse geographical and personal backgrounds, which helps improve model performance on complex, real-world tasks. The dataset has undergone quality validation by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, protecting user privacy and legal rights throughout data collection, storage, and use. All of our datasets comply with GDPR, CCPA, and PIPL.

Paid Datasets
This is a paid dataset for commercial use, research, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Format: 16 kHz, 16-bit, WAV, mono channel
Content category: Covers a wide range of professional financial terminology, focusing primarily on macroeconomics (market trends, financial policies, etc.) and microeconomics (individual enterprises, stocks, investment portfolios, etc.)
Recording condition: Low background noise
Country: Spain (ESP), Latin countries
Language (Region) Code: es-ES, etc.
Language: Spanish
Features of annotation: transcription text, timestamp, speaker identification, gender, noise, PII redaction, entities, letter case
Accuracy: Word Accuracy Rate (WAR) of at least 98% (tags and entities are excluded from accuracy statistics due to subjectivity)
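
As a rough illustration of how the Format specification above could be checked on delivery, the sketch below uses Python's standard `wave` module to compare a file's header against 16 kHz, 16-bit, mono. The helper names `check_wav_format` and `make_silent_wav` are hypothetical and not part of any vendor tooling. The silent in-memory file exists only to make the example runnable without real dataset audio.

```python
import io
import wave

# Expected values taken from the Format specification: 16 kHz, 16-bit, mono.
EXPECTED = {"sample_rate": 16000, "sample_width_bytes": 2, "channels": 1}


def check_wav_format(source):
    """Return {field: (found, expected)} for every mismatch; empty if the file conforms.

    `source` may be a file path or a binary file-like object.
    """
    with wave.open(source, "rb") as wav:
        found = {
            "sample_rate": wav.getframerate(),
            "sample_width_bytes": wav.getsampwidth(),
            "channels": wav.getnchannels(),
        }
    return {k: (found[k], v) for k, v in EXPECTED.items() if found[k] != v}


def make_silent_wav(sample_rate=16000, sample_width=2, channels=1, seconds=0.1):
    """Build an in-memory WAV of silence (demonstration only, not dataset audio)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(sample_width)
        wav.setframerate(sample_rate)
        n_frames = int(sample_rate * seconds)
        wav.writeframes(b"\x00" * n_frames * sample_width * channels)
    buf.seek(0)
    return buf


if __name__ == "__main__":
    print(check_wav_format(make_silent_wav()))                  # conforming file
    print(check_wav_format(make_silent_wav(sample_rate=8000)))  # wrong sample rate
```

An empty result means the header matches the specification. The check reads only header fields, so it says nothing about recording quality or background noise.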
Sample
  • Audio

    no podía yo ni siquiera concebir el impacto que Cracks iba a tener en mi vida y en la de tantas personas como tú.

  • Audio

    en en no estar en control y apenas tomas control todo empieza a.[N]

  • Audio

    Toma control, si tú quieres que el futuro sea diferente sal a construirlo, ¿no? Sa- sal a hacer que sea diferente.[N]

  • Audio

    Una iniciativa superinteresante de la que os comentaremos más en un momento. Pero por ahora y para arrancar pongámonos en situación.[N]

  • Audio

    Este vídeo ha sido posible gracias a Microwd. Con su apoyo, Microwd nos está permitiendo preparar en VisualEconomik una serie de vídeos sobre América Latina.[N]
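
The sample transcripts above carry bracketed tags such as `[N]`, and the specifications state that tags are excluded from accuracy statistics. The sketch below shows one way a word accuracy figure could be computed with tags stripped. It rests on several assumptions: that tags always take the `[...]` form seen in the samples, that word accuracy is defined as 1 minus the word-level edit distance divided by the reference length (the page does not give the formula), and that punctuation and case are ignored. All function names are hypothetical.

```python
import re

TAG_PATTERN = re.compile(r"\[[^\]]*\]")  # assumed tag syntax, e.g. [N]


def normalize(text):
    """Strip bracketed tags and punctuation, lowercase, and split into words."""
    text = TAG_PATTERN.sub(" ", text)
    text = re.sub(r"[^\w\s-]", " ", text.lower())  # keep hyphens for cut-off words like "Sa-"
    return text.split()


def word_edit_distance(ref, hyp):
    """Levenshtein distance over word lists (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            curr.append(min(prev[j] + 1,              # deletion
                            curr[j - 1] + 1,          # insertion
                            prev[j - 1] + (r != h)))  # substitution or match
        prev = curr
    return prev[-1]


def word_accuracy(reference, hypothesis):
    """Assumed definition: 1 - (word edit distance / number of reference words)."""
    ref, hyp = normalize(reference), normalize(hypothesis)
    if not ref:
        raise ValueError("reference contains no words")
    return 1.0 - word_edit_distance(ref, hyp) / len(ref)


if __name__ == "__main__":
    reference = "en en no estar en control y apenas tomas control todo empieza a.[N]"
    hypothesis = "en en no estar en control y apenas tomas control todo empieza a"
    print(word_accuracy(reference, hypothesis))
```

Because the tag is removed before comparison, the hypothesis without `[N]` matches the reference exactly. A different normalization (for example, keeping punctuation or case, which the annotation features list separately) would give different figures.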
