en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

m.nexdata.datatang.com

Korean Financial Speech Dataset – 215 Hours of Real-World Audio

Korean financial speech dataset
Korean ASR dataset
economics audio corpus
financial audio dataset
Korean business voice data
macroeconomic speech dataset
finance chatbot training data
domain-specific speech dataset
Korean language audio for AI

This Korean Financial Speech Dataset contains 215 hours of real-world audio, including casual conversations and monologues. The content spans professional financial terminology in macroeconomics and microeconomics contexts, simulating authentic banking and financial service interactions. Each recording includes transcriptions, speaker metadata (ID, gender), and tagged financial entities. The dataset supports a wide range of AI applications such as automatic speech recognition (ASR), financial natural language understanding (NLU), voicebot development, and domain-specific language modeling. All data complies with GDPR, CCPA, and PIPL regulations, ensuring privacy and ethical usage.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
16k Hz, 16 bit, wav, mono channel
Content category
Covering various financial professional terminologies, primarily focuses on macroeconomics(market trends, financial policies, etc.), microeconomics(individual enterprises, stocks, investment portfolios, etc.)
Recording condition
Low background noise
Country
Korea(KOR)
Language(Region) Code
ko-KR
Language
Korean
Features of annotation
transcription text, timestamp, speaker identification, gender, noise, PII redacted, entities, letter case
Accuracy
Word Accuracy Rate (WAR) at least 98%(excluding tags and entities)
Sample Sample
  • Audio

    지난 주에 또 시끄러웠던 게 오염수가 아니라 이제 또 홍범도 장군

  • Audio

    얘기가 진짜 많이 [OVERLAP/]나왔었는데[/OVERLAP]

  • Audio

    예 어쨌든 간에 또 일요일이 돌아왔고

  • Audio

    반갑습니다.

  • Audio

    얘기가 나오다 보니까

Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

fa4a4c38-17a5-467b-b065-192cadceb4d3

e5be6673-23c1-4f73-bbf2-8dbdc0f2fc6c