[{"@type":"PropertyValue","name":"Format","value":"16k Hz, 16 bit, wav, mono channel"},{"@type":"PropertyValue","name":"Content category","value":"Covering various financial professional terminologies, primarily focuses on macroeconomics, microeconomics"},{"@type":"PropertyValue","name":"Recording condition","value":"Low background noise"},{"@type":"PropertyValue","name":"Country","value":"Japan(JPN)"},{"@type":"PropertyValue","name":"Language(Region) Code","value":"ja-JP"},{"@type":"PropertyValue","name":"Language","value":"Japanese"},{"@type":"PropertyValue","name":"Features of annotation","value":"transcription text, timestamp, speaker identification, gender, noise, sensitive information"},{"@type":"PropertyValue","name":"Accuracy","value":"Character Accuracy Rate (CAR) at least 98%"}]
{"id":2048,"datatype":"1","titleimg":"https://www.nexdata.ai/shujutang/static/image/index/datatang_yuyin_default.webp","type1":"165","type1str":null,"type2":"166","type2str":null,"dataname":"300 Hours Japanese Financial Speech Dataset – Real-World Conversations for Speech Recognition Models","datazy":[{"title":"Format","content":"16k Hz, 16 bit, wav, mono channel"},{"title":"Content category","content":"Covering various financial professional terminologies, primarily focuses on macroeconomics, microeconomics"},{"title":"Recording condition","content":"Low background noise"},{"title":"Country","content":"Japan(JPN)"},{"title":"Language(Region) Code","content":"ja-JP"},{"title":"Language","content":"Japanese"},{"title":"Features of annotation","content":"transcription text, timestamp, speaker identification, gender, noise, sensitive information"},{"title":"Accuracy","content":"Character Accuracy Rate (CAR) at least 98%"}],"datatag":"Japanese,Financial,Casual Conversation,Monologue","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":null,"samplePresentation":[{"name":"000012_4.wav","url":"https://storage-product.datatang.com/damp/product/sample_presentation/20260402165416/000012_4.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=PogHZtu0zq2KjlgzInzmnJdPWOc%3D","intro":"一方で今日も引き続きバリューや中小型株優位で、セクターローテーションの一環とも言えるのか、","size":200330,"progress":100,"type":"mp3"},{"name":"000012_11.wav","url":"https://storage-product.datatang.com/damp/product/sample_presentation/20260402165416/000012_11.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=1DTXLErsV4NW%2FoZniqM2W38PoDM%3D","intro":"決算が物足りないと受け止められた、アドバンストマイクロデバイスが急落していました。","size":164192,"progress":100,"type":"mp3"},{"name":"000011_7.wav","url":"https://storage-product.datatang.com/damp/product/sample_presentation/20260402165416/000011_7.wav?Expires=4102415999&OSSAccessKeyId=LTAI5tEBeSWUJiqjXvBMsxEu&Signature=Mm5nc8PecIdGi253irVV7lExwpU%3D","intro":"えまず、資料は、ええ最初のデータとしてこういうのを持ってきました。過去二十回取ってきました。","size":152612,"progress":100,"type":"mp3"}],"officialSummary":"300 hours of Japanese financial speech dataset featuring real-world conversations and monologue speech in finance-related scenarios. The dataset includes rich transcripts and speaker metadata such as speaker ID and gender, and other attributes. Our datasets draw from a geographically diverse group of speakers, enhancing the model's performance on real-world, complex tasks. The datasets have passed quality testing by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":null,"datakeyword":["japanese financial speech dataset","financial speech dataset","japanese japanese ASR financial dataset","financial domain speech dataset","labeled financial speech dataset"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Data Type,Language","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","dataShowType":"[{\"code\":\"0\",\"language\":\"ZH\"},{\"code\":\"1\",\"language\":\"ZH\"},{\"code\":\"2\",\"language\":\"EN,JP\"},{\"code\":\"3\",\"language\":\"EN\"},{\"code\":\"4\",\"language\":\"JP\"}]","productNameEn":"300 Hours - Japanese(Japan) Financial Real-world Casual Conversation and Monologue speech dataset","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}
300 Hours Japanese Financial Speech Dataset – Real-World Conversations for Speech Recognition Models
japanese financial speech dataset
financial speech dataset
japanese japanese ASR financial dataset
financial domain speech dataset
labeled financial speech dataset
300 hours of Japanese financial speech dataset featuring real-world conversations and monologue speech in finance-related scenarios. The dataset includes rich transcripts and speaker metadata such as speaker ID and gender, and other attributes. Our datasets draw from a geographically diverse group of speakers, enhancing the model's performance on real-world, complex tasks. The datasets have passed quality testing by multiple AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
16k Hz, 16 bit, wav, mono channel
Content category
Covering various financial professional terminologies, primarily focuses on macroeconomics, microeconomics
Recording condition
Low background noise
Country
Japan(JPN)
Language(Region) Code
ja-JP
Language
Japanese
Features of annotation
transcription text, timestamp, speaker identification, gender, noise, sensitive information