[{"@type":"PropertyValue","name":"Format","value":"48,000Hz, 24bit, uncompressed wav, mono channel;"},{"@type":"PropertyValue","name":"Recording environment","value":"professional recording studio;"},{"@type":"PropertyValue","name":"Recording content","value":"7,400 sentences of news and dialogue text, the syllables, phonemes and tones are balanced;"},{"@type":"PropertyValue","name":"Speaker","value":"female, 20-30 years old, sweet voice;"},{"@type":"PropertyValue","name":"Device","value":"microphone;"},{"@type":"PropertyValue","name":"Language","value":"Japanese"},{"@type":"PropertyValue","name":"Annotation","value":"word transcription;"},{"@type":"PropertyValue","name":"Application scenarios","value":"speech synthesis."}]
{"id":1165,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY220429001.png?Expires=2007353707&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=QWTqVf%2BqtLmUcUmSmDlvIBrM0B4%3D","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"10.4 Hours - Japanese Synthesis Corpus-Female","datazy":[{"title":"Format","value":"48,000Hz, 24bit, uncompressed wav, mono channel;"},{"title":"Recording environment","value":"professional recording studio;"},{"title":"Recording content","value":"7,400 sentences of news and dialogue text, the syllables, phonemes and tones are balanced;"},{"title":"Speaker","value":"female, 20-30 years old, sweet voice;"},{"title":"Device","value":"microphone;"},{"title":"Language","value":"Japanese"},{"title":"Annotation","value":"word transcription;"},{"title":"Application scenarios","value":"speech synthesis."}],"datatag":"Japanese,TTS,Female","technologydoc":null,"downurl":null,"datainfo":"","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220429001_demo1695809021473/APY220429001_demo/100003.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=6y5kYbNIGDMbdkMpsdieNCmaFaM%3D","/data/apps/damp/temp/ziptemp/APY220429001_demo1695809021473/APY220429001_demo/100003.wav","銀行に預けてれば、とりあえず安全ですから。"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220429001_demo1695809021473/APY220429001_demo/100001.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=L2VNPLKJe9NgB1JA1WGCj6eYkHM%3D","/data/apps/damp/temp/ziptemp/APY220429001_demo1695809021473/APY220429001_demo/100001.wav","さっきから入り口の方ちらちら見てっからさ。"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220429001_demo1695809021473/APY220429001_demo/100005.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=OAoPesSIsdfooP7r2azVkQYgK88%3D","/data/apps/damp/temp/ziptemp/APY220429001_demo1695809021473/APY220429001_demo/100005.wav","どこにいてもいいから元気でいてほしい。"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220429001_demo1695809021473/APY220429001_demo/100002.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=36hc92sN3w8SO4CJyZ59kF%2F0dsg%3D","/data/apps/damp/temp/ziptemp/APY220429001_demo1695809021473/APY220429001_demo/100002.wav","こんなにまだ自分に何にもないと思わなかったな。"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220429001_demo1695809021473/APY220429001_demo/100004.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=sNrxQ1S%2Bhg3q3xAAZhv%2BkAqY%2BLs%3D","/data/apps/damp/temp/ziptemp/APY220429001_demo1695809021473/APY220429001_demo/100004.wav","そんなやり方じゃどんなやつだってつぶれますよ。"]],"officialSummary":"10.4 Hours - Japanese Synthesis Corpus-Female. It is recorded by Japanese native speaker, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.","dataexampl":"","datakeyword":["tts","japan","female"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Voice Type,Language","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechSyn","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no"}
10.4 Hours - Japanese Synthesis Corpus-Female. It is recorded by Japanese native speaker, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
48,000Hz, 24bit, uncompressed wav, mono channel;
Recording environment
professional recording studio;
Recording content
7,400 sentences of news and dialogue text, the syllables, phonemes and tones are balanced;
Speaker
female, 20-30 years old, sweet voice;
Device
microphone;
Language
Japanese
Annotation
word transcription;
Application scenarios
speech synthesis.
Sample
Audio
銀行に預けてれば、とりあえず安全ですから。
Audio
さっきから入り口の方ちらちら見てっからさ。
Audio
どこにいてもいいから元気でいてほしい。
Audio
こんなにまだ自分に何にもないと思わなかったな。
Audio
そんなやり方じゃどんなやつだってつぶれますよ。
Recommended Dataset
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus. It is recorded by Northeast native. About 40% of the corpus contains words unique to Northeast China, the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
2 People - Japanese Average Tone Speech Synthesis Corpus
2 People - Japanese Average Tone Speech Synthesis Corpus. It is recorded by rn native Japan, with authentic accent. Contains news and colloquial style general corpus,the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSJapaneseAverage Tone
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female. It is recorded by Chaozhou-Shantou Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Synthesis CorpusTTSFemaleGeneralChaozhouDialect
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Mexican, with authentic accent, Covering both customer service and general styles. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSMexicanSpanishAverage Tone
2 People - Spanish Average Tone Speech Synthesis Corpus
2 People - Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Spaniard, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSSpanishAverage Tone
2 People - New Zealand English Average Tone Speech Synthesis Corpus
2 People - New Zealand English Average Tone Speech Synthesis Corpus. It is recorded by rn native New Zealanders, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSNew Zealand EnglishAverage Tone
20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female
20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female. It is recorded by Chengdu Sichuan Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Synthesis CorpusTTSFemaleGeneralSichuanDialect
10 People - British English Average Tone Speech Synthesis Corpus
10 People - British English Average Tone Speech Synthesis Corpus. It is recorded by British English native speakers, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.