2 People - Japanese Average Tone Speech Synthesis Corpus
TTS
Japanese
Average Tone
This corpus was recorded by native Japanese speakers with authentic accents. It contains news and colloquial-style general text, and its phoneme coverage is balanced. Professional phoneticians took part in the annotation. It matches the research and development needs of speech synthesis.
This is a paid dataset for commercial use, research purposes, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Format: 48,000 Hz, 24-bit, uncompressed WAV, mono channel
Recording environment: professional recording studio
Recording content: news and general corpus
Speaker: professional voice actors, one male and one female, aged 25-35, 10 hours per person
Annotation: word and phoneme transcription, four-level prosodic boundary annotation
Device: microphone
Language: Japanese
Application scenarios: speech synthesis
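Once files are delivered, the Format row can be checked against each WAV header. The following is a minimal sketch using Python's standard `wave` module. The names `WavSpec` and `check_wav` are hypothetical helpers introduced for illustration, and the expected values are simply the ones listed in the specifications above.

```python
import wave
from dataclasses import dataclass


@dataclass(frozen=True)
class WavSpec:
    """Expected header values, taken from the Format row (assumed targets)."""
    sample_rate: int = 48_000     # 48,000 Hz
    sample_width_bytes: int = 3   # 24 bit = 3 bytes per sample
    channels: int = 1             # mono


def check_wav(path: str, spec: WavSpec = WavSpec()) -> list[str]:
    """Return a list of mismatches; an empty list means the header matches."""
    problems = []
    with wave.open(path, "rb") as wav:
        if wav.getframerate() != spec.sample_rate:
            problems.append(
                f"sample rate {wav.getframerate()} != {spec.sample_rate}"
            )
        if wav.getsampwidth() != spec.sample_width_bytes:
            problems.append(
                f"sample width {wav.getsampwidth()} bytes "
                f"!= {spec.sample_width_bytes}"
            )
        if wav.getnchannels() != spec.channels:
            problems.append(
                f"channel count {wav.getnchannels()} != {spec.channels}"
            )
    return problems
```

Calling `check_wav("000007.wav")` on a conforming file should return an empty list. The `wave` module only reads header formats it supports and raises `wave.Error` otherwise, so a production check may need a different reader.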
Sample
あなた 達#3、市役所 の 人#4!
a(L) . n a(H) . t a(L) # t a(L) . ch i(L) / sh i(L) . ya(H) . k u(H) . s yo(H) # n o(H) # h i(L) . t o(H)

何か#3、お経 みてえ な#1歌 だ な#4。
n a(H) . N(L) . k a(L) / o(L) . k yo:(HH) # m i(H) . t e:(LL) # n a(H) / u(L) . t a(H) # d a(L) # n a(L)

この 人 の#1遺品 の#1中 から#3、父 の#1手帳 は#1見つかった の#4。
k o(L) . n o(H) # h i(L) . t o(L) # n o(L) / i(L) . h i(H) . N(H) # n o(H) / n a(H) . k a(L) # k a(L) . r a(L) / ch i(L) . ch i(H) # n o(L) / t e(L) . c yo:(HH) # w a(H) / m i(L) . ts u(H) . k a(H) . T(H) . t a(H) # n o(L)

はい#3、こちら#3、お 願い します#4。
h a(H) . i(L) / k o(L) . ch i(H) . r a(H) / o(L) # n e(H) . g a(H) . i(H) # sh i(L) . m a(H) . s u(L)
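The sample annotations above can be read programmatically. The sketch below rests on conventions inferred from these four samples only, not on a published format description: `#N` in the text marks a prosodic boundary of level N, and in the phoneme string ` . ` separates morae, ` # ` separates words, ` / ` separates prosodic phrases, and the parenthesized `H`/`L` label is a pitch mark. The names `Mora`, `parse_phonemes`, and `boundary_levels` are hypothetical, and the text and phoneme parts are passed in as separate strings.

```python
import re
from typing import NamedTuple


class Mora(NamedTuple):
    phonemes: str  # e.g. "n a", "ch i", "k yo:"
    pitch: str     # e.g. "H", "L", "HH" (assumed to be a pitch label)


# One mora looks like "n a(H)" or "k yo:(HH)" in the samples.
MORA_RE = re.compile(r"^(.+?)\(([HL]+)\)$")
# Boundary markers in the text look like "#1", "#3", "#4".
BOUNDARY_RE = re.compile(r"#([1-4])")


def parse_phonemes(line: str) -> list[list[list[Mora]]]:
    """Split a phoneme string into phrases, then words, then morae."""
    phrases = []
    for phrase in line.split("/"):
        words = []
        for word in phrase.split("#"):
            morae = []
            for chunk in word.split("."):
                match = MORA_RE.match(chunk.strip())
                if match is None:
                    raise ValueError(f"unrecognized mora: {chunk!r}")
                morae.append(Mora(match.group(1), match.group(2)))
            words.append(morae)
        phrases.append(words)
    return phrases


def boundary_levels(text: str) -> list[int]:
    """Return the prosodic boundary levels in the order they appear."""
    return [int(level) for level in BOUNDARY_RE.findall(text)]


text = "はい#3、こちら#3、お 願い します#4。"
phones = ("h a(H) . i(L) / k o(L) . ch i(H) . r a(H) / "
          "o(L) # n e(H) . g a(H) . i(H) # sh i(L) . m a(H) . s u(L)")
phrases = parse_phonemes(phones)
levels = boundary_levels(text)
```

In all four samples, the number of `/`-separated phrases equals the number of `#N` markers in the text, so comparing `len(phrases)` with `len(levels)` is a cheap consistency check under these assumptions.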
Recommended Dataset
150 People - Chinese Mandarin Average Tone Speech Synthesis Corpus-Customer Service
Recorded by native Chinese speakers reading customer service text; syllables, phonemes, and tones are balanced. Professional phoneticians took part in the annotation. It matches the research and development needs of speech synthesis.
Mandarin, Customer Service, Synthesis Corpus
20.1 Hours - Chinese Mandarin Synthesis Corpus-Male, Customer Service
Recorded by native Chinese speakers in a magnetic voice. The phoneme coverage is balanced. Professional phoneticians took part in the annotation. It matches the research and development needs of speech synthesis.
TTS, Customer Service, Synthesis Corpus
26.1 Hours - Chinese Mandarin Synthesis Corpus-Female, Customer Service
Recorded by native Chinese speakers in a lively, friendly voice. The phoneme coverage is balanced. Professional phoneticians took part in the annotation. It matches the research and development needs of speech synthesis.
Synthesis Corpus, TTS, Mandarin, Female, Customer Service
6.78 Hours - Chinese Mandarin Speech Synthesis Corpus-Female Imitating Children
Audio of adult female speakers imitating children: 6,599 sentences and 6.78 hours in total. Recorded by native Chinese speakers with authentic accents and a sweet voice. The phoneme coverage is balanced. Professional phoneticians took part in the annotation. It matches the research and development needs of speech synthesis.
TTS, Chinese, Children
19.46 Hours - American English Speech Synthesis Corpus-Female
Female American English audio data. Recorded by a native American English speaker with an authentic accent and a sweet voice. The phoneme coverage is balanced. Professional phoneticians took part in the annotation. It matches the research and development needs of speech synthesis.
TTS, American English, Female
2 People - Korean Average Tone Speech Synthesis Corpus
Recorded by native Korean speakers with authentic accents. It contains news and colloquial-style general text, and its phoneme coverage is balanced. Professional phoneticians took part in the annotation. It matches the research and development needs of speech synthesis.
TTS, Korean, Average Tone
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus
Recorded by native speakers from Northeast China. About 40% of the corpus contains words unique to Northeast China; phonemes and tones are balanced. Professional phoneticians took part in the annotation. It matches the research and development needs of speech synthesis.
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female
Recorded in the Chaozhou-Shantou pronunciation. Phonemes and tones are balanced. Professional phoneticians took part in the annotation. It matches the research and development needs of speech synthesis.