[{"@type":"PropertyValue","name":"Format","value":"48,000Hz, 24bit, uncompressed wav, mono channel."},{"@type":"PropertyValue","name":"Recording environment","value":"professional recording studio."},{"@type":"PropertyValue","name":"Recording content","value":"K12 exercises, picture books, supplementary reading materials, greetings, guide to reading, etc."},{"@type":"PropertyValue","name":"Speaker","value":"a Chinese female adult imitating the voice of children aged 7-8, with lively and sweet style."},{"@type":"PropertyValue","name":"Device","value":"microphone."},{"@type":"PropertyValue","name":"Language","value":"Mandarin."},{"@type":"PropertyValue","name":"Annotation","value":"word transcription."},{"@type":"PropertyValue","name":"Application scenarios","value":"speech synthesis."}]
{"id":1091,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY201218001.png?Expires=2007353688&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=Ar5JeQx8qKFSzSrp96D/9uCH9oo%3D","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"6.78 Hours - Chinese Mandarin Speech Synthesis Corpus-Female Imitating Children","datazy":[{"title":"Format","value":"48,000Hz, 24bit, uncompressed wav, mono channel."},{"title":"Recording environment","value":"professional recording studio."},{"title":"Recording content","value":"K12 exercises, picture books, supplementary reading materials, greetings, guide to reading, etc."},{"title":"Speaker","value":"a Chinese female adult imitating the voice of children aged 7-8, with lively and sweet style."},{"title":"Device","value":"microphone."},{"title":"Language","value":"Mandarin."},{"title":"Annotation","value":"word transcription."},{"title":"Application scenarios","value":"speech synthesis."}],"datatag":"TTS,Female,Imitating Children,Children","technologydoc":null,"downurl":null,"datainfo":"","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY201218001_demo1695808998505/APY201218001_demo/005514.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=J8Fv9RA5bcF2lVE9pcFuw9X7JY4%3D","/data/apps/damp/temp/ziptemp/APY201218001_demo1695808998505/APY201218001_demo/005514.wav","马上相逢无纸笔,凭君传语报平安。"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY201218001_demo1695808998505/APY201218001_demo/005842.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=UpIbAgRFP%2BZQnfXuD6A1RNuWnJE%3D","/data/apps/damp/temp/ziptemp/APY201218001_demo1695808998505/APY201218001_demo/005842.wav","你可真是个少有的好人啊!"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY201218001_demo1695808998505/APY201218001_demo/005614.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=jlvt%2B3IcuC0g4zJDy9j3Kgw34Zw%3D","/data/apps/damp/temp/ziptemp/APY201218001_demo1695808998505/APY201218001_demo/005614.wav","十四减括号一加六括号等于几?"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY201218001_demo1695808998505/APY201218001_demo/000001.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=Kcm0xqzm3%2BMt%2BcYdAquCiGszb%2F8%3D","/data/apps/damp/temp/ziptemp/APY201218001_demo1695808998505/APY201218001_demo/000001.wav","冰心,现代女作家、儿童文学家。"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY201218001_demo1695808998505/APY201218001_demo/006586.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=u3dHIvJnZWfUckk%2FKBowbGWjaec%3D","/data/apps/damp/temp/ziptemp/APY201218001_demo1695808998505/APY201218001_demo/006586.wav","对这么熟的人,我怎能不拿他当作个好朋友呢?"]],"officialSummary":"Female audio data of adults imitating children, 6599 sentences in total and 6.78 hours. It is recorded by Chinese native speakers, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.","dataexampl":"","datakeyword":["TTS","Chinese","Children"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Voice Type,Language","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechSyn","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no"}
6.78 Hours - Chinese Mandarin Speech Synthesis Corpus-Female Imitating Children
TTS
Chinese
Children
Female audio data of adults imitating children, 6599 sentences in total and 6.78 hours. It is recorded by Chinese native speakers, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
48,000Hz, 24bit, uncompressed wav, mono channel.
Recording environment
professional recording studio.
Recording content
K12 exercises, picture books, supplementary reading materials, greetings, guide to reading, etc.
Speaker
a Chinese female adult imitating the voice of children aged 7-8, with lively and sweet style.
Device
microphone.
Language
Mandarin.
Annotation
word transcription.
Application scenarios
speech synthesis.
Sample
Audio
马上相逢无纸笔,凭君传语报平安。
Audio
你可真是个少有的好人啊!
Audio
十四减括号一加六括号等于几?
Audio
冰心,现代女作家、儿童文学家。
Audio
对这么熟的人,我怎能不拿他当作个好朋友呢?
Recommended Dataset
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus. It is recorded by Northeast native. About 40% of the corpus contains words unique to Northeast China, the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
2 People - Japanese Average Tone Speech Synthesis Corpus
2 People - Japanese Average Tone Speech Synthesis Corpus. It is recorded by rn native Japan, with authentic accent. Contains news and colloquial style general corpus,the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSJapaneseAverage Tone
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female. It is recorded by Chaozhou-Shantou Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Synthesis CorpusTTSFemaleGeneralChaozhouDialect
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus
2 People - Mexican Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Mexican, with authentic accent, Covering both customer service and general styles. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSMexicanSpanishAverage Tone
2 People - Spanish Average Tone Speech Synthesis Corpus
2 People - Spanish Average Tone Speech Synthesis Corpus. It is recorded by rn native Spaniard, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSSpanishAverage Tone
2 People - New Zealand English Average Tone Speech Synthesis Corpus
2 People - New Zealand English Average Tone Speech Synthesis Corpus. It is recorded by rn native New Zealanders, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
TTSNew Zealand EnglishAverage Tone
20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female
20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female. It is recorded by Chengdu Sichuan Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.
Synthesis CorpusTTSFemaleGeneralSichuanDialect
10 People - British English Average Tone Speech Synthesis Corpus
10 People - British English Average Tone Speech Synthesis Corpus. It is recorded by British English native speakers, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.