{"id":1411,"datatype":"1","titleimg":"https://www.nexdata.ai/shujutang/static/image/index/datatang_yuyin_default.webp","type1":"165","type1str":null,"type2":"219","type2str":null,"dataname":"20 Hours Japanese TTS Dataset – Native Japanese Voice Corpus","datazy":[{"title":"Format","content":"48,000Hz, 24bit, uncompressed wav, mono channel;","desc":"Format"},{"title":"Recording environment","content":"professional recording studio;","desc":"Recording environment"},{"title":"Recording content","content":"contains news and general corpus;","desc":"Recording content"},{"title":"Speaker","content":"professional voice actor, one male and one female, aged 25-35, 10 hours per person;","desc":"Speaker"},{"title":"Annotation","content":"word and phoneme transcription, four-level prosodic boundary annotation;","desc":"Annotation"},{"title":"Device","content":"microphone;","desc":"Device"},{"title":"Language","content":"Japanese","desc":"Language"},{"title":"Application scenarios","content":"speech synthesis.","desc":"Application scenarios"}],"datatag":"Japanese,Tts,Average Tone","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":null,"samplePresentation":[{"name":"/data/apps/damp/temp/ziptemp/APY240504001_demo1717495200209/APY240504001_demo/000007.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240504001_demo1717495200209/APY240504001_demo/000007.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=B9%2FlCe9cQv3dpyBbp3Xa7jlwIMw%3D","intro":"あなた達#3、市役所の人#4！a(L) . n a(H) . t a(L) # t a(L) . ch i(L) / sh i(L) . ya(H) . k u(H) . s yo(H) # n o(H) # h i(L) . t o(H)","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY240504001_demo1717495200209/APY240504001_demo/000005.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240504001_demo1717495200209/APY240504001_demo/000005.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=b5hCLTmjOhmm1LT%2BravL3gnDM8s%3D","intro":"何か#3、お経みてえな#1歌だな#4。n a(H) . N(L) . k a(L) / o(L) . k yo:(HH) # m i(H) . t e:(LL) # n a(H) / u(L) . t a(H) # d a(L) # n a(L)","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY240504001_demo1717495200209/APY240504001_demo/000003.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240504001_demo1717495200209/APY240504001_demo/000003.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=ptvZWPSR06IvznsBDC7p34CJazs%3D","intro":"この人の#1遺品の#1中から#3、父の#1手帳は#1見つかったの#4。k o(L) . n o(H) # h i(L) . t o(L) # n o(L) / i(L) . h i(H) . N(H) # n o(H) / n a(H) . k a(L) # k a(L) . r a(L) / ch i(L) . ch i(H) # n o(L) / t e(L) . c yo:(HH) # w a(H) / m i(L) . ts u(H) . k a(H) . T(H) . t a(H) # n o(L)","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY240504001_demo1717495200209/APY240504001_demo/000002.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240504001_demo1717495200209/APY240504001_demo/000002.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=2ZcgDHbn7yGxy2713ps%2BB0GM0OY%3D","intro":"はい#3、こちら#3、お願いします#4。h a(H) . i(L) / k o(L) . ch i(H) . r a(H) / o(L) # n e(H) . g a(H) . i(H) # sh i(L) . m a(H) . s u(L)","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY240504001_demo1717495200209/APY240504001_demo/000001.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY240504001_demo1717495200209/APY240504001_demo/000001.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=Ndv2gvsnX2%2FfO9cWnLD1poVyTas%3D","intro":"","size":0,"progress":100,"type":"mp3"}],"officialSummary":"This dataset contains recordings from 2 native Japanese speakers with authentic accents, each person contribute 10 hours of audio. Contains news and colloquial style general corpus, the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of building Japanese text-to-speech systems, speech synthesis research, and AI voice applications.","dataexampl":null,"datakeyword":["Japanese speech dataset","Japanese TTS dataset","Japanese speech synthesis corpus","Japanese voice dataset for AI","native Japanese speech dataset","Japanese text-to-speech dataset","balanced phoneme Japanese corpus"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Voice Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechSyn","dataShowType":"[{\"code\":\"0\",\"language\":\"ZH\"},{\"code\":\"1\",\"language\":\"ZH\"},{\"code\":\"2\",\"language\":\"EN,JP,PT,DE,KO,FR,ES\"},{\"code\":\"3\",\"language\":\"EN\"}]","productNameEn":"2 People - Japanese Average Tone Speech Synthesis Corpus","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

m.nexdata.datatang.com

Home > All Category Datasets > Speech Synthesis Datasets > 20 Hours Japanese TTS Dataset – Native Japanese Voice Corpus

20 Hours Japanese TTS Dataset – Native Japanese Voice Corpus

Japanese speech dataset

Japanese TTS dataset

Japanese speech synthesis corpus

Japanese voice dataset for AI

native Japanese speech dataset

Japanese text-to-speech dataset

balanced phoneme Japanese corpus

This dataset contains recordings from 2 native Japanese speakers with authentic accents, each person contribute 10 hours of audio. Contains news and colloquial style general corpus, the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of building Japanese text-to-speech systems, speech synthesis research, and AI voice applications.

This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.

Specifications

Format

48,000Hz, 24bit, uncompressed wav, mono channel;

Recording environment

professional recording studio;

Recording content

contains news and general corpus;

Speaker

professional voice actor, one male and one female, aged 25-35, 10 hours per person;

Annotation

word and phoneme transcription, four-level prosodic boundary annotation;

Device

microphone;

Language

Japanese

Application scenarios

speech synthesis.

Sample

Audio
あなた達#3、市役所の人#4！a(L) . n a(H) . t a(L) # t a(L) . ch i(L) / sh i(L) . ya(H) . k u(H) . s yo(H) # n o(H) # h i(L) . t o(H)
Audio
何か#3、お経みてえな#1歌だな#4。n a(H) . N(L) . k a(L) / o(L) . k yo:(HH) # m i(H) . t e:(LL) # n a(H) / u(L) . t a(H) # d a(L) # n a(L)
Audio
この人の#1遺品の#1中から#3、父の#1手帳は#1見つかったの#4。k o(L) . n o(H) # h i(L) . t o(L) # n o(L) / i(L) . h i(H) . N(H) # n o(H) / n a(H) . k a(L) # k a(L) . r a(L) / ch i(L) . ch i(H) # n o(L) / t e(L) . c yo:(HH) # w a(H) / m i(L) . ts u(H) . k a(H) . T(H) . t a(H) # n o(L)
Audio
はい#3、こちら#3、お願いします#4。h a(H) . i(L) / k o(L) . ch i(H) . r a(H) / o(L) # n e(H) . g a(H) . i(H) # sh i(L) . m a(H) . s u(L)
Audio

Recommended Dataset

10.4 Hours – Japanese Female Voice TTS Dataset

This dataset contains 10.4 hours of Japanese female voice recordings. It is recorded by Japanese native speaker with an authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. This corpus is ideal for tasks such as Japanese text-to-speech (TTS) training, speech synthesis research, and AI voice model development.

Japanese speech synthesis dataset Japanese tts dataset Japanese text-to-speech dataset female female japanese tts dataset

20 Hours - American English Male Voice TTS Dataset

This dataset contains 20 hours of American English male voice recordings. It is recorded by Americans (native English speakers) with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It is suitable for text-to-speech (TTS) model training, phoneme recognition research, and AI voice development.

TTS english dataset speech synthesis dataset TTS male voice dataset male voice dataset for tts American English speech synthesis dataset

19.46 Hours - American English Female Voice TTS Dataset

This dataset contains 19.46 hours of American English female voice recordings. It is recorded by American (native English speaker) with authentic accent and clear, sweet tone. The phoneme coverage is balanced. Professional phoneticians participate in the annotation. It is suitable for text-to-speech (TTS) model training, phoneme recognition, and AI voice development requiring natural-sounding female speech.

American English speech synthesis dataset female voice dataset for TTS American English female voice corpus speech synthesis training data female TTS dataset American English female speaker speech synthesis dataset TTS english dataset

8 Hours – Cantonese Speech Dataset for TTS (Hong Kong)

This dataset features recordings from 4 native Hong Kong Cantonese speakers. The corpus contain educational, game and general colloquial content. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Cantonese speech dataset Hong Kong Cantonese speech corpus Cantonese text-to-speech dataset Cantonese voice dataset for AI native Cantonese speech recordings Cantonese TTS dataset Hong Kong accent speech dataset

2 Speakers – Korean TTS Dataset with Native Accent

This dataset contains recordings from 2 native Korean speakers with authentic accent. Contains news and colloquial general corpus, the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development in text-to-speech, Korean speech synthesis, and AI voice applications.

Korean speech dataset Korean TTS dataset Korean speech synthesis corpus Korean voice dataset for AI Korean accent speech corpus Korean text-to-speech dataset Korean speech recordings for TTS

14 Hours Taiwan Mandarin TTS Dataset – Multi-Style Voices

This dataset contains 14 hours of Taiwan Mandarin recordings from 4 professional voice actors with 7 speaking styles. The styles are criminal subordinate, rough man, little girl, kind grandma, businessman, grandfather and non-commissioned officer. Professional phonetician participates in the annotation. It is ideal for text-to-speech (TTS), expressive voice generation, virtual avatars, and AI speech synthesis applications.

Taiwan Mandarin speech dataset Taiwan Mandarin voice dataset Taiwan Mandarin speech corpus for AI Mandarin accent dataset Taiwan Mandarin TTS dataset

6 Speakers – Taiwanese Mandarin Speech Dataset for TTS

This dataset includes recordings from 6 professional voice actors from Taiwan, covering news and colloquial speech. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Taiwanese Mandarin speech dataset Taiwan Mandarin TTS dataset Mandarin speech synthesis corpus native Taiwanese Mandarin corpus

8 Hours - Canadian French TTS Dataset (Native Accent)

This dataset contains recordings from 2 native Canadian French speakers with authentic accents. It is ideal for researchers and developers seeking natural Canadian French voices.

Canadian French TTS dataset Canadian French speech dataset for AI Canadian French accent speech corpus Canadian French text to speech voices Canadian French speech dataset

Tell Us Your Special Needs

Current Project Maturity

Early exploration (no concrete specs yet)

Defined goals, need professional guidance

Active development or optimization phase

Data & labeling experts with clear specifications

Full Name *

Contact Phone No.*

Company name *

Company Email *

Data Requirements *

By submitting, I agree to the Privacy Protection

Submit

Subscribe to our newsletter

Be the first to receive Nexdata latest product releases, data solutions and enterprise news.

Off-the-Shelf Datasets: All Category Datasets; Embodied AI Datasets; LLM Datasets; Computer Vision Datasets; Speech Recognition Datasets; Speech Synthesis Datasets; OCR Datasets; Pronunciation Dictionary; NLU Datasets

Data Service: 3D Point Cloud Data; Street View Data; OCR Data; Behavior Recognition Data; Identity Recognition Data; Speech Recognition Data; Speech Synthesis Data; Multimodal Data

Industries: Embodied AI; Generative AI; Autonomous Vehicles; AR/VR; Conversational AI; Smart Home; Retail; Intelligent Healthcare

Company: About Us; News; Partners; Quality & Security; Event
Links: OPENMPD; DataPlus; Datarade

Platform: Platform
Competition: Competition
Resources: Sponsored Datasets

Sharpen Your AI with Better Data

+1(626)594-5598

[email protected]

Sitemap Terms and Conditions

We use cookies to enhance your browsing experience, serve personalized ads or content, and analyze our traffic. By clicking "Accept All", you consent to our use of cookies.

c411c6d2-ea77-48a5-868c-7f44e03094b9

6170c224-a951-41e5-bfde-1407505b8664

20 Hours Japanese TTS Dataset – Native Japanese Voice Corpus

Japanese speech dataset Japanese TTS dataset Japanese speech synthesis corpus Japanese voice dataset for AI native Japanese speech dataset Japanese text-to-speech dataset balanced phoneme Japanese corpus

Current Project Maturity

Japanese speech dataset

Japanese TTS dataset

Japanese speech synthesis corpus

Japanese voice dataset for AI

native Japanese speech dataset

Japanese text-to-speech dataset

balanced phoneme Japanese corpus