en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

2 People - Japanese Average Tone Speech Synthesis Corpus

TTS
Japanese
Average Tone

2 People - Japanese Average Tone Speech Synthesis Corpus. It is recorded by rn native Japan, with authentic accent. Contains news and colloquial style general corpus,the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
48,000Hz, 24bit, uncompressed wav, mono channel;
Recording environment
professional recording studio;
Recording content
contains news and general corpus;
Speaker
professional voice actor, one male and one female, aged 25-35, 10 hours per person;
Annotation
word and phoneme transcription, four-level prosodic boundary annotation;
Device
microphone;
Language
Japanese
Application scenarios
speech synthesis.
Sample Sample
  • Audio

    あなた 達#3、市役所 の 人#4!a(L) . n a(H) . t a(L) # t a(L) . ch i(L) / sh i(L) . ya(H) . k u(H) . s yo(H) # n o(H) # h i(L) . t o(H)

  • Audio

    何か#3、お経 みてえ な#1歌 だ な#4。n a(H) . N(L) . k a(L) / o(L) . k yo:(HH) # m i(H) . t e:(LL) # n a(H) / u(L) . t a(H) # d a(L) # n a(L)

  • Audio

    この 人 の#1遺品 の#1中 から#3、父 の#1手帳 は#1見つかった の#4。k o(L) . n o(H) # h i(L) . t o(L) # n o(L) / i(L) . h i(H) . N(H) # n o(H) / n a(H) . k a(L) # k a(L) . r a(L) / ch i(L) . ch i(H) # n o(L) / t e(L) . c yo:(HH) # w a(H) / m i(L) . ts u(H) . k a(H) . T(H) . t a(H) # n o(L)

  • Audio

    はい#3、こちら#3、お 願い します#4。h a(H) . i(L) / k o(L) . ch i(H) . r a(H) / o(L) # n e(H) . g a(H) . i(H) # sh i(L) . m a(H) . s u(L)

  • Audio

Recommended DatasetsRecommended Dataset
150 People - Chinese Mandarin Average Tone Speech Synthesis Corpus-Customer Service

150 People - Chinese Mandarin Average Tone Speech Synthesis Corpus-Customer Service. It is recorded by Chinese native speakers,customer service text, and the syllables, phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Mandarin Customer Service Synthesis Corpus
20.1 Hours - Chinese Mandarin Synthesis Corpus-Male, Customer Service

20 Hours - Chinese Mandarin Synthesis Corpus-Male, Customer Service. It is recorded by Chinese native speakers, the voice of the full of magnetism. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Customer Service Synthesis Corpus
26.1 Hours - Chinese Mandarin Synthesis Corpus-Female, Customer Service

26.1 Hours - Chinese Mandarin Synthesis Corpus-Female, Customer Service, It is recorded by Chinese native speakers, with lively and frindly voice. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Mandarin Female Customer Service
6.78 Hours - Chinese Mandarin Speech Synthesis Corpus-Female Imitating Children

Female audio data of adults imitating children, 6599 sentences in total and 6.78 hours. It is recorded by Chinese native speakers, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Chinese Children
19.46 Hours - American English Speech Synthesis Corpus-Female

Female audio data of American English,. It is recorded by American English native speaker, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS American English Female
2 People - Korean Average Tone Speech Synthesis Corpus

2 People - Korean Average Tone Speech Synthesis Corpus. It is recorded by rnkorean native , with authentic accent. Contains news and colloquial style general corpus,the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Korean Average Tone
4 People - Northeastern dialect Average Tone Speech Synthesis Corpus

4 People - Northeastern dialect Average Tone Speech Synthesis Corpus. It is recorded by Northeast native. About 40% of the corpus contains words unique to Northeast China, the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Average Tone General Northeast Dialect
10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female

10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female. It is recorded by Chaozhou-Shantou Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus TTS Female General Chaozhou Dialect
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

db886190-2e5e-4b08-80f5-d4557da2959a

07f5cdc8-9530-4cb2-9689-e793afa94ce8