m.nexdata.datatang.com

200,955 Sentences - Mandarin Prosodic Dataset for TTS Prosody Prediction

Mandarin prosodic corpus
TTS prosody training data
Front-end prosody prediction corpus
Mandarin speech synthesis data
Prosodic hierarchy annotation
Chinese TTS front-end dataset
Sentence-level prosody corpus
Mandarin intonation dataset

This dataset provides four-level prosodic hierarchy annotations for 200,955 carefully selected Chinese sentences, covering both news and colloquial language. Sentence lengths are moderate and sentence patterns are varied. It can be used as training data for TTS front-end prosody prediction.

Paid Datasets
This is a paid dataset for commercial use, research purposes, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Data content: prosodic annotation for 200,955 selected Chinese sentences
Data scale: 200,955 sentences
Data source: all text comes from news and human conversation
Annotation: four-level prosodic hierarchy annotation
Language: Chinese
Application scenarios: speech synthesis
Accuracy: not lower than 99%