Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female

Synthesis Corpus

TTS

Mandarin

Mixed Speech with Chinese & English

Chinese

English

Dialect

20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female. It is recorded by Chengdu Sichuan Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.

Specifications

Format

48,000Hz, 24bit, uncompressed wav, mono channel;

Recording environment

professional recording studio;

Recording content

general corpus

Speaker

professional Character Voice, female, 20-30 years old

Device

microphone;

Language

sichuan dialect

Annotation

word and phoneme transcription, prosodic boundary annotation;

Application scenarios

speech synthesis.

Sample

Audio
哎呀#3，桂花#1超级#1漂亮嘞#4！ai1 ya4 gui3 hua1 cao2 ji4 piao3 liang1 lei2
Audio
嘿嘿#3，那我们#2又能#1愉快嘞#1玩耍啦#4！hei1 hei1 la3 ngo1 men1 you3 len1 yu1 kuai3 lei1 wan1 sua1 la1
Audio
嘿#3，你在#1那儿#2做啥子#4。hei4 ni1 zai4 lar2 zu4 sa3 zi4
Audio
因为#1他#1张嘴后#2牙齿#1正好#1与#1身后#1储物箱#1上嘞#1图案#1完美#1重合#4。yin2 wei1 ta1 zang2 zui1 hou3 ya1 ci2 zen3 hao1 yu1 sen2 hou4 cu1 wu4 xiang2 sang3 lei1 tu4 an4 wan3 mei1 cong4 ho4
Audio
陀螺嘞#1大小#2要看#1木轴嘞#1粗细#4。to1 lo1 lei1 da3 xiao4 yao2 kan4 mu1 zou1 lei1 cu2 xi3

Tell Us Your Special Needs

Full Name *

Contact Phone No. *

Company name *

Company Email *

Data Requirements *

By submitting, I agree to the Privacy Protection

Submit

Subscribe to our newsletter

Be the first to receive Nexdata latest product releases, data solutions and enterprise news.

Off-the-Shelf Datasets: All Category Datasets; LLM Datasets; Computer Vision Datasets; Speech Recognition Datasets; Speech Synthesis Datasets; OCR Datasets; Pronunciation Dictionary; NLU Datasets

Data Service: 3D Point Cloud Data; Street View Data; OCR Data; Behavior Recognition Data; Identity Recognition Data; Speech Recognition Data; Speech Synthesis Data; Multimodal Data

Industries: Generative AI; Autonomous Vehicles; AR/VR; Conversational AI; Smart Home; Retail; Intelligent Healthcare

Company: About Us; News; Partners; Quality & Security; Event
Links: OPENMPD; DataPlus; Datarade

Platform: Platform
Competition: Competition
Resources: Sponsored Datasets

Sharpen Your AI with Better Data

+1(626)594-5598

[email protected]

Sitemap Terms and Conditions

We use cookies to enhance your browsing experience, serve personalized ads or content, and analyze our traffic. By clicking "Accept All", you consent to our use of cookies.

f4c4ddac-8ba0-4a10-9536-3d9bebb1f192

75a2235e-7df7-472e-aa8f-3b8e251e01fe

20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female

Synthesis Corpus TTS Mandarin Mixed Speech with Chinese & English Chinese English Dialect

20 Hours - Sichuan Dialect Speech Synthesis Corpus - Female. It is recorded by Chengdu Sichuan Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus

TTS

Mandarin

Mixed Speech with Chinese & English

Chinese

English

Dialect