{"id":1159,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY220430001.png?Expires=2007353707&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=Xy0LTK6smT2ZIR6bTMvkcM%2Bj/c0%3D","type1":"165","type1str":null,"type2":"219","type2str":null,"dataname":"20 Hours - American English Speech Synthesis Corpus-Male","datazy":[{"title":"Format","desc":"Format","content":"48,000Hz, 24bit, uncompressed wav, mono channel;"},{"title":"Recording environment","desc":"Recording environment","content":"professional recording studio;"},{"title":"Recording content","desc":"Recording content","content":"general narrative sentences, interrogative sentences, etc;"},{"title":"Speaker","desc":"Speaker","content":"male, 20-30 years old, young and positive voice;"},{"title":"Device","desc":"Device","content":"microphone;"},{"title":"Language","desc":"Language","content":"American English;"},{"title":"Annotation","desc":"Annotation","content":"word and phoneme transcription, four-level prosodic boundary annotation;"},{"title":"Application scenarios","desc":"Application scenarios","content":"speech synthesis."}],"datatag":"English,Tts,American English,Male","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[{"name":"/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100003.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100003.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=EsLOpjsxcnlIoj4qqwkJbQ1TajY%3D","intro":"Look- at- the way they hear operas- and- see oil paintings%.L UH1 K3 / AE1 T / DH AX0 / W EY1 / DH EY1 / HH IY1 R / AA1 . P R AX0 Z / AX0 N D / S IY1 / OY1 L / P EY1 N . T IH0 NG Z","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100009.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100009.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=1zsMh1kphe0l7IMctfv5kpDKjso%3D","intro":"Was there some discussion- about- whether I should- speak%?W AX1 Z / DH EH1 R / S AH1 M / D IH0 . S K AH1 . SH AX0 N3 / AX0 . B AW1 T / W EH1 . DH ER0 / AY1 / SH UH1 D / S P IY1 K","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100005.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100005.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=5Qdf67K4%2BIH5zQJ8G%2F3qYUMh6%2Bs%3D","intro":"The focus- of this chapter is the American revolution%.DH AX0 / F OW1 . K AX0 S3 / AX1 V / DH IH1 S / CH AE1 P . T ER0 / IH1 Z / DH IY0 / AX0 . M EH1 . R IH0 . K AX0 N / R EH2 . V AX0 . L UW1 . SH AX0 N","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100007.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100007.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=tHaBfAfUc8rIfSAOcZp%2F8a68TLM%3D","intro":"Can I go calling any time%?K AE1 N / AY13 / G OW1 / K AO1 . L IH0 NG / EH1 . N IY0 / T AY1 M","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100004.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100004.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=NfaAP7%2B%2BDLU0hQ5AkuwthPVJmj4%3D","intro":"Is- it really take- time%?IH1 Z / IH1 T / R IY1 . AX0 . L IY0 / T EY1 K / T AY1 M3","size":0,"progress":100,"type":"mp3"}],"officialSummary":"Male audio data of American English. It is recorded by American English native speakers, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.","dataexampl":null,"datakeyword":["English","Tts","American English","Male"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Voice Type,Language","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechSyn","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

20 Hours - American English Speech Synthesis Corpus-Male

English

Tts

American English

Male

Male audio data of American English. It is recorded by American English native speakers, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.

Specifications

Format

48,000Hz, 24bit, uncompressed wav, mono channel;

Recording environment

professional recording studio;

Recording content

general narrative sentences, interrogative sentences, etc;

Speaker

male, 20-30 years old, young and positive voice;

Device

microphone;

Language

American English;

Annotation

word and phoneme transcription, four-level prosodic boundary annotation;

Application scenarios

speech synthesis.

Sample

Audio
Look- at- the way they hear operas- and- see oil paintings%.L UH1 K3 / AE1 T / DH AX0 / W EY1 / DH EY1 / HH IY1 R / AA1 . P R AX0 Z / AX0 N D / S IY1 / OY1 L / P EY1 N . T IH0 NG Z
Audio
Was there some discussion- about- whether I should- speak%?W AX1 Z / DH EH1 R / S AH1 M / D IH0 . S K AH1 . SH AX0 N3 / AX0 . B AW1 T / W EH1 . DH ER0 / AY1 / SH UH1 D / S P IY1 K
Audio
The focus- of this chapter is the American revolution%.DH AX0 / F OW1 . K AX0 S3 / AX1 V / DH IH1 S / CH AE1 P . T ER0 / IH1 Z / DH IY0 / AX0 . M EH1 . R IH0 . K AX0 N / R EH2 . V AX0 . L UW1 . SH AX0 N
Audio
Can I go calling any time%?K AE1 N / AY13 / G OW1 / K AO1 . L IH0 NG / EH1 . N IY0 / T AY1 M
Audio
Is- it really take- time%?IH1 Z / IH1 T / R IY1 . AX0 . L IY0 / T EY1 K / T AY1 M3

Recommended Dataset

19.46 Hours - American English Speech Synthesis Corpus-Female

Female audio data of American English,. It is recorded by American English native speaker, with authentic accent and sweet sound. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

American English TTS

Chinese Multi-emotional Modal particle and Natural Conversation Speech Synthesis Corpus

Chinese Multi-emotional Modal particle and Natural Conversation Speech Synthesis Corpus, is recorded by multiple native Chinese voice actors. It not only includes sentences rich in modal particles that align with daily expression habits, but also encompasses free conversation data on given topics. In each conversation, the audio of each speaker is independently stored in their respective tracks. Professional phoneticians have annotated information such as text content, meeting the precise requirements for speech synthesis research and development to a full extent.

Chinese Multi-emotional Modal particle Natural Conversation Speech Synthesis TTS

Mandarin Chinese Seperated Track Spontaneous Dialogue Paralanguage Annotated Speech Synthesis Corpus

Mandarin Chinese Seperated Track Spontaneous Dialogue Paralanguage Annotated Speech Synthesis Corpus, with a free dialogue style. Given a topic, the speaker can express themselves, and in each conversation, each person's audio is stored in their own separate WAV file. Professional linguists have annotated 16 types of paralanguage annotations, text annotations, timestamps, and other information to accurately match the research and development needs of speech synthesis.

M Chinese Spontaneous Dialogue Seperated track Conversation 48khz

2 People - Korean Average Tone Speech Synthesis Corpus

2 People - Korean Average Tone Speech Synthesis Corpus. It is recorded by korean native , with authentic accent. Contains news and colloquial style general corpus,the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Korean Tts Average Tone

14 Hours - Taiwan Mandarin Seven Style Average Tone Speech Synthesis Corpus

14 Hours - Taiwan Mandarin Seven Style Average Tone Speech Synthesis Corpus, 7 styles recorded by 4 professional CharacterVoice, the styles are criminalsubordinate, rough man, little girl, kind grandma, businessman, grandfather, Non-Commissioned Officer style, Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

TTS Taiwan Mandarin Multi-style

2 People - Japanese Average Tone Speech Synthesis Corpus

2 People - Japanese Average Tone Speech Synthesis Corpus. It is recorded by native Japanese, with authentic accent. Contains news and colloquial style general corpus,the phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Japanese Tts Average Tone

10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female

10 Hours - Chaozhou Dialect Speech Synthesis Corpus - Female. It is recorded by Chaozhou-Shantou Pronunciation. the phonemes and tones are balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Synthesis Corpus Chaozhou TTS Chinese Dialect

2 People - Mexican Spanish Average Tone Speech Synthesis Corpus

2 People - Mexican Spanish Average Tone Speech Synthesis Corpus. It is recorded by native Mexican, with authentic accent, Covering both customer service and general styles. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

Mexican Spanish Tts Average Tone

Tell Us Your Special Needs

Full Name *

Contact Phone No. *

Company name *

Company Email *

Data Requirements *

By submitting, I agree to the Privacy Protection

Submit

Subscribe to our newsletter

Be the first to receive Nexdata latest product releases, data solutions and enterprise news.

Off-the-Shelf Datasets: All Category Datasets; LLM Datasets; Computer Vision Datasets; Speech Recognition Datasets; Speech Synthesis Datasets; OCR Datasets; Pronunciation Dictionary; NLU Datasets

Data Service: 3D Point Cloud Data; Street View Data; OCR Data; Behavior Recognition Data; Identity Recognition Data; Speech Recognition Data; Speech Synthesis Data; Multimodal Data

Industries: Generative AI; Autonomous Vehicles; AR/VR; Conversational AI; Smart Home; Retail; Intelligent Healthcare

Company: About Us; News; Partners; Quality & Security; Event
Links: OPENMPD; DataPlus; Datarade

Platform: Platform
Competition: Competition
Resources: Sponsored Datasets

Sharpen Your AI with Better Data

+1(626)594-5598

[email protected]

Sitemap Terms and Conditions

We use cookies to enhance your browsing experience, serve personalized ads or content, and analyze our traffic. By clicking "Accept All", you consent to our use of cookies.

dc3bb794-e917-4622-affd-d57b77923696

c0ad7b71-09e5-4a7f-906d-93674b67b24a

20 Hours - American English Speech Synthesis Corpus-Male

English Tts American English Male

Male audio data of American English. It is recorded by American English native speakers, with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It precisely matches with the research and development needs of the speech synthesis.

English

Tts

American English

Male