[{"@type":"PropertyValue","name":"Format","value":"48,000Hz, 24bit, uncompressed wav, mono channel;"},{"@type":"PropertyValue","name":"Recording environment","value":"professional recording studio;"},{"@type":"PropertyValue","name":"Recording content","value":"general narrative sentences, interrogative sentences, etc;"},{"@type":"PropertyValue","name":"Speaker","value":"male, 20-30 years old, young and positive voice;"},{"@type":"PropertyValue","name":"Device","value":"microphone;"},{"@type":"PropertyValue","name":"Language","value":"American English;"},{"@type":"PropertyValue","name":"Annotation","value":"word and phoneme transcription, four-level prosodic boundary annotation;"},{"@type":"PropertyValue","name":"Application scenarios","value":"speech synthesis."}]
{"id":1159,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY220430001.png?Expires=2007353707&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=Xy0LTK6smT2ZIR6bTMvkcM%2Bj/c0%3D","type1":"165","type1str":null,"type2":"219","type2str":null,"dataname":"20 Hours - American English Male Voice TTS Dataset","datazy":[{"title":"Format","desc":"Format","content":"48,000Hz, 24bit, uncompressed wav, mono channel;"},{"title":"Recording environment","desc":"Recording environment","content":"professional recording studio;"},{"title":"Recording content","desc":"Recording content","content":"general narrative sentences, interrogative sentences, etc;"},{"title":"Speaker","desc":"Speaker","content":"male, 20-30 years old, young and positive voice;"},{"title":"Device","desc":"Device","content":"microphone;"},{"title":"Language","desc":"Language","content":"American English;"},{"title":"Annotation","desc":"Annotation","content":"word and phoneme transcription, four-level prosodic boundary annotation;"},{"title":"Application scenarios","desc":"Application scenarios","content":"speech synthesis."}],"datatag":"English,Tts,American English,Male","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[{"name":"/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100003.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100003.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=EsLOpjsxcnlIoj4qqwkJbQ1TajY%3D","intro":"Look- at- the way they hear operas- and- see oil paintings%.L UH1 K3 / AE1 T / DH AX0 / W EY1 / DH EY1 / HH IY1 R / AA1 . P R AX0 Z / AX0 N D / S IY1 / OY1 L / P EY1 N . T IH0 NG Z","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100009.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100009.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=1zsMh1kphe0l7IMctfv5kpDKjso%3D","intro":"Was there some discussion- about- whether I should- speak%?W AX1 Z / DH EH1 R / S AH1 M / D IH0 . S K AH1 . SH AX0 N3 / AX0 . B AW1 T / W EH1 . DH ER0 / AY1 / SH UH1 D / S P IY1 K","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100005.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100005.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=5Qdf67K4%2BIH5zQJ8G%2F3qYUMh6%2Bs%3D","intro":"The focus- of this chapter is the American revolution%.DH AX0 / F OW1 . K AX0 S3 / AX1 V / DH IH1 S / CH AE1 P . T ER0 / IH1 Z / DH IY0 / AX0 . M EH1 . R IH0 . K AX0 N / R EH2 . V AX0 . L UW1 . SH AX0 N","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100007.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100007.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=tHaBfAfUc8rIfSAOcZp%2F8a68TLM%3D","intro":"Can I go calling any time%?K AE1 N / AY13 / G OW1 / K AO1 . L IH0 NG / EH1 . N IY0 / T AY1 M","size":0,"progress":100,"type":"mp3"},{"name":"/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100004.wav","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100004.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=NfaAP7%2B%2BDLU0hQ5AkuwthPVJmj4%3D","intro":"Is- it really take- time%?IH1 Z / IH1 T / R IY1 . AX0 . L IY0 / T EY1 K / T AY1 M3","size":0,"progress":100,"type":"mp3"}],"officialSummary":"This dataset contains 20 hours of American English male voice recordings. It is recorded by Americans (native English speakers) with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It is suitable for text-to-speech (TTS) model training, phoneme recognition research, and AI voice development.","dataexampl":null,"datakeyword":["TTS english dataset","speech synthesis dataset","TTS male voice dataset","male voice dataset for tts","American English speech synthesis dataset"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Voice Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechSyn","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}
https://www.nexdata.ai/shujutang/static/image/index/datatang_yuyin_default.webp
[{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100003.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=EsLOpjsxcnlIoj4qqwkJbQ1TajY%3D"},{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100009.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=1zsMh1kphe0l7IMctfv5kpDKjso%3D"},{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100005.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=5Qdf67K4%2BIH5zQJ8G%2F3qYUMh6%2Bs%3D"},{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100007.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=tHaBfAfUc8rIfSAOcZp%2F8a68TLM%3D"},{"@type":"AudioObject","embedUrl":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220430001_demo1695809020325/APY220430001_demo/100004.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=NfaAP7%2B%2BDLU0hQ5AkuwthPVJmj4%3D"}]
20 Hours - American English Male Voice TTS Dataset
TTS english dataset
speech synthesis dataset
TTS male voice dataset
male voice dataset for tts
American English speech synthesis dataset
This dataset contains 20 hours of American English male voice recordings. It is recorded by Americans (native English speakers) with authentic accent. The phoneme coverage is balanced. Professional phonetician participates in the annotation. It is suitable for text-to-speech (TTS) model training, phoneme recognition research, and AI voice development.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
![Specifications]()
Specifications
Format
48,000Hz, 24bit, uncompressed wav, mono channel;
Recording environment
professional recording studio;
Recording content
general narrative sentences, interrogative sentences, etc;
Speaker
male, 20-30 years old, young and positive voice;
Language
American English;
Annotation
word and phoneme transcription, four-level prosodic boundary annotation;
Application scenarios
speech synthesis.
![Sample]()
Sample
Audio
Look- at- the way they hear operas- and- see oil paintings%.L UH1 K3 / AE1 T / DH AX0 / W EY1 / DH EY1 / HH IY1 R / AA1 . P R AX0 Z / AX0 N D / S IY1 / OY1 L / P EY1 N . T IH0 NG Z
Audio
Was there some discussion- about- whether I should- speak%?W AX1 Z / DH EH1 R / S AH1 M / D IH0 . S K AH1 . SH AX0 N3 / AX0 . B AW1 T / W EH1 . DH ER0 / AY1 / SH UH1 D / S P IY1 K
Audio
The focus- of this chapter is the American revolution%.DH AX0 / F OW1 . K AX0 S3 / AX1 V / DH IH1 S / CH AE1 P . T ER0 / IH1 Z / DH IY0 / AX0 . M EH1 . R IH0 . K AX0 N / R EH2 . V AX0 . L UW1 . SH AX0 N
Audio
Can I go calling any time%?K AE1 N / AY13 / G OW1 / K AO1 . L IH0 NG / EH1 . N IY0 / T AY1 M
Audio
Is- it really take- time%?IH1 Z / IH1 T / R IY1 . AX0 . L IY0 / T EY1 K / T AY1 M3
![Recommended Datasets]()
Recommended Dataset
Tell Us Your Special Needs
1ed47663-3b04-4596-b437-483bf432f740