[{"@type":"PropertyValue","name":"Format","value":"16kHz, 16bit, uncompressed wav, mono channel"},{"@type":"PropertyValue","name":"Recording environment","value":"quiet indoor environment, without echo"},{"@type":"PropertyValue","name":"Recording content (read speech)","value":"including: '播放音乐', '开始播放', '暂停音乐', '暂停播放', '停止音乐', '停止播放', '接听电话', '挂断电话', '增大音量', '声音大点', '减小音量', '声音小点', '后退一首', '上一首', '快进一首', '下一首', '收藏音乐' ; a total of 17 Chinese Commands"},{"@type":"PropertyValue","name":"Speaker","value":"491 Chinese, balance for gender."}]
{"id":1222,"datatype":"1","titleimg":"/shujutang/static/image/index/datatang_yuyin_default.webp","type1":"165","type1str":null,"type2":"165","type2str":null,"dataname":"491 People - Mandarin(China) Commands speech dataset","datazy":[{"title":"Format","value":"16kHz, 16bit, uncompressed wav, mono channel"},{"title":"Recording environment","value":"quiet indoor environment, without echo"},{"title":"Recording content (read speech)","value":"including: '播放音乐', '开始播放', '暂停音乐', '暂停播放', '停止音乐', '停止播放', '接听电话', '挂断电话', '增大音量', '声音大点', '减小音量', '声音小点', '后退一首', '上一首', '快进一首', '下一首', '收藏音乐' ; a total of 17 Chinese Commands"},{"title":"Speaker","value":"491 Chinese, balance for gender."}],"datatag":"Command,Bluetooth headset","technologydoc":null,"downurl":null,"datainfo":"","standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230104002_demo1711620065790/2.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=9l59EtR2jVU7mI3kpHsTmCSky1M%3D","/data/apps/damp/temp/ziptemp/APY230104002_demo1711620065790/2.wav","开始播放"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230104002_demo1711620065790/1.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=ac3%2BSWTWzTjxN5SG1nXMdSXfLaE%3D","/data/apps/damp/temp/ziptemp/APY230104002_demo1711620065790/1.wav","播放音乐"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230104002_demo1711620065790/4.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=zTXKN2qhXnGidGW1jS5nyJrHihY%3D","/data/apps/damp/temp/ziptemp/APY230104002_demo1711620065790/4.wav","暂停播放"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230104002_demo1711620065790/3.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=vmrrGAUzF%2BlfpAu0Pzw5It%2F47Wc%3D","/data/apps/damp/temp/ziptemp/APY230104002_demo1711620065790/3.wav","暂停音乐"],["mp3","https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY230104002_demo1711620065790/5.wav?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=siyw%2FSxq1r53koBGjwuugcZP1Cs%3D","/data/apps/damp/temp/ziptemp/APY230104002_demo1711620065790/5.wav","停止播放"]],"officialSummary":" Mandarin(China) Commands speech dataset, each recording the same corpus with 17 commonly used command words. The proportion of male and female speakers is balanced, covering multiple age groups. The data is recorded by Bluetooth headset, covering the mainstream models in the market. It can be used for the voice assistant, command control, and other application scenarios.Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.","dataexampl":"","datakeyword":["command words data"," speech assistant data","Car voice data"," commands speech dataset"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Language,Data Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"speechRec","BGimg":"brightSpot_audio","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"],"single":"no"}
491 People - Mandarin(China) Commands speech dataset
command words data
speech assistant data
Car voice data
commands speech dataset
Mandarin(China) Commands speech dataset, each recording the same corpus with 17 commonly used command words. The proportion of male and female speakers is balanced, covering multiple age groups. The data is recorded by Bluetooth headset, covering the mainstream models in the market. It can be used for the voice assistant, command control, and other application scenarios.Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Format
16kHz, 16bit, uncompressed wav, mono channel
Recording environment
quiet indoor environment, without echo
Recording content (read speech)
including: '播放音乐', '开始播放', '暂停音乐', '暂停播放', '停止音乐', '停止播放', '接听电话', '挂断电话', '增大音量', '声音大点', '减小音量', '声音小点', '后退一首', '上一首', '快进一首', '下一首', '收藏音乐' ; a total of 17 Chinese Commands
Speaker
491 Chinese, balance for gender.
Sample
Audio
开始播放
Audio
播放音乐
Audio
暂停播放
Audio
暂停音乐
Audio
停止播放
Recommended Dataset
149 Hours - English(the United Kindom) Children Real-world Casual Conversation and Monologue speech dataset
English(United Kindom) Children Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, lecture, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age, accent and other attributes. Our dataset was collected from extensive and diversify speakers(12 years old and younger children), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
SpontaneousSpeechtext annotationBritish English
145 Hours - Spanish(spain) Children Real-world Casual Conversation and Monologue speech dataset
Spanish(spain) Children Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, lecture, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age, accent and other attributes. Our dataset was collected from extensive and diversify speakers(12 years old and younger children), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
SpanishSpontaneousSpeech text annotation
189 Hours - Spanish(Latin America) Children Real-world Casual Conversation and Monologue speech dataset
Spanish(Latin America) Children Real-world Casual Conversation and Monologue speech dataset, covers self-media, conversation, live, lecture, variety show and other generic domains, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, age, accent and other attributes. Our dataset was collected from extensive and diversify speakers(12 years old and younger children), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Swedish(Sweden) Real-world Casual Conversation and Monologue speech dataset, covers self-meida,interview, etc, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Finnish(Finland) Real-world Casual Conversation and Monologue speech dataset, covers conversation, course, life, etc, mirrors real-world interactions. Transcribed with text content, speaker's ID, gender, and other attributes. Our dataset was collected from extensive and diversify speakers, geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Urdu(Pakistan) Spontaneous Dialogue Telephony speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(270 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Pashto(Afghanistan) Spontaneous Dialogue Telephony speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(224 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Dari(Afghanistan) Spontaneous Dialogue Telephony speech dataset, collected from dialogues based on given topics, covering 20+ domains. Transcribed with text content, speaker's ID, gender, age and other attributes. Our dataset was collected from extensive and diversify speakers(452 native speakers), geographicly speaking, enhancing model performance in real and complex tasks. Quality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.