205 People - Mandarin Chinese (China) Noisy Monologue Smartphone Speech Dataset (Guiding)
The dataset was recorded by 205 speakers in a variety of noisy everyday environments; the speakers talk in accented Mandarin. The recorded prompts cover driving scenarios, smart home, and intelligent voice assistant use. The data can be used for language model training and acoustic-model research in speech recognition, corpus construction for machine translation, and voiceprint recognition model training and algorithm research.
Key information: various environments; various scenes; 16kHz, 16bit, wav
Sample transcriptions:
现在开始蒸馒头 (Start steaming the buns now)
开启看电视模式 (Turn on TV-watching mode)
从梅州到宁波要怎么坐车 (How do I travel from Meizhou to Ningbo)
还有多长时候路能通 (How much longer until the road is clear)
这里的停车场在哪里 (Where is the parking lot here)
The Mandarin Chinese (China) Noisy Monologue Smartphone speech dataset (Guiding) consists of monologues recorded from given prompts in noisy conditions, covering generic domains such as in-car, smart home, and voice assistant. The recordings are transcribed with text content and other attributes. The dataset was collected from a large, geographically diverse group of speakers (205 people), which is intended to improve model performance in real, complex tasks. Its quality has been tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, safeguarding user privacy and legal rights throughout data collection, storage, and use; all our datasets are GDPR, CCPA, and PIPL compliant.
This is a paid dataset for commercial use, research purposes, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Format: 16 kHz, 16-bit, uncompressed WAV, mono channel
Content category: Smart car; smart home; voice assistant
Recording condition: Noisy environments, including subway, market, restaurant, street, airport, etc.
Recording device: Android smartphone; iPhone
Speaker: 205 people, 58% male and 42% female
Country: China (CHN)
Language (Region) Code: zh-CN
Language: Mandarin Chinese
Features of annotation: Transcription text
Accuracy Rate: Sentence Accuracy Rate (SAR) 98% (noise symbols are excluded)
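As a rough illustration of how the Format specification could be checked on a delivered file, the sketch below uses Python's standard `wave` module to confirm that a recording is 16 kHz, 16-bit, mono, uncompressed PCM. The function name `matches_spec` and the constants are hypothetical names introduced for this example; they are not part of the dataset or of any vendor tooling.

```python
import wave

# Expected values taken from the Format entry in the specifications.
EXPECTED_RATE_HZ = 16000          # 16 kHz sampling rate
EXPECTED_SAMPLE_WIDTH_BYTES = 2   # 16 bit = 2 bytes per sample
EXPECTED_CHANNELS = 1             # mono


def matches_spec(path):
    """Return True if the WAV file at `path` is 16 kHz, 16-bit, mono, uncompressed PCM.

    `matches_spec` is a hypothetical helper for illustration only.
    """
    with wave.open(path, "rb") as wav:
        return (
            wav.getframerate() == EXPECTED_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
            and wav.getnchannels() == EXPECTED_CHANNELS
            and wav.getcomptype() == "NONE"  # "NONE" means uncompressed PCM
        )
```

Calling `matches_spec` on a local copy of a recording (the path is whatever location the file was saved to) returns `True` only when all four properties agree with the listing, which makes it a convenient first check before feeding files into a training pipeline.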