[{"@type":"PropertyValue","name":"Storage format","value":"TXT"},{"@type":"PropertyValue","name":"Data content","value":"Chinese-Uighur Parallel Corpus Data"},{"@type":"PropertyValue","name":"Data size","value":"4.72 million pairs of Chinese-Uighur Parallel Corpus Data. The Chinese sentences contain 22 characters on average"},{"@type":"PropertyValue","name":"Language","value":"Chinese, Uighur"},{"@type":"PropertyValue","name":"Application scenario","value":"machine translation"},{"@type":"PropertyValue","name":"Accuracy rate","value":"90%"}]
{"id":1185,"datatype":"1","titleimg":"https://res.datatang.com/asset/productNew/APY220720002.png?Expires=2007353710&OSSAccessKeyId=LTAI5tQwXnJZbubgVfVa1ep9&Signature=Moa7FebA92pIkRhfCHLwOPn34EA%3D","type1":"183","type1str":null,"type2":"185","type2str":null,"dataname":"4,720,000 Groups - Chinese-Uighur Parallel Corpus Data","datazy":[{"title":"Storage format","content":"TXT","desc":"Storage format"},{"title":"Data content","content":"Chinese-Uighur Parallel Corpus Data","desc":"Data content"},{"title":"Data size","content":"4.72 million pairs of Chinese-Uighur Parallel Corpus Data. The Chinese sentences contain 22 characters on average","desc":"Data size"},{"title":"Language","content":"Chinese, Uighur","desc":"Language"},{"title":"Application scenario","content":"machine translation","desc":"Application scenario"},{"title":"Accuracy rate","content":"90%","desc":"Accuracy rate"}],"datatag":"Chinese,Uighur,Han-Uyghur,Parallel corpus","technologydoc":null,"downurl":null,"datainfo":null,"standard":null,"dataylurl":null,"flag":null,"publishtime":null,"createby":null,"createtime":null,"ext1":null,"samplestoreloc":null,"hosturl":null,"datasize":null,"industryPlan":null,"keyInformation":"","samplePresentation":[{"name":"/data/apps/damp/temp/ziptemp/APY220720002_demo1711015209158/APY220720002-demo/zh_ug ????.png","url":"https://bj-oss-datatang-03.oss-cn-beijing.aliyuncs.com/filesInfoUpload/data/apps/damp/temp/ziptemp/APY220720002_demo1711015209158/APY220720002-demo/zh_ug%20%3F%3F%3F%3F.png?Expires=4102329599&OSSAccessKeyId=LTAI8NWs2pDolLNH&Signature=4x5LxBrzAXyre6%2BPLWdnkk8B%2FKI%3D","intro":"","size":0,"progress":100,"type":"jpg"}],"officialSummary":"4,720,000 sets of Chinese and Uighur language parallel translation corpus, data storage format is txt document. Data cleaning, desensitization, and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.","dataexampl":null,"datakeyword":["Chinese","Uighur","Han-Uyghur","Parallel corpus"],"isDelete":null,"ids":null,"idsList":null,"datasetCode":null,"productStatus":null,"tagTypeEn":"Type","tagTypeZh":null,"website":null,"samplePresentationList":null,"datazyList":null,"keyInformationList":null,"dataexamplList":null,"bgimg":null,"datazyScriptList":null,"datakeywordListString":null,"sourceShowPage":"nlu","BGimg":"","voiceBg":["/shujutang/static/image/comm/audio_bg.webp","/shujutang/static/image/comm/audio_bg2.webp","/shujutang/static/image/comm/audio_bg3.webp","/shujutang/static/image/comm/audio_bg4.webp","/shujutang/static/image/comm/audio_bg5.webp"]}
4,720,000 Groups - Chinese-Uighur Parallel Corpus Data
Chinese
Uighur
Han-Uyghur
Parallel corpus
4,720,000 sets of Chinese and Uighur language parallel translation corpus, data storage format is txt document. Data cleaning, desensitization, and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
Specifications
Storage format
TXT
Data content
Chinese-Uighur Parallel Corpus Data
Data size
4.72 million pairs of Chinese-Uighur Parallel Corpus Data. The Chinese sentences contain 22 characters on average