830,276 groups - Multi-Round Interpersonal Dialogues Text Data
Interactive text corpus database
text corpus database
This database is an interactive text corpus of real mobile phone users, comprising 6.46 million texts. The data has been desensitized so that it contains no private user information: senders and receivers are replaced with the codes A and B, and sensitive information such as cellphone numbers and user names is replaced with '* * *'. The database can be used for tasks such as natural language understanding.
This is a paid dataset for commercial use, research purposes, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Data content: text corpus of real-world multi-round interpersonal dialogues
Data size: 830,276 groups
Collecting period: 2015
Storage format: txt
Language: Chinese
Applications: semantic parsing of multi-round dialogues in smart customer service and intelligent interaction scenarios