en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

75 Dictionaries of Different Chinese Fields

Chinese domain dictionary data
text data
NLU data
Entity Identification data

75 Chinese domain dictionaries, including data for a certain year and covering a wide range of content. Each line in the data file includes a term and its Chinese pinyin, and the terms are sorted alphabetically. This data set can be used for tasks such as natural language understanding, knowledge base building, etc..

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Data content
Chinese Dictionary of Various Fields
Data size
Chinese Dictionary of 75 Fields
Collecting period
The year 2,013
Storage format
txt
Language
Chinese
Sample Sample
  • 75 Dictionaries of Different Chinese Fields
  • 75 Dictionaries of Different Chinese Fields
Recommended DatasetsRecommended Dataset
687,694 Open Domain Intention Annotation Data

Annotation of 687,694 sentences generated by users in the mobile phone scene, covering to-do scenes, location scenes, and schedule scenes. The data set can be used for natural language understanding tasks.

open domain data intent annotation data textual data annotation SMS text data nlu data Intention understanding data
82 Million Cantonese Script Data

Cantonese textual data, 82 million pieces in total; data is collected from Cantonese script text; data set can be used for natural language understanding, knowledge base construction and other tasks.

Cantonese script data Cantonese textual data Cantonese text data collection dialogue text data
10 Million Traditional Chinese Oral Message Data

Traditional Chinese SMS corpus, 10 million in total, real traditional Chinese spoken language text data; only contains text messages; the content is stored in txt format; the data set can be used for natural language understanding and related tasks.

Traditional Chinese SMS corpus traditional Chinese SMS data traditional Chinese SMS collection traditional Chinese corpus data
56,920 Car Fine Granularity Comments Annotation Data

It collectes comments from different car forums and fine-grained annotation is carried out on posts commented by users. Annotations include labels of manufacturer, brand, model, attribute, description value, tendency, etc. It can be used in fine-grained natural language understanding research, emotion analysis and some other fields.

Fine-grained car comment annotation data car comment data annotation text data collection nlu data
50,000 Chinese Social Comments Syntax Annotation Data

50,000 Chinese social comments syntax annotated data. The contents are hot news in 2013. It is annotated with dependency syntax. The contents cover entertainment, economics, technology, fashion, sports, culture and society. The data is stored in xml and can be used for natural language understanding.

Social commentary syntactic tagging data
8,178 Chinese Social Comments Events Annotation Data

8,178 Chinese social comments annotated data. The contents are hot news in 2013. Each piece of news contains one or more events and is annotated with time, theme, cause, procedure and result. The data is stored in xml and can be used for natural language understanding.

Social comment event annotation data event annotation comment annotation data event annotation data
10,000 Chinese News Events Annotation Data

10,000 Chinese news event annotated data. The contents are hot news in 2013. Each piece of news contains one or more events. Each event is annotated. The data is stored in xml and can be used for natural language understanding.

Chinese news corpus annotation corpus annotation news corpus corpus data
84,516 Sentences - English Intention Annotation Data in Interactive Scenes

84,516 Sentences - English Intention Annotation Data in Interactive Scenes, annotated with intent classes, including slot and slot value information; the intent field includes music, weather, date, schedule, home equipment, etc.; it is applied to intent recognition research and related fields.

english intent annotation data interactive intent annotation data intent recognition nlp intent recognition data NLU data
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

766f2752-c4d5-4ff1-a0ad-05d68dd2af15

a633ef68-b1f5-42e8-af0c-6f75e338b1ba