75 Dictionaries of Different Chinese Fields

Chinese domain dictionary data
text data
NLU data
Entity Identification data

75 Chinese domain dictionaries, collected in 2013 and covering a wide range of content. Each line in a data file contains a term and its Chinese pinyin, and the terms are sorted alphabetically. The dataset can be used for tasks such as natural language understanding and knowledge base construction.
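
The per-line layout described above (one term and its pinyin per line) can be read with a few lines of Python. This is only a minimal sketch: the listing does not state which delimiter separates the term from its pinyin, so a tab is assumed here, and the function name `parse_dictionary_lines` and the two sample lines are made up for illustration and are not taken from the dataset.

```python
from typing import Dict, Iterable


def parse_dictionary_lines(lines: Iterable[str], sep: str = "\t") -> Dict[str, str]:
    """Map each term to its pinyin.

    Assumption: term and pinyin are separated by `sep` (a tab by default).
    The real delimiter is not documented in the listing.
    """
    entries: Dict[str, str] = {}
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        # Split on the first separator only; pinyin may contain spaces.
        term, _, pinyin = line.partition(sep)
        entries[term.strip()] = pinyin.strip()
    return entries


# Illustrative lines only, not real dataset content.
sample = ["阿司匹林\ta si pi lin\n", "\n", "白细胞\tbai xi bao\n"]
lexicon = parse_dictionary_lines(sample)
```

If the actual files use a different separator, passing it as `sep` should be enough; nothing else in the sketch depends on the tab assumption.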

Paid Datasets
This is a paid dataset for commercial use, research, and more. Licensed, ready-made datasets help jump-start AI projects.
Specifications
Data content: Chinese dictionaries of various fields
Data size: Chinese dictionaries of 75 fields
Collecting period: 2013
Storage format: txt
Language: Chinese
Sample
  • 75 Dictionaries of Different Chinese Fields
Recommended Datasets
10 Million Traditional Chinese Oral Message Data

Traditional Chinese SMS corpus, 10 million messages in total, consisting of real spoken-style Traditional Chinese text. It contains text messages only, stored in txt format. The dataset can be used for natural language understanding and related tasks.

Traditional Chinese SMS corpus, traditional Chinese SMS data, traditional Chinese SMS collection, traditional Chinese corpus data
82 Million Cantonese Script Data

Cantonese text data, 82 million pieces in total, collected from Cantonese script text. The dataset can be used for natural language understanding, knowledge base construction, and other tasks.

Cantonese script data, Cantonese textual data, Cantonese text data collection, dialogue text data
687,694 Open Domain Intention Annotation Data

Annotations for 687,694 user-generated sentences from mobile phone scenarios, covering to-do, location, and schedule scenarios. The dataset can be used for natural language understanding tasks.

open domain data, intent annotation data, textual data annotation, SMS text data, NLU data, intention understanding data
13,000,000 Groups – Man-Machine Conversation Interactive Text Data

Human-machine dialogue interaction text data, 13 million groups in total. The data consists of interaction text between users and a robot. Each line represents one group of interaction text, with the parts separated by '|'. The dataset can be used for natural language understanding, knowledge base construction, and similar tasks.

textual data of human-machine dialogue interaction, human-machine dialogue text, human-machine dialogue data, dialogue text data
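
The line format described for the human-machine conversation data (one interaction group per line, parts separated by '|') could be split as sketched below. This is an assumption-laden illustration: the listing does not say how many parts a group contains or which part belongs to the user, so the helper simply returns the parts in order. The name `split_group` and the sample line are hypothetical.

```python
from typing import List


def split_group(line: str, sep: str = "|") -> List[str]:
    """Split one line (one interaction group) into its parts.

    Assumption: '|' never appears inside an utterance; the listing does not
    describe any escaping rule.
    """
    parts = (part.strip() for part in line.strip().split(sep))
    return [part for part in parts if part]  # drop empty fragments


# Illustrative line only, not real dataset content.
turns = split_group("What's the weather today? | It is sunny today.\n")
```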
13 Modules – Entity Name Single-sentence Annotation Data

More than 15,000 pieces of data across 13 modules, collected from different scenarios and annotated with entity name and entity type. The data is rich in content and highly accurate.

entity annotation, associated entities, textual data annotation, entity type annotation, entity name annotation
28,237 Intent-type Single-sentence Annotation Data

Intent-type single-sentence annotated text data, 28,237 sentences in total, manually written and annotated with intent classes, including slot and slot-value information. The intent domains include music, weather, date, schedule, home equipment, etc. The data is intended for intent recognition research and related fields.

intent annotation data, interactive intent annotation data, intent recognition, nlp intent recognition data, NLU data
47,811 Sentences - Intention Annotation Data in Interactive Scenes

Intent-type single-sentence annotated text data, 47,811 sentences in total, annotated with intent classes, including slot and slot-value information. The intent domains include music, weather, date, schedule, home equipment, etc. The data is intended for intent recognition research and related fields.

intent annotation data, interactive intent annotation data, intent recognition, nlp intent recognition data, NLU data
84,516 Sentences - English Intention Annotation Data in Interactive Scenes

84,516 English sentences from interactive scenarios, annotated with intent classes, including slot and slot-value information. The intent domains include music, weather, date, schedule, home equipment, etc. The data is intended for intent recognition research and related fields.

English intent annotation data, interactive intent annotation data, intent recognition, nlp intent recognition data, NLU data