en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

m.nexdata.datatang.com

100K Chinese LLM Instruction-Following Dataset

LLM evaluation dataset
Chinese LLM instruction following dataset
Instruction-following prompt dataset
Prompt benchmark dataset LLM

This dataset contains 50-400 words, with each prompt containing at least three constraints to train and improve the instruction-following performance of large models. Categories cover generation (news releases, interview outlines, copywriting, manuscript proofreading, Chinese-English essays, grammar learning, research reports, study plans, poetry writing, food descriptions, advertising copy, sales scripts, official document writing assistance, official document review, policy document Q&A, etc.), rewriting (sentence rewriting, text correction, sentence merging, copywriting simplification), summarizing (content summarization), and extraction (event element extraction, opinion extraction, keyword extraction, stance extraction, entity extraction). All prompts are manually compiled to ensure diverse coverage. The dataset is suitable for systematic benchmarking and model assessment.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Quantity of Data
100,000
Data use
Instruction-Following Evaluation for Chinese LLM
Data content
A variety of complex prompt instructions, between 50 and 400 words, with no fewer than 3 constraints in each prompt
Production method
All prompt are manually written to satisfy the diversity of coverage
Language
Chinese
Sample Sample
  • 100K Chinese LLM Instruction-Following Dataset
  • 100K Chinese LLM Instruction-Following Dataset
  • 100K Chinese LLM Instruction-Following Dataset
Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

Current Project Maturity

Early exploration (no concrete specs yet)
Defined goals, need professional guidance
Active development or optimization phase
Data & labeling experts with clear specifications

By submitting, I agree to the Privacy Protection

e07ccb8a-91c7-4abf-929b-19e763172d23

dfdbecfe-2736-4fa5-bef4-3ee6b8089170