en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

m.nexdata.datatang.com

1M Chinese Coding Questions Dataset – Python/Java/C++

Chinese coding questions dataset
programming QA data
parsed coding problems
Python Java C++ dataset
code generation LLM dataset
Chinese code questions

This dataset contains 1 million Chinese programming questions with corresponding answers, detailed parses (explanations), and programming language labels. It includes a wide range of questions in C, C++, Python, Java, and JavaScript, making it ideal for training large language models (LLMs) on multilingual code understanding and generation. The questions cover fundamental to advanced topics, supporting AI applications such as code completion, bug fixing, and programming reasoning. This structured dataset enhances model performance in natural language programming tasks and helps reinforce code logic skills in AI systems. All data complies with international privacy regulations including GDPR, CCPA, and PIPL.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Content
Code questions text;
Data Size
About 1 million;
Data Fields
Contains title, answer, parse and language;
Data Categories
c, c++, java, python, javascript;
Format
Jsonl;
Language
Chinese;
Data processing
Subject, questions, parse and answers were analyzed, and content was also cleaned
Sample Sample
  • 1M Chinese Coding Questions Dataset – Python/Java/C++
  • 1M Chinese Coding Questions Dataset – Python/Java/C++
  • 1M Chinese Coding Questions Dataset – Python/Java/C++
Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

80c328c9-134b-40cd-9825-0dcc93c6d7d0

c0963727-2fc8-4074-98ad-42e0c082a730