en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

m.nexdata.datatang.com

Multilingual Grammar Correction Dataset – 480K Parallel Texts (DE, ES, FR, IT)

German
French
Spanish
Italian
proofreading
Multilingual Grammar Correction Dataset
Grammar Correction Dataset

This dataset focuses on the four major European languages (French, German, Spanish, Italian) and contains 480000 pairs of original and corrected text pairs. Each piece of data is presented in JSON format, including two fields: input (raw text) and output (corrected text), which can assist in natural language processing, machine translation, and language teaching research.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Data content
Text pairs of original and corrected texts for four European languages
Data volume
480000 pairs
Languages
French, German, Spanish, Italian
Field
input,output
Format
JSON
Sample Sample
  • Multilingual Grammar Correction Dataset – 480K Parallel Texts (DE, ES, FR, IT)
  • Multilingual Grammar Correction Dataset – 480K Parallel Texts (DE, ES, FR, IT)
  • Multilingual Grammar Correction Dataset – 480K Parallel Texts (DE, ES, FR, IT)
Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

6f0c6b4a-ecd1-4417-9546-c4c6fad76578

ad3466eb-4b86-4d5e-b87b-2e76cfd04a2e