en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

3,140,000 Groups - Chinese-Spanish Parallel Corpus Data

Chinese - Spanish Parallel Corpus Data
Chinese -Spanish Parallel Corpus
Parallel Corpus Data
Alignment Corpus
Parallel Corpus Data
Alignment Corpus Data

The 3,140,000 Groups - Chinese-Spanish Parallel Corpus Data is a bilingual texts is stored in text format. All of the data are related to science and technology. average sentence length is 37.1 characters. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
TXT
Data content
Chinese-Spanish Parallel Corpus Data
Data size
3.14 million pairs of Chinese-Spanish Parallel Corpus Data. The Chinese sentences contain 37.1 characters on average.
Language
Chinese,Spanish
Application scenario
machine translation
Accuracy rate
90%
Sample Sample
  • 3,140,000 Groups - Chinese-Spanish Parallel Corpus Data
Recommended DatasetsRecommended Dataset
1,340,000 Groups – English-Korean Parallel Corpus Data

English and Korean parallel corpus, 1340,000 groups in total; excluded political, porn, personal information and other sensitive vocabulary; it can be a base corpus for text-based data analysis, used in machine translation and other fields.

English and Korean parallel corpus data English and Korean corpus collection Alignment Corpus Parallel Corpus Data Alignment Corpus Data
380,000 Groups – Japanese-English Parallel Corpus Data

Japanese and English parallel corpus, 380,000 groups in total; excluded political, porn, personal information and other sensitive vocabulary; it can be a base corpus for text-based data analysis, used in machine translation and other fields.

Japanese and English parallel corpus data Japanese and English parallel corpus collection Alignment Corpus Parallel Corpus Data Alignment Corpus Data
100,000 Groups - Chinese-Uighur Parallel Corpus Data

100,000 sets of Chinese and Uighur language parallel translation corpus, data storage format is txt document, data fluency and loyalty is above 80%. Data cleaning, desensitization and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.

Chinese and Uygur Parallel Corpus Data Alignment Corpus Parallel Corpus Data Alignment Corpus Data
80,120,000 Groups – Chinese-English Parallel Corpus Data

Parallel translation corpus between Chinese and English. It is stored in txt files. It covers files like travel, medicine, daily and TV play. Data cleaning, desensitization, and quality inspection have been carried out. It can be used as the basic corpus database in text data file as well as used in machine translation.

Chinese-English Parallel Corpus Data Chinese-English Alignment Corpus
1,980,000 Groups - Chinese-Polish Parallel Corpus Data

1,980,000 sets of Chinese and Polish language parallel translation corpus, data storage format is txt document. Data cleaning, desensitization, and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.

Chinese and Polish Parallel Corpus Data Alignment Corpus Parallel Corpus Data Alignment Corpus Data
1,990,000 Groups - Chinese-Czech Parallel Corpus Data

1,990,000 sets of Chinese and Czech language parallel translation corpus, data storage format is txt document. Data cleaning, desensitization, and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.

Chinese and Czech Parallel Corpus Data Alignment Corpus Parallel Corpus Data Alignment Corpus Data
APY230,328,001_980,000 Groups - Chinese-Urdu Parallel Corpus Data

980,000 sets of Chinese and Urdu language parallel translation corpus, data storage format is txt document. Data cleaning, desensitization, and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.

Chinese and Urdu Parallel Corpus Data Alignment Corpus Parallel Corpus Data Alignment Corpus Data
5,010,000 Groups - Chinese-Tibetan Parallel Corpus Data

5,010,000 sets of Chinese and Tibetan language parallel translation corpus, data storage format is txt document. Data cleaning, desensitization, and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.

Chinese and Tibetan Parallel Corpus Data Alignment Corpus Parallel Corpus Data Alignment Corpus Data
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

694392df-5487-486a-8158-bf9f9b50e439

3471ff74-b6ee-48f1-a185-fdd7dbaccee5