en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

6,020,000 Groups - Chinese-French Parallel Corpus Data

Chinese-French parallel corpus data
Chinese-French alignment
Parallel Corpus Data
Alignment Corpus Data

1 Million Pairs of Sentences - Chinese-French Parallel Corpus Data be stored in txt format. It covers multiple fields such as tourism, medical treatment, daily life, TV play, etc. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Format
TXT
Data content
Chinese-French Parallel Corpus
Data size
6.02 million pairs of Chinese-French Parallel Corpus Data. The Chinese sentences contain 14.8 characters on average.
Language
Chinese, French
Applications
machine translation
Accuracy rate
90%
Sample Sample
  • 6,020,000 Groups - Chinese-French Parallel Corpus Data
Recommended DatasetsRecommended Dataset
7,290,000 Groups -Chinese -Vietnamese Parallel Corpus Data

7.29 Million Pairs of Sentences - Chinese-Vietnamese Parallel Corpus Data be stored in text format. It covers multiple fields such as tourism, medical treatment, daily life, news, etc. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.

Chinese -Vietnamese Parallel Corpus Data Chinese -Vietnamese Parallel Corpus Parallel Corpus Data Alignment Corpus Data
10,030,000 Groups – Chinese-Portuguese Parallel Corpus Data

10.03 Million Pairs of Sentences - Chinese-Portuguese Parallel Corpus Data be stored in text format. It covers multiple fields such as tourism, medical treatment, daily life, news, etc. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.

Chinese -Portuguese Parallel Corpus Data Chinese -Portuguese Parallel Corpus Parallel Corpus Data Alignment Corpus Data
5,310,000 Groups – Chinese-Germany Parallel Corpus Data

5.14 Million Pairs of Sentences - Chinese-Germany Parallel Corpus Data be stored in text format. It covers multiple fields such as tourism, medical treatment, daily life, news, etc. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.

Chinese - Germany Parallel Corpus Data Chinese -Germany Parallel Corpus Parallel Corpus Data Alignment Corpus Data
7,440,000 Groups – Chinese-Hindi Parallel Corpus Data

7.44 Million Pairs of Sentences - Chinese-Hindi Parallel Corpus Data be stored in text format. It covers multiple fields such as tourism, medical treatment, daily life, news, etc. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.

Chinese -Hindi Parallel Corpus Data Chinese -Hindi Parallel Corpus Parallel Corpus Data Alignment Corpus Data
1,080,000 Groups – English-Russian Parallel Corpus Data

English and Russian parallel corpus, 1,080,000 groups in total; excluded political, porn, personal information and other sensitive vocabulary; it can be a base corpus for text-based data analysis, used in machine translation and other fields.

English and Russian parallel corpus data English and Russian corpus collection English Russian parallel corpus Parallel Corpus Data Alignment Corpus Data
1,000,000 Groups - Chinese-Russian Parallel Corpus Data

1 Million Pairs of Sentences - Chinese-Russian Parallel Corpus Data be stored in .txt format. It covers multiple fields such as tourism, medical treatment, daily life, TV play, etc. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.

Chinese-Russian parallel corpus data Chinese-Russian alignment Parallel Corpus Data Alignment Corpus Data
9,830,000 Groups - Chinese-Japanese Parallel Corpus Data

9.83 Million Pairs of Sentences - Chinese-Japanese Parallel Corpus Data be stored in txt format. It covers multiple fields including general, IT, news, patent, and international engine. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.

Chinese-Japanese parallel corpus Chinese-Japanese alignment Parallel Corpus Data Alignment Corpus Data
380,000 Groups - Uighur-Chinese Parallel Corpus Data

Uighur language and its parallel corresponding Chinese text data, 38,000 groups in total. They been cleaned, desensitized and gone through quality check. It can be used as base corpus for text data analysis in machine translation and related fields.

Parallel corpus Uighur corpus machine translation
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

63548aef-6efa-4319-b72c-af657bf8d95d

4c40358e-ed33-4c05-b1e1-6e1519d0da5a