en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

1,990,000 Groups - Chinese-Czech Parallel Corpus Data

Chinese and Czech Parallel Corpus Data
Alignment Corpus
Parallel Corpus Data
Alignment Corpus Data

1,990,000 sets of Chinese and Czech language parallel translation corpus, data storage format is txt document. Data cleaning, desensitization, and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Storage format
TXT
Data content
Chinese-Czech Parallel Corpus Data, content has been preliminarily categorized, covering the fields of technology, healthcare, tourism, spoken, news and military.
Data size
1.99 million pairs of Chinese-Czech Parallel Corpus Data.
Language
Chinese, Czech
Application scenario
machine translation
Sample Sample
  • Waiting For Data
Recommended DatasetsRecommended Dataset
5,010,000 Groups - Chinese-Tibetan Parallel Corpus Data

5,010,000 sets of Chinese and Tibetan language parallel translation corpus, data storage format is txt document. Data cleaning, desensitization, and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.

Chinese and Tibetan Parallel Corpus Data Alignment Corpus Parallel Corpus Data Alignment Corpus Data
1,140,000 Groups - Chinese - Hebrew Parallel Corpus Data

1.14 Million Pairs of Sentences - Chinese-Hebrew Parallel Corpus Data be stored in text format. It covers multiple fields such as tourism, daily life, news, etc. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.

Chinese -Hebrew Parallel Corpus Data Chinese -Hebrew Parallel Corpus Parallel Corpus Data Alignment Corpus Data
12,820,000 Groups - Chinese-Korean Parallel Corpus Data

12,820,000 sets of parallel translation corpus between China and Korea, which are stored in txt files. It covers many fields including spoken language, traveling, news, and finance. Data cleaning, desensitization, and quality inspection have been carried out. It can be used as the basic corpus database in the text data files as well as used in machine translation.

China and South Korea Parallel Corpus Corpus Data Alignment Corpus Parallel Corpus Data Alignment Corpus Data
3,140,000 Groups - Chinese-Spanish Parallel Corpus Data

The 3,140,000 Groups - Chinese-Spanish Parallel Corpus Data is a bilingual texts is stored in text format. All of the data are related to science and technology. average sentence length is 37.1 characters. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.

Chinese - Spanish Parallel Corpus Data Chinese -Spanish Parallel Corpus Parallel Corpus Data Alignment Corpus Parallel Corpus Data Alignment Corpus Data
850,000 Groups-English-Japanese Parallel Corpus Data

The 850,000 English Japanese Parallel Corpus Data is a bilingual text is stored in text format. It covers multiple fields such as tourism, medical treatment, daily life, news, etc. average English sentence 23 words. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.rn

English - Japanese Parallel Corpus Data English -Japanese Parallel Corpus Parallel Corpus Data Alignment Corpus Data
4,720,000 Groups - Chinese-Uighur Parallel Corpus Data

4,720,000 sets of Chinese and Uighur language parallel translation corpus, data storage format is txt document. Data cleaning, desensitization, and quality inspection have been carried out, which can be used as a basic corpus for text data analysis and in fields such as machine translation.

Chinese and Uygur Parallel Corpus Data Alignment Corpus Parallel Corpus Data Alignment Corpus Data
750,000 Groups - Chinese-Burmese Parallel Corpus Data

0.75 Million Pairs of Sentences - Chinese-Burmese Parallel Corpus Data be stored in text format. It covers multiple fields such as tourism, medical treatment, daily life, news, etc. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.

Chinese -Burmese Parallel Corpus Data Chinese -Burmese Parallel Corpus Parallel Corpus Data Alignment Corpus Data
7,290,000 Groups -Chinese -Vietnamese Parallel Corpus Data

7.29 Million Pairs of Sentences - Chinese-Vietnamese Parallel Corpus Data be stored in text format. It covers multiple fields such as tourism, medical treatment, daily life, news, etc. The data desensitization and quality checking had been done. It can be used as a basic corpus for text data analysis in fields such as machine translation.

Chinese -Vietnamese Parallel Corpus Data Chinese -Vietnamese Parallel Corpus Parallel Corpus Data Alignment Corpus Data
Tell Us Your Special Needs

By submitting, I agree to the Privacy Protection

6dbf8c7b-1abc-46f4-9815-6f2c06547162

f29af20c-cac9-48b5-a384-a312957cdeac