en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

m.nexdata.datatang.com

Bilingual Image Caption Dataset - 2.4 Million Pairs

image caption data
image captioning dataset
image text dataset
multimodal dataset
vision language dataset

THis dataset consisting of about 2.4 million image–text pairs. The images cover various categories, including landscapes, animals, flowers and trees, people, cars, sports, industry, and architecture, along with an aesthetic subset. Each image is paired with descriptive captions provided in both English and Chinese, covering overall scene understanding, local visual details, and high-level emotional context.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Data size
2.4 million pairs of images and descriptions
Image type
covers landscapes, animals, flowers and trees, people, cars, sports, industry, and architecture
Data format
image format is .jpg, text format is .txt
Text length
in principle, the description should be no less than 200 Chinese characters
Main description content
overall scene of the picture, detailed description of the elements within the scene, and the emotions conveyed by the picture
Accuracy rate
the proportion of correctly labeled images is not less than 95%
Image Resolution
no less than 2 million pixels, most of them are higher than 5 million pixels
Sample Sample
  • Bilingual Image Caption Dataset - 2.4 Million Pairs
  • Bilingual Image Caption Dataset - 2.4 Million Pairs
Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

Current Project Maturity

Early exploration (no concrete specs yet)
Defined goals, need professional guidance
Active development or optimization phase
Data & labeling experts with clear specifications

By submitting, I agree to the Privacy Protection

71eac694-0147-4277-92d7-95be1a5b5d06

5c0d398c-1893-4ccf-89c4-208f4d2d6b8f