en

Please fill in your name

Mobile phone format error

Please enter the telephone

Please enter your company name

Please enter your company email

Please enter the data requirement

Successful submission! Thank you for your support.

Format error, Please fill in again

Confirm

The data requirement cannot be less than 5 words and cannot be pure numbers

m.nexdata.datatang.com

Image Caption Dataset - 814K Image of General Scenes

image caption dataset for llm
general scene image caption dataset
chinese image caption dataset
multimodal image text data
image description dataset

This dataset contains 814,312 image–text pairs covering a wide range of general scene categories, including landscapes, animals, flowers and trees, people, cars, sports, industries, and buildings. Category and an aesthetic subset. Each image is annotated with at least two single-sentence Chinese descriptions, with a small number of images containing only one description. The data is suitable for image captioning, vision–language model training, multimodal understanding.

Paid Datasets
This is a paid datasets for commercial use, research purpose and more. Licensed ready made datasets help jump-start AI projects.
SpecificationsSpecifications
Data size
700 thousand sets of images and descriptions
Image type
covers landscapes, animals, flowers and trees, people, cars, sports, industry, and architecture, as well as an aesthetic subset
Data format
image format is .jpg, text format is .txt
Description language
Chinese, English
Text length
in principle, a single sentence should be 5-20 characters, and each picture should cover no less than two types of descriptions, each with one sentence; a few images have only one description
Main description content
the main scene or some salient features in the image
Accuracy rate
the proportion of correctly labeled images is not less than 95%
Sample Sample
  • Image Caption Dataset - 814K Image of General Scenes
  • Image Caption Dataset - 814K Image of General Scenes
  • Image Caption Dataset - 814K Image of General Scenes
Recommended DatasetsRecommended Dataset
Tell Us Your Special Needs

Current Project Maturity

Early exploration (no concrete specs yet)
Defined goals, need professional guidance
Active development or optimization phase
Data & labeling experts with clear specifications

By submitting, I agree to the Privacy Protection

0226ae89-42cc-4b25-b159-abdbdd4d072b

f780403d-75b8-45f5-b9dd-dd2c71e6e1f2