Landmark Image Dataset – 200K Global Building Photos with Captions
This dataset contains 200,000 image sets with bilingual captions (Chinese and English), covering landmark buildings in more than 20 countries, including the United States, United Kingdom, France, Germany, and Russia. Each set holds 1–10 images of one landmark, captured from different angles, at different distances, and at different times. About 80,000 of the landmarks are domestic and about 120,000 are international. Landmark types include commercial buildings, ancient architecture, monuments, libraries, and scenic spots. Each set is annotated with the landmark's country, city, location, and category, plus descriptive captions. The dataset is suited to training models for landmark recognition, image classification, multilingual image captioning, and image-to-text retrieval.
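To make the annotation layout concrete, here is a minimal Python sketch of how one set might be represented and checked. The listing does not specify a file format or field names, so `LandmarkRecord`, its fields, and `caption_pairs` are all hypothetical. The sketch also assumes captions are attached per set rather than per image, which the description does not confirm.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class LandmarkRecord:
    """One set: a landmark, its images, and its annotations.

    Field names are illustrative assumptions, not the dataset's real schema.
    """
    country: str
    city: str
    location: str
    category: str
    caption_zh: str  # Chinese caption
    caption_en: str  # English caption
    image_paths: List[str] = field(default_factory=list)

    def validate(self) -> None:
        # The description states that each set holds 1-10 images.
        if not 1 <= len(self.image_paths) <= 10:
            raise ValueError(
                f"expected 1-10 images, got {len(self.image_paths)}"
            )
        # Both caption languages should be present and non-empty.
        if not self.caption_zh.strip() or not self.caption_en.strip():
            raise ValueError("both Chinese and English captions are required")


def caption_pairs(record: LandmarkRecord) -> List[Tuple[str, str]]:
    """Pair every image in the set with the English caption,
    e.g. as input for captioning or image-to-text retrieval training."""
    return [(path, record.caption_en) for path in record.image_paths]


if __name__ == "__main__":
    example = LandmarkRecord(
        country="France",
        city="Paris",
        location="Champ de Mars",
        category="monument",
        caption_zh="埃菲尔铁塔",
        caption_en="The Eiffel Tower",
        image_paths=["eiffel_01.jpg", "eiffel_02.jpg"],
    )
    example.validate()
    print(caption_pairs(example))
```

Validating the image count and caption presence at load time is one way to catch malformed sets before they reach a training pipeline.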
landmark image dataset, building recognition dataset, global landmark image caption dataset, bilingual image caption data, Chinese-English caption dataset, landmark classification dataset, image-text dataset, tourism landmark dataset, cultural heritage image dataset, image captioning for AI training