Articles | Volume 17, issue 3
https://doi.org/10.5194/essd-17-1245-2025
Data description article | 24 Mar 2025

ChatEarthNet: a global-scale image–text dataset empowering vision–language geo-foundation models

Zhenghang Yuan, Zhitong Xiong, Lichao Mou, and Xiao Xiang Zhu

Viewed

Total article views: 7,823 (including HTML, PDF, and XML)
  • HTML: 5,196
  • PDF: 1,628
  • XML: 999
  • Total: 7,823
  • BibTeX: 157
  • EndNote: 211
Views and downloads (calculated since 27 Jun 2024)

Viewed (geographical distribution)

Total article views: 7,823 (including HTML, PDF, and XML). Of these, 7,624 have a defined geography and 199 are of unknown origin.

Latest update: 08 Jun 2026
Short summary
ChatEarthNet is an image–text dataset that provides high-quality, detailed natural language descriptions for global-scale satellite data. It consists of 163 488 image–text pairs with captions generated by ChatGPT-3.5 and an additional 10 000 image–text pairs with captions generated by ChatGPT-4V(ision). This dataset has significant potential for training and evaluating vision–language geo-foundation models in remote sensing.