Articles | Volume 17, issue 3
https://doi.org/10.5194/essd-17-1245-2025
https://doi.org/10.5194/essd-17-1245-2025
Data description article
 | 
24 Mar 2025
Data description article |  | 24 Mar 2025

ChatEarthNet: a global-scale image–text dataset empowering vision–language geo-foundation models

Zhenghang Yuan, Zhitong Xiong, Lichao Mou, and Xiao Xiang Zhu

Viewed

Total article views: 7,149 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
4,841 1,336 972 7,149 134 197
  • HTML: 4,841
  • PDF: 1,336
  • XML: 972
  • Total: 7,149
  • BibTeX: 134
  • EndNote: 197
Views and downloads (calculated since 27 Jun 2024)
Cumulative views and downloads (calculated since 27 Jun 2024)

Viewed (geographical distribution)

Total article views: 7,149 (including HTML, PDF, and XML) Thereof 6,895 with geography defined and 254 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 

Cited

Saved (final revised paper)

Latest update: 28 Apr 2026
Download
Short summary
ChatEarthNet is an image–text dataset that provides high-quality, detailed natural language descriptions for global-scale satellite data. It consists of 163 488 image-text pairs with captions generated by ChatGPT-3.5 and an additional 10 000 image-text pairs with captions generated by ChatGPT-4V(ision). This dataset has significant potential for training and evaluating vision–language geo-foundation models in remote sensing.
Share
Altmetrics
Final-revised paper
Preprint