Articles | Volume 17, issue 3
https://doi.org/10.5194/essd-17-1245-2025
https://doi.org/10.5194/essd-17-1245-2025
Data description paper
 | 
24 Mar 2025
Data description paper |  | 24 Mar 2025

ChatEarthNet: a global-scale image–text dataset empowering vision–language geo-foundation models

Zhenghang Yuan, Zhitong Xiong, Lichao Mou, and Xiao Xiang Zhu

Viewed

Total article views: 4,219 (including HTML, PDF, and XML)
HTML PDF XML Total BibTeX EndNote
2,858 457 904 4,219 83 113
  • HTML: 2,858
  • PDF: 457
  • XML: 904
  • Total: 4,219
  • BibTeX: 83
  • EndNote: 113
Views and downloads (calculated since 27 Jun 2024)
Cumulative views and downloads (calculated since 27 Jun 2024)

Viewed (geographical distribution)

Total article views: 4,219 (including HTML, PDF, and XML) Thereof 4,082 with geography defined and 137 with unknown origin.
Country # Views %
  • 1
1
 
 
 
 

Cited

Latest update: 09 Oct 2025
Download
Short summary
ChatEarthNet is an image–text dataset that provides high-quality, detailed natural language descriptions for global-scale satellite data. It consists of 163 488 image-text pairs with captions generated by ChatGPT-3.5 and an additional 10 000 image-text pairs with captions generated by ChatGPT-4V(ision). This dataset has significant potential for training and evaluating vision–language geo-foundation models in remote sensing.
Share
Altmetrics
Final-revised paper
Preprint