ChatEarthNet: a global-scale image–text dataset empowering vision–language geo-foundation models

Yuan, Zhenghang; Xiong, Zhitong; Mou, Lichao; Zhu, Xiao Xiang

doi:10.5194/essd-17-1245-2025

Articles | Volume 17, issue 3

https://doi.org/10.5194/essd-17-1245-2025

© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.

https://doi.org/10.5194/essd-17-1245-2025

© Author(s) 2025. This work is distributed under
the Creative Commons Attribution 4.0 License.

Articles | Volume 17, issue 3

Data description article

|

24 Mar 2025

Data description article |

| 24 Mar 2025

ChatEarthNet: a global-scale image–text dataset empowering vision–language geo-foundation models

Zhenghang Yuan, Zhitong Xiong, Lichao Mou, and Xiao Xiang Zhu

Viewed

Total article views: 7,993 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	BibTeX	EndNote
5,306	1,682	1,005	7,993	162	214

HTML: 5,306
PDF: 1,682
XML: 1,005
Total: 7,993
BibTeX: 162
EndNote: 214

Views and downloads (calculated since 27 Jun 2024)

Month	HTML	PDF	XML	Total
Jun 2024	47	13	7	67
Jul 2024	149	52	8	209
Aug 2024	99	22	11	132
Sep 2024	90	37	93	220
Oct 2024	49	8	225	282
Nov 2024	52	8	174	234
Dec 2024	65	11	201	277
Jan 2025	48	11	150	209
Feb 2025	56	13	1	70
Mar 2025	385	63	3	451
Apr 2025	290	28	5	323
May 2025	195	34	1	230
Jun 2025	258	51	5	314
Jul 2025	181	33	18	232
Aug 2025	283	31	1	315
Sep 2025	566	29	1	596
Oct 2025	168	49	2	219
Nov 2025	298	55	5	358
Dec 2025	281	113	9	403
Jan 2026	533	149	18	700
Feb 2026	462	110	16	588
Mar 2026	203	320	10	533
Apr 2026	215	252	23	490
May 2026	214	128	11	353
Jun 2026	28	27	2	57
Jul 2026	91	35	5	131

Cumulative views and downloads (calculated since 27 Jun 2024)

Month	HTML	PDF	XML	Total
Jun 2024	47	13	7	67
Jul 2024	149	52	8	209
Aug 2024	99	22	11	132
Sep 2024	90	37	93	220
Oct 2024	49	8	225	282
Nov 2024	52	8	174	234
Dec 2024	65	11	201	277
Jan 2025	48	11	150	209
Feb 2025	56	13	1	70
Mar 2025	385	63	3	451
Apr 2025	290	28	5	323
May 2025	195	34	1	230
Jun 2025	258	51	5	314
Jul 2025	181	33	18	232
Aug 2025	283	31	1	315
Sep 2025	566	29	1	596
Oct 2025	168	49	2	219
Nov 2025	298	55	5	358
Dec 2025	281	113	9	403
Jan 2026	533	149	18	700
Feb 2026	462	110	16	588
Mar 2026	203	320	10	533
Apr 2026	215	252	23	490
May 2026	214	128	11	353
Jun 2026	28	27	2	57
Jul 2026	91	35	5	131

Viewed (geographical distribution)

Total article views: 7,993 (including HTML, PDF, and XML) Thereof 7,728 with geography defined and 265 with unknown origin.

Country	#	Views	%

1

1

Cited

Saved (final revised paper)

Latest update: 21 Jul 2026

Short summary

ChatEarthNet is an image–text dataset that provides high-quality, detailed natural language descriptions for global-scale satellite data. It consists of 163 488 image-text pairs with captions generated by ChatGPT-3.5 and an additional 10 000 image-text pairs with captions generated by ChatGPT-4V(ision). This dataset has significant potential for training and evaluating vision–language geo-foundation models in remote sensing.

ChatEarthNet: a global-scale image–text dataset empowering vision–language geo-foundation models

Viewed

Viewed (geographical distribution)

Cited

10 citations as recorded by crossref.

10 citations as recorded by crossref.

Saved (final revised paper)


Total:	0
HTML:	0
PDF:	0
XML:	0


Total:	0
HTML:	0
PDF:	0
XML:	0


Total:	0
HTML:	0
PDF:	0
XML:	0