Articles | Volume 17, issue 12
https://doi.org/10.5194/essd-17-6851-2025
© Author(s) 2025. This work is distributed under the Creative Commons Attribution 4.0 License.
Long history paddy rice mapping across Northeast China with deep learning and annualresult enhancement method
Download
- Final revised paper (published on 05 Dec 2025)
- Supplement to the final revised paper
- Preprint (discussion started on 29 Jan 2025)
- Supplement to the preprint
Interactive discussion
Status: closed
Comment types: AC – author | RC – referee | CC – community | EC – editor | CEC – chief editor
| : Report abuse
-
RC1: 'Comment on essd-2024-516', Anonymous Referee #1, 13 Feb 2025
- AC1: 'Reply on RC1', zihui zhang, 18 Mar 2025
-
RC2: 'Comment on essd-2024-516', Anonymous Referee #2, 04 Mar 2025
- AC2: 'Reply on RC2', zihui zhang, 18 Mar 2025
Peer review completion
AR – Author's response | RR – Referee report | ED – Editor decision | EF – Editorial file upload
AR by zihui zhang on behalf of the Authors (18 Mar 2025)
Author's response
Author's tracked changes
Manuscript
ED: Referee Nomination & Report Request started (19 Mar 2025) by Yuhan (Douglas) Rao
RR by Anonymous Referee #1 (25 Mar 2025)
RR by Anonymous Referee #3 (27 Aug 2025)
ED: Reconsider after major revisions (28 Aug 2025) by Yuhan (Douglas) Rao
AR by zihui zhang on behalf of the Authors (05 Sep 2025)
Author's response
Author's tracked changes
Manuscript
ED: Referee Nomination & Report Request started (27 Sep 2025) by Yuhan (Douglas) Rao
RR by Anonymous Referee #3 (10 Oct 2025)
RR by Anonymous Referee #1 (17 Nov 2025)
ED: Publish subject to minor revisions (review by editor) (17 Nov 2025) by Yuhan (Douglas) Rao
AR by zihui zhang on behalf of the Authors (17 Nov 2025)
Author's response
Author's tracked changes
Manuscript
ED: Publish as is (18 Nov 2025) by Yuhan (Douglas) Rao
AR by zihui zhang on behalf of the Authors (19 Nov 2025)
This paper presents an effort to map the long-term paddy rice cultivation in Northeast China using Landsat time series and deep learning. The proposed long-term maps could be useful for understanding the historical crop dynamics in the study area. However, I do have major concerns as below:
Reference:
Olofsson, P., Foody, G.M., Herold, M., Stehman, S.V., Woodcock, C.E. and Wulder, M.A., 2014. Good practices for estimating area and assessing accuracy of land change. Remote sensing of Environment, 148, pp.42-57.
Other comments:
L22: Are there average numbers from 1985 to 2023? Make it clearer.
L44-45: What is the justification for such a statement that it's challenging to produce long-term maps using phenology-based methods? any references?
L64: I think you are saying that the final map for a specific year is derived from multiple intermediate maps within the year. But ‘multiple annual results” means multiple yearly maps, i.e., a map for each year. This could be confusing.
L116: You mentioned Result_pre in the text but there is no Result_pre in Eq.1.
L132: From Eq.2, t represents the image corresponding to the highest absolute value of the difference between the category probability and 0.5, not the direct highest Pi. Why not use max(Pi) instead? For example, if P1=0.1, P2=0.6, then there would be t=1, and Pt=P1=0.1. Would you determine the final results as non-paddy since Pt < 0.5?
L136: Are you using exactly the same parameters (models) in these different phenological stages? Otherwise, how could you ensure that the category probability outputs among m images are comparable?
L142: In which months did you download the data and from where? Specify the date range for each year, or the same range across years if they are consistent.
L145: Please provide the specific band names instead of numeric names.
L156: There are some issues with the validation dataset. The sampling strategy is unknown. How did you select the field sites to visit? If the distribution of validation datasets is biased (not randomly selected), then the map accuracy based on the validation datasets is not valid. Did you collect field data as point observations? Using points to validate 30-m maps is inappropriate especially when mixed pixels occur. What are the spatial and temporal distributions of training and validation data?
L170: 29906 + 9968 + 50956 + 16985 = 107815, this is less than the total size (68865 + 39098 = 107963, L154), did you remove any ground samples and why? What are the criteria for dividing the entire ground data into these training/validation sets with these specific numbers, for Landsat5 and 8/9 respectively?
L205: Please use explicit band names. In Fig.3, did you use the same probability threshold of 0.5 in both overlay maps and the ARE maps? For the red circled area, e.g., in E5, a non-paddy pixel in the overlay map means all category probability outputs are less than 0.5. According to Eq2, for ARE methods, a paddy pixel must have a probability greater than 0.5. How come a non-paddy pixel in the overlay map would become a paddy pixel in the ARE map?
L212: The distribution of the validation points is totally unknown. Are they derived from probability samples? Otherwise, the validation would not be valid. A confusion matrix based on pixel counting is not recommended. The population error matrix of classes with cell entries should be expressed in terms of the proportion of area. Besides, the uncertainty of these accuracy metrics should also be reported. Refer to Olofsson et. al (2014) for a guideline on how to conduct map accuracy evaluation in a solid manner.
Reference:
Olofsson, P., Foody, G.M., Herold, M., Stehman, S.V., Woodcock, C.E. and Wulder, M.A., 2014. Good practices for estimating area and assessing accuracy of land change. Remote sensing of Environment, 148, pp.42-57.
L236: Fig.4, what is the scale of this comparison? I assume this is the total area in the entire study area. What about the comparisons at the district, municipal, and provincial levels since you collected the agricultural statistics?
L263: what if there are no clear-sky observations available in the proceeding and subsequent years? did you leave it as no data? In Fig.7, what is "interpolated paddy"? Did you interpolate your classification directly from the classification in the previous/next year, if there is no cloud-free satellite data in the current year? This has to be clarified.
I assume the interpolated paddy pixels are derived from satellite data that are interpolated from previous/next Landsat observations. The term 'interpolated paddy' implies that the classified pixel itself is somehow interpolated, which makes no sense.
L275-276: There is no figure showing the admin boundaries and labels. It's hard to reader unfamiliar with China to link your descriptive context to the spatial locations in the map.
L301: This is contradictory to the table. Instead, #1 and #4 show that using data from only one sensor achieved the best accuracy and had the best results (if the models are trained and applied to the same sensor), compared to other scenarios using multiple sensors.
L304-305: Not clear how you conducted 'transfer learning". For #8, training on L5 & 20% L8, then apply to L8? Need to clarify.
L306: An enhanced accuracy compared to what?