Global Ice Water Path Retrieval Using Fengyun series Satellite Data: A Machine Learning Approach

Yang, Yifan; Dou, Tingfeng; Xu, Gaojie; Zhou, Rui; Li, Bo; Husi, Letu; Wang, Wenyu; Xiao, Cunde

doi:10.5194/essd-2025-447

Preprints

https://doi.org/10.5194/essd-2025-447

Preprints

10 Sep 2025

| 10 Sep 2025

Status: a revised version of this preprint is currently under review for the journal ESSD.

Global Ice Water Path Retrieval Using Fengyun series Satellite Data: A Machine Learning Approach

Yifan Yang, Tingfeng Dou, Gaojie Xu, Rui Zhou, Bo Li, Letu Husi, Wenyu Wang, and Cunde Xiao

Abstract. This study presents a novel machine learning framework (RobustResMLP) for retrieving the global ice water path (IWP) and cloud ice water path (CIWP) from 2009–2024 via passive microwave observations from China's Fengyun-3 series satellites' microwave humidity sounders (MWHS-I/II). The framework employs a lightweight multilayer perceptron architecture enhanced with gated residual units and hierarchical differential dropout to address the challenges associated with high-noise satellite data. By establishing rigorous spatiotemporal collocation with CloudSat 2C-ICE products, we generate three operational products: (1) synoptic type that orbital-resolution IWP/CIWP (15 km; 2009–2024), (2) climatic type that gridded monthly composites (1°×1°; 2011–2024), and (3) cloud layer mask (CLM) products. Notably, the 89 GHz channel emerges is the most influential predictor despite theoretical limitations. This approach achieves a critical compromise between pointwise accuracy and spatiotemporal completeness, enabling unprecedented decadal-scale cloud feedback analyses. All the datasets are open available in the netCDF4 format for community sharing.

Received: 26 Jul 2025 – Discussion started: 10 Sep 2025

Publisher's note: Copernicus Publications remains neutral with regard to jurisdictional claims made in the text, published maps, institutional affiliations, or any other geographical representation in this paper. While Copernicus Publications makes every effort to include appropriate place names, the final responsibility lies with the authors. Views expressed in the text are those of the authors and do not necessarily reflect the views of the publisher.

Download & links

Preprint (PDF, 1919 KB)

Supplement (588 KB)

Download & links

Yifan Yang, Tingfeng Dou, Gaojie Xu, Rui Zhou, Bo Li, Letu Husi, Wenyu Wang, and Cunde Xiao

Status: final response (author comments only)

RC1:
'Comment on essd-2025-447', Patrick Eriksson, 25 Oct 2025
The manuscript by Yang et al. presents a new dataset of retrievals based on the MWHS instrument series. These retrievals focus on the ice water path (IWP), but also cloud IWP (CIWP) and cloud mask are considered. Retrievals of IWP based on operational microwave radiometers are surprisingly few, despite some clear advantages of such measurements for the task. An important forerunner is the work of Holl et al. (2014), also applying machine learning, using the same reference dataset (2C-ICE) and making use of similar microwave radiometers. However, Holl et al. (2014) also included passive near and thermal infrared (IR) measurements and in such way increased the sensitivity at conditions matching lower IWP. On the other hand, by involving near-IR data a restriction to day-time was introduced, a limitation avoided in this work. Another strength of this work is the relatively long time series of data provided, in contrast to Holl et al. (2014) that so far not been applied in an operational manner.
That is, the retrievals presented fill an important gap, and we want to see this dataset description being published in ESSD. However, the manuscript requires a major revision; at least, the details of how these retrievals based on machine learning were developed must be better described and the characterization of the retrieval performance must be extended. Details behind this recommendation are elaborated below.
As there will be several references to our own work, including a suggestion to consider data produced by us, we've decided to not stay anonymous in the interest of transparency. This review is made by Patrick Eriksson, assisted by PhD student Peter McEvoy. That said, we think the references to our own work are motivated.
General comments:
The description of input data is in some parts detailed, such as the quality filtering. The specifications of the instruments' channels are found in the Supplement (but the existence of the supplement is not mentioned). On the other hand, very basic information is missing. Most importantly, the scanning type of the instrument and the footprint sizes are ignored. According to the WMO OSCAR database, the MWHM instruments are cross-track scanners having a 183 GHz nadir footprint size of 16 km. The lower frequencies included in MWHS-2 have a 32 km nadir resolution. Two major concerns appear here, none discussed in the manuscript. For a cross-track scanner, the footprint sizes vary with position inside the swath. How is this handled in the training, and does the retrieval accuracy depend on scan angle? Further, for MWHS-2 the retrievals involve channels having different spatial resolutions. How is this handled?

The footprint size is also of concern for the generation of the training database. With a MWHS resolution >= 16 km, even in the best case, only a fraction of the MWHS footprint is covered by the reference data (2C-ICE). The lack of footprint filling will result in erroneous training data, as the average IWP over the full footprint is not represented. This is an important concern, not discussed at all.

The manuscript is not clear, but it seems that the same problem is present in the test data, resulting in that the true retrieval error can not be assessed, just the error with respect to a partial IWP over the footprint. There is also a concern that inhomogeneous situations (but still physically correct) situations are ignored, see further comment for line 145.

Besides IWP, cloud ice water path (CIWP) is retrieved. There exists no firm definition of cloud ice (see Eriksson et al. (2025) for our view on the topic), still the authors do not define what they consider as cloud ice. There is a reference to Li et al. (2012), but this is ambiguous information, as explained below (comments for line 157). However, cloud ice is normally taken to roughly match suspended ice hydrometeors, thus having a size below 150 um (Li et al., 2012). It has been shown that microwave observations at the frequencies of concern have low sensitivity to such ice particles (e.g. Ekelund et al., 2020). Despite the low sensitivity, values can be extracted with ML, but the obtained results for CIWP should contain a low degree of direct measurement information; the CIWP is rather estimated through the correlation to IWP in the training dataset. Accordingly, the authors must motivate why the retrieval of CIWP is included. If it is kept, the limitations of this quantity must be analyzed and properly described.

In fact, by not including any infrared data (as done by Holl et al. (2011)) the sensitivity to smaller ice crystals and lower IWP is limited. This is acknowledged on line 381, but is not properly discussed in the main text or explored in the error characterization. The latter is mainly done in such way that the errors for IWP below about 100 g/m2 are difficult to discern. In particular, the performance when true IWP is zero is not clarified.

A CLM product is mentioned several times, including in the abstract, but no results from this product are shown. In fact, not even the nature of this product is described. Is this product holding cloud probabilities or binary mask (cloudy or not)? What vertical resolution?

Several statistical measures of the retrieval performance are derived, but these all assumes that the reference dataset is error free. The errors inherited from 2C-ICE must somehow be represented and brought forward. The results are compared to several other datasets, but they are all (beside DARDAR) known to have considerable biases with respect to 2C-ICE (Eliasson et al. (2011); Duncan and Eriksson (2018)) and these comparisons bring little value. On this side we can not avoid bringing up the Chalmers Cloud Ice Climatology (CCIC, Amell et al. (2024)), having important overlap with this work (also machine learning using 2C-ICE as reference). For averaged values, CCIC matches 2C-ICE well, and as such should constitute a more interesting dataset for comparison. As CCIC is based on infrared measurements and in this work only microwaves are considered, there is value in contrasting the strengths and weaknesses of the two dataset. As CCIC is quasi-global with 30 min resolution, even local retrievals can be compared.

Bad retrievals are mentioned here and there, but it is very hard to get an overview of what data to avoid and how these have been handled. Are bad retrievals included on gridded monthly means, and how they have been handled within the manuscript (e.g. when producing Fig. 6)? A complication here is that measurements are referred to by both using the instrument and satellite names. Adding a table with periods and areas with bad retrievals would help. That said, all retrievals that the manuscript cover shall be considered as results, and e.g. FY3A shall be included in Fig. 7.

The language is in many parts not clear. Examples are found below, but this shall not be taken as a complete list of language issues.

Specific comments
Line 21: According to the tables in the supplement, the instruments of concern can not be said to be "high-noise". It can also be questioned if noise is the main challenge in these inversions, this would be an ill-posed problem even in the limit of zero noise.
Line 23: It is unclear what is meant with "synoptic type that orbital-resolution", but presumably this refers to what is normally denoted to as level 2. The standard nomenclature of level 2 and 3 data should be adopted, see e.g. https://www.earthdata.nasa.gov/learn/earth-observation-data-basics/data-processing-levels
Line 26: Where is there a compromise between the accuracy on footprint level and the spatial-temporal completeness?
Line 27: The statement of "unprecedented" is vague and can be questioned. With respect to understanding the cloud feedback, for example, the retrievals based on MODIS must be considered as equally or more interesting. In any case, the CCIC retrievals (Amell et al. (2024); Pfreundschuh et al. (2025)) have a much higher spatial-temporal coverage, still offering a similar accuracy (as also trained on 2C-ICE).
Lines 32-36: The impact of cloud ice on the radiation budget is brought forward as the main motivation, but as the measurements of concern do not constrain the amount of cloud ice in a direct manner (as discussed above), other passive observations are more relevant for this aspect. On the other hand, the relatively direct measurement of larger ice hydrometeors is of high relevance for e.g. distribution of latent heat and understanding precipitation processes. That is, the motivation to bring forward should be considered.
Line 37: The statement of discrepancies in climate models "by orders of magnitude" needs closer specification. It is not true for mean IWP.
Line 49: Much of our knowledge in this matter goes back to work by Frank Evans, e.g. Evans and Stephens (1995), and seems reasonable to cite any of those works (as done by Zhao and Weng (2002)).
Line 51: "vertical profiles of the IWP"; IWP is a column value.
Line 64: The logic in these two sentences is not clear. Rephrase.
Line 81: Please replace Amell (2021) with the related journal publication Amell et al. (2022).
Line 83: The statement about Tana et al. (2025) does not seem correct. This was achieved, at least, in Amell et al. (2024).
Line 104: Wang et al. (2024) does not exist in the reference list.
Line 105: Tables S1-S4 are referenced in the text. It is very unclear that this refers to tables within the supplemental material. Please clarify that there is a supplement.
Line 105-106: The meaning of this sentence is unclear. What else than L1 should be used as basis for the retrievals?
Line 124-135: The quality control of the input Fengyun data is presented. Was any quality control or filtering applied to the 2C-ICE reference data?
Line 140: FY-3D and CloudSat are presented to be 30 min apart. If correct, there should not be any tropical collocations inside 15 min.
Lines 144: "pixel" seems to here mean boresight, but pixel indicates an area and is easily interpreted as footprint.
Line 145: This second criterion states, limiting co-locations to cases where the coefficient of variation for 2C-ICE pixels within an MWHS-II pixel is less than 0.6. This introduces a bias due to training only on relatively uniform cases. It would be helpful to have more motivation for this choice. Further, it must be clarified how the removed cases are considered in the error characterization.
Line 149: Please clarify what is meant by uniform distribution across latitude bands and how that is achieved.
Lines 149-153: Please clarify if and how these multiple training subsets are used. Or are they combined in some manner? Is there a separate model trained for each combination of MWHS-II and MWHS-I with IWP and CIWP?
Line 155: What is a balanced representation? In any case, motivate why going away from using the actual statistics of the reference dataset.
Line 157: As mentioned, just a reference to Li et al. (2012) is not sufficient. There is also a dot after (2012).
Line 158: Why are the number of cases for CIWP and IWP not the same (there should exist CIWP value for IWP)?
Line 159: As Sec 3 is very short (too short) it seems reasonable to merge Secs. 2 and 3.
Line 161: How has this resolution been determined? It sounds unlikely as not all channels used have a resolution of 15 km, and this resolution is only reached at nadir.
Line 162-163: Please provide more details on how the monthly means are provided. Any weighting of the data? Are all grid cells filled? Typical number of retrievals in each mean? Are those numbers reported in the resulting data files? See also first data comment.
Line 167: What is meant by "fundamental model" here? We can not find any RobustResMPL model outside this work. Or are the authors claiming to introduce RobustResMPL (but comments below contradict this)?
Line 169: Can 9 million parameters be considered as lightweight considering the few input data and the relatively limited scope of the model? For comparison, the MLP in Amell et al. (2022) had 0.3 million parameters. In any case, 9 million parameters seems large when compared to the training set of 700 000 – 900 000 cases. There should be a high risk for overfitting.
Line 170: “We make several significant improvements to the RobustResMLP” indicates that an existing model was used, but there is no reference to it.
Line 170: This list is appreciated. However, for a reader in the geoscience community, these techniques may not be familiar. It would be very helpful to have references to articles or other resources that provide more details on the techniques behind these improvements. Similarly, in Figure 1, references for "Lightweight Attention" and "Adaptive Feature Scaling" would be appreciated.
Sec 4.1: Since this is a supervised machine learning, the retrievals will work as long as the scenario being observed is similar to those in the training dataset. How does it handle rare events that are not close to the training set? Is there a way for the method to identify or flag retrievals that risk being out-of-distribution?
Line 186: Interesting solution on ensuring continuity across satellite generations by remapping values to the 150 GHz channel. However, at least a sentence or two quantifying any errors introduced by this approach, or providing motivation for why this can be expected to work sufficiently well, is motivated.
Line 193: Should be Fig. 1.
Figure 1: The first text box indicates that MWHS-I and II are used together. Presumably "and" should be "or".
Figure 1: The second text box explains that auxiliary data are used, but this is not mentioned in the text.
Figure 1: The text box "MLP Based Model for Mapping 150 GHz to 166 GHz" contradicts what was written in line 186, where the MLP is described as mapping to 150 GHz.
Line 200: With a log-transform, it must be described how IWP=0 was handled.
Line 219: It must be clarified how "detect clouds" is defined, for both retrievals. In the case of MWHS a log-transform is used and then no retrieval will be strictly zero.
Line 229-230: The naming convention suggests the orbital retrieval data is level 1, even though it is level 2. Recommend removing or changing L1 to L2 in the filenames.
Line 237: The naming convention proposes that the gridded data is level 1, even though it is level 3. Recommend removing or changing L1 to L3 in the filenames.
Line 240: Detail how "the most temporally stable" were identified and extracted.
Line 241: We were unable to find “Merged_Global_Mean.nc” in the data portal.
Sec 6: In brief, this section must be extended. The presented results must be discussed more carefully. For example, Fig. 3 having six panels of results is just commented in very brief terms. In addition, errors not covered by the present analysis must be incorporated, as indicated above.
Sec 6: For what data are the statistics derived? Training or validation data should not be used here. It must be clarified that the test data are sampled in an unbiased way. They should represent a fully random selection.
Sec 6: Since the model is trained and tested on 2C-ICE data, any bias and error from this dataset will be inherited. This issue should be discussed and the magnitude of these inherited errors should be listed.
Sec 6.1: We suggest making a figure with the occurrence fraction (histogram of values) of IWP in the training and test data, and retrieved dataset. This would clarify the nature of the training and test data, and also show the dynamic range of the retrievals.
Line 253: Please quantify “high accuracy”. The statement can be questioned as the biases reported are considerable.
Line 256: Start a new paragraph when starting to discuss Table 3. Same at line 261, when moving to SHAP.
Lines 256-257: The statement referring to Table 3 must be explained, what results show this? The next sentence "Our analysis ...", is this an explanation to the previous sentence, or a new topic?
Line 261: Clarify that Figs. S1 and S2 are in the supplementary material. What is SHAP?
Line 264: Wang et al. (2024) does not exist in reference list. In what way is there a consistency with Wang et al.?
Tables 2 and 3: Negative biases far exceeding the stated global mean IWP are reported. How is this possible. Are the retrievals giving negative values?
Table 3: Which combination of these inputs is the one applied for the general processing? Consider stating that clearly in the text, for example in Sec. 4. The two last combinations use lat and lon as input. Is that a wise choice? The training can then basically learn the geographical distribution (especially when using a net with millions of parameters). This could maybe be OK for some application, but should be avoided if temporal changes and trends are considered. If the model applied generally takes lat and lon as input, the consequence of this choice must be explored.
Figure 3: As IWP spans order of magnitudes, panels c and d should have a logarithmic x-axis, and the y-axis in f should give the relative error. As pointed out above, the errors when true IWP is zero must be reported somehow. For the first bar in panel b, please note that any retrieved IWP > 0 for (true) IWP=0 corresponds to an infinite relative error.
Sec. 7: The section only deals with IWP. There are no validations or comparisons for the other variables: CIWP and cloud mask. Such validations should also be performed.
Sec. 7: The section seems to ignore FY-3A. Does that indicate that those retrievals not are trustwhorthy? Include FY-3A, or remove it from the article (and disseminated data).
Sec. 7: Consider to include CCIC, as this is a product developed with similar objectives.
Figure 4: It is impossible to make a sensible comparison to 2C-ICE. This comparison requires that MWHS and ERA5 IWPs along the 2C-ICE transects are extracted and plotted together with 2C-ICE IWP as a line plot (and the transects added to panels e and h). Please include information on what satellite that carried the two instruments considered.
Line 305: The text reads as suggesting that Figure 5 shows that all IWP products exhibit fundamentally consistent spatial patterns. However, there are arguably larger differences in distribution between the FY-3X products than with the reference products, for example FY-3B and FY-3D. Text mentions that they are both "afternoon satellites". Are such differences expected?
Line 315: For clarity, please list which satellites are MWHS-I-based here again. To make it easier for users of the dataset to know what files to use.
Figure 7: According to the gridded dataset, FY-3B and FY-3C have overlapping data 2014–2019. The full FY-3B range should be included, or its omission should be motivated.
Line 330: Melia et al. (2016) does not seem to be a proper reference for DARDAR.
Figure 7: A downward trend for IWP for the FY-3X satellites can be seen, which is not reflected in MODIS, VIIRS or ERA5. That should be discussed. The FY-3X retrievals also appear to have a more clear annual cycle than any other dataset, that also should be discussed.
Sec 8: This section should be rewritten considering the general issues lifted above. In short, the section should also bring up limitations.
Line 348-353: In what way is the CLM a "distinct IWP product"?
Line 355-356: This reads as that neural networks automatically give a superior sampling. This is not correct, it is the choice of instrument that governs this aspect.
Line 357: The text indicates that "temporal continuity" has been achieved, while the main text mentions several data issues and biases between the FY-3X satellites are seen in Fig. 7.
Line 386-393: This "outlook" is not very relevant and can be removed. If kept, it must be revised to properly account for ongoing work in these directions. In addition, the previous paragraph is already of outlook character.
Line 398: Make it very clear where to find the data. Consider putting it as the first sentence. "The presented datasets are available at". The current phrasing is unclear.
Line 433-438: Duplicate reference.
Data comments
In the gridded data, all FY3B_MWHSX_GBAL_L1_YYYY_MEAN.nc (iwp) files have clear swath artefacts for certain months and they show up clearly when taking the yearly mean. Is there a suggested way to filter out these artefacts?
Is there a recommended way to combine overlapping files from different satellites to get a best estimate for the monthly gridded IWP?
There are some metadata issues in the NetCDF files; metadata units for IWP and CIWP are kg/m2, but value ranges suggest they are in g/m2. Cloud Mask Classification could benefit from a description on how to interpret the values.
Due to a suspected technical issue with the data provider, download speeds are unusably slow (max 20 KB/s). It takes > 4 hours to download the 300 MB gridded level 3 data files, and makes it unfeasible to download the > 400 GB orbital zip file. We have tried to download this on 9th Oct, 10th Oct and 12th Oct from multiple different internet connections in an effort to rule out local technical issues. Due to this, we were unable to look at the orbital level 2 dataset and its usefulness for the scientific community appears limited. For this reason we feel forced, at this moment, to rate the data quality as poor.
References
Amell, A., Eriksson, P., & Pfreundschuh, S. (2022). Ice water path retrievals from Meteosat-9 using quantile regression neural networks. Atmospheric Measurement Techniques, 15(19), 5701-5717.
Amell, A., Pfreundschuh, S., & Eriksson, P. (2024). The chalmers cloud ice climatology: Retrieval implementation and validation. Atmospheric Measurement Techniques, 17(14), 4337-4368.
Duncan, D. I., & Eriksson, P. (2018). An update on global atmospheric ice estimates from satellite observations and reanalyses. Atmospheric Chemistry and Physics, 18(15), 11205-11219.
Ekelund, R., Eriksson, P., & Pfreundschuh, S. (2020). Using passive and active observations at microwave and sub-millimetre wavelengths to constrain ice particle models. Atmospheric Measurement Techniques, 13(2), 501-520.
Eliasson, S., Buehler, S. A., Milz, M., Eriksson, P., & John, V. O. (2011). Assessing observed and modelled spatial distributions of ice water path using satellite data. Atmospheric Chemistry and Physics, 11(1), 375-391.
Eriksson, P., Baró Pérez, A., Müller, N., Hallborn, H., May, E., Brath, M., ... & Ickes, L. (2025). Advancements and continued challenges in global modelling and observations of atmospheric ice masses. EGUsphere, 2025, 1-42.
Evans, K. F., & Stephens, G. L. (1995). Microweve radiative transfer through clouds composed of realistically shaped ice crystals. Part II. Remote sensing of ice clouds. Journal of Atmospheric Sciences, 52(11), 2058-2072.
Holl, G., Eliasson, S., Mendrok, J., & Buehler, S. A. (2014). SPARE‐ICE: Synergistic ice water path from passive operational sensors. Journal of Geophysical Research: Atmospheres, 119(3), 1504-1523.
Li, J. L., Waliser, D. E., Chen, W. T., Guan, B., Kubar, T., Stephens, G., ... & Horowitz, L. (2012). An observationally based evaluation of cloud ice water in CMIP3 and CMIP5 GCMs and contemporary reanalyses using contemporary satellite data. Journal of Geophysical Research: Atmospheres, 117(D16).
Pfreundschuh, S., Kukulies, J., Amell, A., Hallborn, H., May, E., & Eriksson, P. (2025). The chalmers cloud ice climatology: A novel robust climate record of frozen cloud hydrometeor concentrations. Journal of Geophysical Research: Atmospheres, 130(6), e2024JD042618.
Zhao, L., & Weng, F. (2002). Retrieval of ice cloud parameters using the Advanced Microwave Sounding Unit. Journal of Applied Meteorology, 41(4), 384-395.
Citation: https://doi.org/10.5194/essd-2025-447-RC1
- AC1: 'Response to Reviewers: Manuscript essd-2025-447', Yifan Yang, 08 Dec 2025
  
  The comment was uploaded in the form of a supplement: https://essd.copernicus.org/preprints/essd-2025-447/essd-2025-447-AC1-supplement.pdf
  
  Citation: https://doi.org/10.5194/essd-2025-447-AC1
RC2:
'Comment on essd-2025-447', Anonymous Referee #2, 02 Nov 2025
This manuscript presents a machine learning-based framework for retrieving the global Ice Water Path (IWP) from Fengyun series satellites (MWHS-I/II). The topic is of significant scientific importance and practical value, particularly as it develops a data product which is crucial for enriching the data sources available for global cloud and climate change research. The authors’ effort to provide access to the data and some of the code aligns with the principles of open science and is commendable.
However, as a manuscript submitted to Earth System Science Data, the current version suffers from significant shortcomings in the completeness of the methodology, description of the data product, adequacy of the validation, and overall reproducibility. The manuscript currently reads more like a summary of a traditional research paper rather than a comprehensive dataset description. To meet the publication standards of ESSD, Major Revision is required. Below are my comments to improve the manuscript.
The core focus of ESSD is the data itself. I strongly recommend that the authors restructure the paper to shift the emphasis from “algorithm research” to a “dataset description.” A dedicated section should be added to meticulously describe the final dataset’s format, such as variables, spatio-temporal resolution, coverage, quality control flags.

Ambiguous Input Features: It is unclear which input features were used to retrieve the IWP. Was it only the brightness temperatures from the MWHS-I/II channels? Were other ancillary data used, such as viewing geometry (satellite zenith angle), surface type, or elevation? This information is essential for understanding the model’s performance and for the study to be reproducible. A clear list of all input features is required.

Providing a per-pixel retrieval uncertainty estimate is a standard requirement for remote sensing data products. The authors need to state whether the dataset includes uncertainty information and, if so, explain how it was estimated (e.g., through model ensembling, quantile regression, or another method).

There is ambiguity between “IWP” and “CIWP”. The text mentions a process “employed to extract CIWP from the IWP data” (Line 121-122), yet the title and most of the text refer to “IWP.” The authors need to clearly define whether the model retrieves the total atmospheric ice water path (IWP) or is specific to the cloud ice water path (CIWP). This distinction is fundamental to the dataset’s definition and applicability.

The spatio-temporal collocation between the wide-swath Fengyun satellites and the nadir-viewing CloudSat is a critical step in building the training dataset. The authors have completely omitted a description of this process. What was the time window for a match? What was the spatial matching criterion (e.g., distance between the FY pixel center and the CloudSat footprint)? How many CloudSat profiles were averaged to match one FY pixel? These details must be added.

Aside from the weaknesses of the overall motivation, there are several specific comments with the technical content:
(Line 120) “truth value”: As mentioned in the major comments, please replace “truth value” with a more appropriate term like “reference value”.

(Line 121) CIWP Extraction Method: The phrase “employed to extract CIWP from the IWP data” is too vague. Please detail how the 2B-CLDCLASS data were used to derive CIWP from the 2C-ICE IWP. Was it by simply filtering for pixels classified as “high cloud”? How were multi-layer cloud scenarios handled?

The introduction should more clearly articulate the unique advantages and necessity of developing an IWP product based on Fengyun satellites compared to existing products (e.g., from MODIS, AIRS, MLS). Does it fill a spatio-temporal gap, or does it offer potential accuracy improvements in specific areas?

Explain all abbreviations (e.g., FY-4A, IWP, MWHS) upon first appearance.
Citation: https://doi.org/10.5194/essd-2025-447-RC2
- AC1: 'Response to Reviewers: Manuscript essd-2025-447', Yifan Yang, 08 Dec 2025
  
  The comment was uploaded in the form of a supplement: https://essd.copernicus.org/preprints/essd-2025-447/essd-2025-447-AC1-supplement.pdf
  
  Citation: https://doi.org/10.5194/essd-2025-447-AC1
AC1: 'Response to Reviewers: Manuscript essd-2025-447', Yifan Yang, 08 Dec 2025

The comment was uploaded in the form of a supplement: https://essd.copernicus.org/preprints/essd-2025-447/essd-2025-447-AC1-supplement.pdf

Citation: https://doi.org/10.5194/essd-2025-447-AC1

Yifan Yang, Tingfeng Dou, Gaojie Xu, Rui Zhou, Bo Li, Letu Husi, Wenyu Wang, and Cunde Xiao

Supplement

https://doi.org/10.5194/essd-2025-447-supplement

Data sets

Fengyun polar-orbiting satellite total/cloud ice water path retrieval dataset (2009-2024). Yifan Yang, Tingfeng Dou, Gaojie Xu, Rui Zhou, Bo Li, Letu Husi, Wenyu Wang, Cunde Xiao https://doi.org/10.11888/Atmos.tpdc.302932

Model code and software

Global Ice Water Path Retrieval Using Fengyun series Satellite Data: A Machine Learning Approach/generating figures and pre-post- precessing code Yifan Yang https://doi.org/10.5281/zenodo.16352116

Yifan Yang, Tingfeng Dou, Gaojie Xu, Rui Zhou, Bo Li, Letu Husi, Wenyu Wang, and Cunde Xiao

Viewed

Total article views: 1,125 (including HTML, PDF, and XML)

HTML	PDF	XML	Total	Supplement	BibTeX	EndNote
1,015	72	38	1,125	49	40	50

HTML: 1,015
PDF: 72
XML: 38
Total: 1,125
Supplement: 49
BibTeX: 40
EndNote: 50

Views and downloads (calculated since 10 Sep 2025)

Month	HTML	PDF	XML	Total
Sep 2025	805	10	9	824
Oct 2025	112	18	10	140
Nov 2025	55	13	10	78
Dec 2025	43	31	9	83

Cumulative views and downloads (calculated since 10 Sep 2025)

Month	HTML	PDF	XML	Total
Sep 2025	805	10	9	824
Oct 2025	112	18	10	140
Nov 2025	55	13	10	78
Dec 2025	43	31	9	83

Viewed (geographical distribution)

Total article views: 1,079 (including HTML, PDF, and XML) Thereof 1,079 with geography defined and 0 with unknown origin.

Country	#	Views	%

Latest update: 31 Dec 2025

Download

Preprint (1919 KB)
Metadata XML

Short summary

We built an AI using China's Fengyun satellites (2009–2024) to map global atmospheric ice vital for climate. It processes tough data, making 3 public sets: orbital ice scans, monthly global maps, cloud masks. First long-term ice records over land/ocean from Chinese satellite. Offers unmatched coverage for decade climate studies despite precision limits.


Total:	0
HTML:	0
PDF:	0
XML:	0