A global base temperature dataset for building energy demand modeling
Abstract. Accurate building energy demand modeling is critical to decarbonizing regional energy systems. The cooling and heating degree-day models are widely used due to their simplicity and low data requirements; however, the lack of accurate base temperature data limits their performance. In particular, the scarcity of high temporal resolution building energy demand data constrains regional-scale base temperature estimation through conventional methods such as the energy signature method and the performance line method. To address this limitation, this study develops a global regional-scale base temperature dataset based on the BiLSTM neural network framework with an attention mechanism. The dataset includes both cooling base temperature (Tcool) and heating base temperature (Theat) for each region, defined at a spatial scale equivalent to a U.S. state or a Chinese province. The BiLSTM framework demonstrates strong performance, with RMSE values of 1.39°C for training and 1.33°C for testing, and Pearson correlation coefficients of 0.84 for Tcool and 0.70 for Theat. Predicted results show that global Tcool ranges from 19–25°C and Theat from 14–18°C, consistent with physical principles. External validations using 16 independent datasets demonstrate that the predicted base temperatures significantly improve the accuracy of building energy demand modeling, reducing RMSE by 10.01% for cooling and 10.02% for heating, compared to official or empirical base temperatures. This dataset supplements sparse observational base temperature data and enhances the accuracy of building energy demand modeling, contributing to low-carbon energy system planning, broader climate impact assessment and weather-related financial applications. The proposed global Tbase dataset can be acquired from https://doi.org/10.6084/m9.figshare.30646376.v2 (He et al., 2025).
Overall, this paper presents an important dataset to show the base temperature globally. Without doubt, their effort will speed up the method to benchmark/set the base temperature, thereby regulating the adoption of air-conditioning systems or central heating systems for heating or cooling. This paper is overall well-written and logical. However, I suggest authors should make some revisions.
Authors may reflect residential buildings in the title of this paper.
The introduction can be rewritten to shorten the information presented before line 100, while the sentence in Line 103-105 and subsequent descriptions in this paragraph are odd. These contents do not really show a solid research gap. I encourage authors should re-visit how existing methods are costly and impractical. The mechanism to benchmark the base temperature in existing studies should also be criticized with their strengths and weaknesses.
Another comment on the introduction is that authors have well-charted the factors which affect energy use and the methods to characterize and benchmark the base temperature. However, authors have not presented the factors the terrain or topography. The terrain has also shaped energy use, occupant behaviors, and methods to calculate them. Moreover, in many countries, the determination of base temperature is associated with the culture and behaviors. There are also many studies linking the base temperature with indoor thermal comfort which is diverse among different populations.
Line 210, when describing methods, it is better to show the unit/metric of different variables. Based on this way, we can make sure the adoption of right indicators and basic mechanism. Furthermore, beyond ASHRAE, the IPCC also advocated the UK model to use and its accuracy in Europe is more solid than ASHRAE. Therefore, authors may re-think the adoption of Eq-1 to develop the model.
Figure 2, authors may reflect some areas that do not have any cooling demand or some areas without heating demand.