Data

The temperature forecast postprocessing methodologies used in this work require temperature observations and Numerical Weather Prediction (NWP) temperature forecasts. The available data for this study comprises 27 months, from March 2022 to May 2024 (both included).

Temperature observations

The Meteorological Service of Catalonia maintains a dense network of 185 automatic weather stations (AWS) (Serra et al. 2016) that meet WMO standards in location, measurements, and quality control. AWS are spread across Catalonia, with most of them (161) located in flat and populated areas. The remaining stations are divided evenly, with half situated between 500 and 2000 m above sea level (m a.s.l.), and the other half above 2000 m a.s.l., considered high mountainous areas. Data is collected every 30 minutes, but here, only hourly 2-m air temperature data was considered.

Numerical Weather Prediction model

Numerical Weather Prediction (NWP) model data was used from 10 different models to create what is commonly known as a Poor Man's Ensemble (PME) or multi-model ensemble. The 10 models considered in this study are presented in Table 1. These NWP models have different spatial resolutions, ranging from 1 to 10 km. Before selecting the nearest grid point of each model from every AWS, an interpolation to a 1 km spatial resolution is applied using Python library unimodel.

This approach follows a method similar to that described by Demaeyer et al., (2023) to account for altitude differences between each model's orography and that of a 1 km grid. The correction is performed by applying a dynamic lapse-rate correction based on linear regression of air temperature-altitude pairs around each grid point of the model, as outlined by Sheridan et al., (2010).

Table 1: Characteristics of the NWP models used in this study, including their names, spatial resolutions (in kilometers), temporal resolutions (in hours), and available lead times (in hours).

Since the 10 models considered here have different forecast lead times of 48 and 72 hours, the postprocessing for this study is limited to 48 hours. This ensures that all models are postprocessed with consistent lead times, and that the applied postprocessing techniques, EMOS Improver and Distributional Regression Networks, receive the same amount of input feature variables.

Training, validation, and test datasets

One of the postprocessing techniques relies on neural networks, which are typically trained using training and validation datasets and then evaluated on an unseen dataset called the test dataset. In this study, from a total of 27 available months, 18 were allocated to the training set, 5 to the validation set, and 4 to the test set. The validation and test months were chosen to encompass various characteristics and to cover all seasons of the year. The distribution of each split and the months assigned to them are illustrated below.

References

Demaeyer, J., Bhend, J., Lerch, S., Primo, C., Van Schaeybroeck, B., Atencia, A., ... & Vannitsem, S. (2023). The EUPPBench postprocessing benchmark dataset v1. 0. Earth System Science Data, 15(6), 2635-2653.

Sheridan, P., Smith, S., Brown, A., & Vosper, S. (2010). A simple height‐based correction for temperature downscaling in complex terrain. Meteorological Applications, 17(3), 329-339.