Journal topic
Earth Syst. Sci. Data, 11, 797–821, 2019
https://doi.org/10.5194/essd-11-797-2019
Earth Syst. Sci. Data, 11, 797–821, 2019
https://doi.org/10.5194/essd-11-797-2019

Review article 12 Jun 2019

Review article | 12 Jun 2019

# Merits of novel high-resolution estimates and existing long-term estimates of humidity and incident radiation in a complex domain

Merits of novel high-resolution estimates and existing long-term estimates of humidity and incident radiation in a complex domain
Helene Birkelund Erlandsen1,2,3, Lena Merete Tallaksen2, and Jørn Kristiansen3 Helene Birkelund Erlandsen et al.
• 1Norwegian Water Resources and Energy Directorate, Hydrology Department, Oslo, Norway
• 2University of Oslo, Department of Geoscienses, Oslo, Norway
• 3The Norwegian Meteorological Institute, Development Centre for Weather Forecasting, Oslo, Norway

Correspondence: Helene Birkelund Erlandsen (hebe@nve.no)

Abstract

To provide better and more robust estimates of evaporation and snowmelt in a changing climate, hydrological and ecological modeling practices are shifting towards solving the surface energy balance. In addition to precipitation and near-surface temperature (T2), which often are available at high resolution from national providers, high-quality estimates of 2 m humidity and surface incident shortwave (SW) and longwave (LW) radiation are also required. Novel, gridded estimates of humidity and incident radiation are constructed using a methodology similar to that used in the development of the WATCH forcing data; however, national 1 km×1 km gridded, observation-based T2 data are consulted in the downscaling rather than the $\mathrm{0.5}{}^{\circ }×\mathrm{0.5}{}^{\circ }$ Climatic Research Unit (CRU) T2 data. The novel data set, HySN, covering 1979 to 2017, is archived in Zenodo (https://doi.org/10.5281/zenodo.1970170). The HySN estimates, existing estimates from reanalysis data, post-processed reanalysis data, and Variable Infiltration Capacity (VIC) type forcing data are compared with observations from the Norwegian mainland from 1982 through 1999. Humidity measurements from 84 stations are used, and, by employing quality control routines and including agricultural stations, SW observations from 10 stations are made available. Meanwhile, only two stations have observations of LW. Vertical gradients, differences when compared at common altitudes, daily correlations, sensitivities to air mass type, and, where possible, trends and geographical gradients in seasonal means are assessed. At individual stations, differences in seasonal means from the observations are as large as 7 C for dew point temperature, 62 W m−2 for SW, and 24 W m−2 for LW. Most models overestimate SW and underestimate LW. Horizontal resolution is not a predictor of the model's efficiency. Daily correlation is better captured in the products based on newer reanalysis data. Certain model estimates show different dependencies on geographical features, diverging trends, or a different sensitivity to air mass type than the observations.

1 Introduction

Geophysical modeling is advancing, and more and more hydrological, ecological, and land surface models (from here on referred to as land models) are now estimating the surface energy balance . Shortwave radiation is the exogenous energy provider to Earth. At middle and higher latitudes, surface downward longwave radiation is an equally important radiative driver at the surface. Estimating the surface energy balance provides a sensible heat flux, as well as a latent heat flux, which in turn can be converted to evaporation or snowmelt, key variables for estimating the surface water balance.

Recent studies have shown the added value of using more forcing data than only precipitation and temperature when modeling evaporation or snow cover . High-quality and robust diagnoses, forecasts, and projections of evaporation- and snowmelt-related processes are essential for flood and hydropower management. Further, gridded data sets of high quality are needed to statistically bias-correct or downscale future climate scenarios , to spin up land surface models , and to assist model development.

Considerable effort is used to improve process description in environmental models and compare the results of different models. When land models are run without coupling to an atmospheric model, i.e., in offline or stand-alone mode, meteorological near-surface variables, commonly referred to as forcing data, are required. In practice, different communities use different forcing data estimates, such as the more empirically based estimates from the MTCLIM algorithms , estimates from numerical weather and climate models, or a combination of the two (see, e.g., Mizukami et al.2016). The different approaches used make it more difficult to compare the output across land models and have resulted in dedicated projects where various models are run with similar forcing data in, for example, the Inter-Sectoral Impact Model Intercomparison Project (ISI-MIP; ).

The Norwegian operational hydrological models have historically been calibrated and adapted to use high-resolution, gridded 2 m temperature and precipitation data as forcing data. For this purpose high-resolution (1 km×1 km) data sets, which cover the period from 1957 until the present (2019) with a daily resolution have been developed, namely the SeNorge data . In recent times a long-term high-resolution, quantile-mapping-based gridded data set of near-surface wind speed has also been developed at the Norwegian Meteorological Institute (MET Norway, available from http://thredds.met.no/thredds/catalog/metusers/klinogrid/KliNoGrid_16.12/FFMRR-Nor/catalog.html, last access: 10 June 2019). Gridded, observation-based, high-resolution data sets for humidity and incident radiation are, however, lacking.

Previous studies have compared and validated gridded estimates of humidity or incident radiation globally , for regions in the US , coastal Brazil , France , and the pan-Arctic region . No previous studies have, as far as we know, compared and assessed the quality of high-resolution empirically based and reanalysis-based estimates of humidity and incident radiation for regions within Europe (let alone for Norway specifically). This results in an additional and unnecessary source of uncertainty for land modeling in Norway.

Norway's complex topography and coastline may suggest that high-resolution data sets would perform better. When land models are run the near-surface temperature from the forcing data is usually adjusted to sea level and then to the land model's fine-resolution grid cell elevation that uses a standard atmospheric lapse rate to account for the difference in terrain height in the forcing data model and the land model to be run. A standard atmospheric temperature lapse rate may be unreasonable in winter, at high latitudes , and in complex terrain , and a much lower resolution in the forcing data grid compared to the land model may increase these error components. Further, if humidity is not also adjusted for inconsistencies between temperature and humidity will likely result in an unrealistic relative humidity . Incident radiation is influenced by variables showing a strong vertical dependence like near-surface temperature and humidity, cloud cover (e.g., Marty2000), and local variations in surface components like vegetation and snow cover , and thus may likely benefit from vertical adjustment.

While the spatial correlation may improve in a data set with a high spatial resolution, highlight the need to address temporal correlation on timescales shorter than monthly scales in data constructed from reanalyses for the purposes of forcing land surface models. A high horizontal resolution may lead to a better representation of the average state of a variable but not necessarily to an improved description of the concurrent temporal evolution of the forcing variables on shorter timescales, e.g., during the rapid passage of a low-pressure system with multiple distinct air mass characteristics and precipitation types.

This study addresses the aforementioned sources of uncertainty concerning commonly used estimates of humidity, either in the form of vapor pressure (VP) or converted to dew point temperature (Td), incident longwave radiation (LW), and incident (global) shortwave radiation (SW) available for long-term land surface modeling in the region through the following processes.

• The construction of an original data set, HySN (to explore the benefit of utilizing a 1 km×1 km national data set of 2 m temperature in the post-processing reanalysis data);

• The gathering of global long-term gridded data sets of humidity and incident radiation from two reanalysis data sets, two post-processed reanalysis data sets, and two versions of empirically based estimates compiled for continental Norway;

• The aggregation of available observations of humidity and incident radiation between 1982 and 1999 from a variety of providers, and where necessary, implementing quality control routines;

• The construction of multiple linear regression models to provide vertical gradients in both the observations and the model estimates, so that the variables may be adjusted to a similar altitude before their differences are assessed;

• The correlation of model estimates with observations on a daily timescale is explored by compiling anomaly correlation coefficients;

• The comparison of the model estimates' cumulative distributions; their sensitivity to weather types, continentality, and latitude; and their decadal trend to the observational data.

Additionally, two hypotheses are investigated. a – there are vertical gradients in near-surface humidity and incident radiation in our domain. b – the added value of the high horizontal resolution of the more empirically based estimates outweighs the added value of relying on estimates from coarser-resolution numerical weather prediction reanalyses.

The data sets considered are two global reanalysis data sets, the NASA Modern-Era Retrospective Analysis for Research and Applications version 2 (MERRA2) , and ECMWF's ERA-Interim , two products based on reanalysis data post-processed using higher-resolution gridded observational data, the Princeton Global Meteorological forcing data set, version 2 (PGMFDv2) , the WATCH forcing data methodology applied to ERA-Interim (WFDEI) , and two versions of high-resolution empirically based estimates from the preprocessor of the Variable Infiltration Capacity (VIC) model, a macroscale hydrological model largely based on the MTCLIM algorithms. Finally, a novel data set, the HySN data set, is compiled for the current study and evaluated. HySN is compiled by employing a similar method as was used in the development of PGMFDv2 and WFDEI; however, in this case ERA-Interim near-surface humidity and incident radiation are post-processed using a national, high-resolution, gridded 2 m temperature data set, SeNorge2 .

2 The gridded humidity and radiation estimates considered

Long-term data sets that are freely available, which can be used to drive hydrological, ecological, and land surface models for the Norwegian domain, include the newer reanalyses: MERRA2 and ERA-Interim. Due to computational constraints, currently available long-term global reanalysis data have horizontal resolutions ranging from $\mathrm{2}{}^{\circ }×\mathrm{2}{}^{\circ }$ to $\mathrm{0.5}{}^{\circ }×\mathrm{0.66}{}^{\circ }$. The MERRA2 reanalysis has a resolution of 0.5 latitude $×\phantom{\rule{0.125em}{0ex}}\mathrm{0.66}{}^{\circ }$ longitude. Around Oslo, Norway, this corresponds to a grid cell height and length of about 56 km×42 km. The reanalysis data sets are based on global circulation models ingesting large amounts of observational data by making use of complex assimilation techniques. However, substantial biases may still occur in reanalysis data. found a mean error of +42.9 % in precipitation intensity in ERA-40 over Norway between 1961 and 1990. found a negative bias in ERA-Interim surface LW radiation and precipitation between November 2007 and December 2008 across middle and high latitudes in the Northern Hemisphere.

The coarse resolution of reanalysis data and the knowledge of biases that may be present in them has spurred the development of post-processed reanalysis data sets. The PGMFDv2 and WFDEI are data sets consisting of variables relevant for forcing land surface models. The relevant variables are extracted from reanalysis data and post-processed and downscaled with gridded observational data. Both data sets are global and have horizontal resolutions of $\mathrm{0.5}{}^{\circ }×\mathrm{0.5}{}^{\circ }$. PGMFDv2 and WFDEI both adjust reanalysis estimates of humidity and LW with the gridded, $\mathrm{0.5}{}^{\circ }×\mathrm{0.5}{}^{\circ }$ Climatic Research Unit (CRU) T2 following the methods described in the development of NLDAS (Cosgrove2003). Taking a note from these methods, a novel high-resolution product is developed and validated in the current study: Hybrid SeNorge, abbreviated as HySN. HySN is constructed by post-processing ERA-Interim humidity and radiances in a similar manner to PGMFDv2 and WFDEI but utilizing a national data set, the 1 km×1 km SeNorge2 T2, rather than the $\mathrm{0.5}{}^{\circ }×\mathrm{0.5}{}^{\circ }$ CRU T2.

Another source of near-surface humidity and incident radiation estimates are the MTCLIM algorithms, which combine first principles from atmospheric physics with empirical extrapolation logic. Precipitation and temperature, variables that often are available from a dense network of surface observation stations, are used to estimate shortwave radiation and humidity. Versions of the MTCLIM routines are used to provide forcing data for a large number of hydrological and ecological models; it has, for example, recently been made available for the Mesoscale Hydrological model (MHm v5.9, https://doi.org/10.5281/zenodo.1069202). The variables estimated from MTCLIM are often utilized for impact studies, e.g., the impacts of climate change and forest management on ecosystem services in Europe . The algorithms have also been used to generate several gridded data set products of humidity and radiation for the US (e.g., Livneh et al.2013) and are used to provide climate change projections of humidity and radiation for the US . However, several recent studies have found regionally inconsistent biases in the MTCLIM algorithms .

The orography and land masks of the models are presented in Fig. 1. Compared to ERA-Interim orography, the SeNorge grid elevation is on average higher (mean: 37 m, median: 13 m; see the red areas in Fig. 1). The difference in maximum elevation is more than 1000 m. Meanwhile, near the coast and in inland areas the ERA-Interim orography is predominantly higher (see the blue areas in Fig. 1). The data sets are summarized in Table 1. Further details considering ERA-Interim, MERRA2, WFDEI, PGMFDv2, WFDEI, HySN, and two data sets from the VIC model's preprocessor, largely based on the MTCLIM algorithms, are presented in the following.

Table 1The following data sets provide estimates of humidity, LW, and SW, which are then evaluated. Precipitation is denoted as P, 2 m temperature as T2. The global data sets are retrieved from online repositories, while the data sets with regional coverage are compiled locally based on the stated input data, the VFD data sets using the VIC model's preprocessor, and the HySN data set based on the methods outlined in the current study. Additional references are given elsewhere in the text.

## 2.1 ERA-Interim

The ERA-Interim is a reanalysis data set developed by ECMWF, covering the time period from 1979 until the present. It is based on a 2006 release of the ECMWF operational model system (IFS Cy31r2) and has a horizontal resolution of about 80 km. It includes a four-dimensional variational analysis (4D-Var). Surface observations are ingested by optimal interpolation. The variables evaluated in this study are daily means of 2 m temperature and dew point temperature from analysis times (00:00, 06:00, 12:00, and 18:00 UTC) and LW and SW taken between +12 and +24 h into the forecast to allow for spin-up (see, e.g., Weedon et al.2014).

## 2.2 Modern-Era Retrospective Analysis for Research and Applications 2 (MERRA2)

MERRA2 is an atmospheric reanalysis data set developed by NASA, available from 1980 until the present, with a horizontal resolution of 0.5${}^{\circ }×\mathrm{0.625}$. Mass conservation constraints are imposed so that assimilated observations do not cause large violations of the global water balance. In MERRA2 land surface observations are not assimilated. The data variables used in this study are model orography (Mer2015a), pressure, and humidity from atmospheric single level diagnostic (Mer2015c), LW, and SW (Mer2015b).

## 2.3 Princeton's global meteorological forcing data set version 2 (PGMFDv2)

PGMFDv2 is an updated version of the 0.5${}^{\circ }×{\mathrm{0.5}}^{\circ }$ 60-year Princeton global meteorological forcing data set . The updates are described in . The humidity, LW, and SW estimates are based on the National Centers for Environmental Prediction–National Center for Atmospheric Research (NCEP-NCAR) reanalysis but post-processed to comply with the gridded, observation-based time series of precipitation, temperature, and cloud cover, with a horizontal resolution of 0.5${}^{\circ }×{\mathrm{0.5}}^{\circ }$ from the Climatic Research Unit (CRU TS 3.2.1) and satellite estimates of LW and SW.

## 2.4 The WATCH forcing data methodology applied to ERA-Interim (WFDEI)

The application of the WATCH forcing data methodology to ERA-Interim reanalysis data, WFDEI, is described in . The data are available from 1979 to the present and have a horizontal resolution of 0.5${}^{\circ }×{\mathrm{0.5}}^{\circ }$. The humidity, LW, and SW estimates are based on ERA-Interim data, post-processed to comply with the global gridded, observation-based time series of 2 m temperature, cloud cover, and interannual aerosol loading from CRU TS, using CRU TS 3.2.1 prior to 2009 similar to PGMFDv2.

## 2.5 Hybrid SeNorge ERA-Interim, HySN (H)

As part of this study, additional estimates of humidity and LW are derived, using methods based on , adjusting ERA-Interim humidity and LW to comply with the newly developed, 1 km×1 km SeNorge2 T2 data set . Further, the ERA-Interim SW estimates are adjusted based on the previously adjusted humidity estimates and the 1 km ×1 km orography.

For consistency with SeNorge precipitation and T2, the variables have a temporal resolution of a day, starting from 06:00 UTC. The data are currently compiled for the time period 1979–2017 and cover the same domain as the SeNorge2 grid. The data compilation is described in detail in the Supplement. The HySN data product is freely available from Zenodo (https://doi.org/10.5281/zenodo.1970170), and the Python code to generate the data is available on GitHub (https://doi.org/10.5281/zenodo.1435555).

## 2.6 VIC type forcing data, VFDv1, and VFDv2

The humidity and radiation estimates referred to here as VIC type forcing data (VFD) (see, e.g., Bohn et al.2013; Pierce et al.2013) are products of the VIC model's preprocessor. The VIC model includes the option to generate gridded humidity and radiation from gridded daily precipitation and maximum and minimum temperature. The VIC model preprocessor includes a slightly modified version of the MTCLIM model and algorithms for estimating longwave radiation. The MTCLIM algorithms included in the VIC preprocessors to estimate humidity use a modified version of the Magnus formula with daily minimum temperature used as a proxy for Td . Shortwave radiation is estimated using the Thornton and Running algorithm . The variables are estimated simultaneously; i.e., the algorithms supply each other with information .

Two versions of VFD are evaluated in this study. The first version, from here on called VFDv1, uses daily precipitation and mean temperature from SeNorge version 1.1 , supported by hourly temperature fields from a regional atmospheric reanalysis data set, NOrwegian ReAnalysis (NORA10, ), with a resolution of about 11 km to compile maximum and minimum temperature using a method similar to . The VIC4.0.6 preprocessor is used with default options, i.e., a modified version of MTCLIM4.2, and the TVA clear-sky and Bras full-sky LW algorithm (Bras1990).

The second version of VIC type forcing data, from here on called VFDv2, is based on slightly different input data, i.e., precipitation and mean, maximum, and minimum temperature from a newer version of the 1 km by 1 km SeNorge data, SeNorge2 . The VIC4.0.6 preprocessor is used with default options, i.e., a modified version of MTCLIMv4.3, and with LW estimates based on the clear-sky algorithm combined with the full-sky algorithm (for further references see ).

3 Study area

Norway is located in the receiving end of the westerlies that pass over the North Atlantic. This, combined with a long coast lined with mountains provides Norway with 1500 mm of precipitation a year, with distinct regional differences in precipitation amounts received. Although almost 40 % of Norway is covered by forest, evaporation from the land surface is estimated to be less than a fourth of the received precipitation . Most of Norway will normally have snow cover in the winter season, with the length of the snow season varying from a few days to 300 d a year (dependent on latitude, elevation, and distance from the coast). Mean temperature (1971–2000) is 1.3 C, and varies from 7 C near the coast in southern Norway to −4C in the mountains. Between 1976 and 2014 T2 increased by half a degree Celsius per decade .

4 Surface observations

The model estimates and observational data are compared between 1982 through 1999. The observational data include humidity measurements from 84 sites, SW observed at 10 sites, and LW observed at two sites. The comparison of the model estimates of incident radiation with stations data is only made possible by including observations from agricultural stations and applying quality control routines. The observations are gathered from the University of Bergen (UiB, SW and LW measurements), and from the Norwegian Meteorological Institute's repository for observational data, which also includes measurements from agricultural stations conducted by the Norwegian Institute of Bioeconomy Research (NIBIO). The locations of the stations used in the comparison with the model estimates are shown in Fig. 1.

Figure 1Panels (a)(e) show the orography and land mask of MERRA2 (a), ERA-Interim (b), PGMFDv2 (c), WFDEI (d), and SeNorge (c), respectively, visualized on the SeNorge UTM33 grid with a green–brown color scheme. For reference, national borders and the coastline derived from a high-resolution data set are delineated in black. The locations of the 84 VP stations used in the model comparison are denoted with red crosses in (f). The locations of SW and LW stations are marked in (g) with red and orange markers, respectively. Note that the southernmost LW station also measures SW. The last map (h) displays the difference in meters between the SeNorge and ERA-Interim orography in common land areas. Higher elevations in SeNorge are indicated with red, while blue indicates higher elevations in ERA-Interim.

## 4.1 Humidity

Humidity observations from 84 stations are included in the study (see Fig. 1). A minimum of 5 years of daily data was necessary for the station data to be included; however, most stations have the complete station record (18 years) available. The observations, once converted to vapor pressure, have an uncertainty ranging from around 5 % at 20 C to 6 % at −20C (Gabriel Kielland, personal communication, MET Norway, 2019). The latitude, longitude, altitude, distance to the ocean, and the start and end date of the time series are given for each station in the Supplement (Tables S1 and S2).

The location of the 10 stations included in the evaluation of modeled SW is displayed in Fig. 1 and covers a latitudinal range from 58.8 to 69.7 N. Seven of the stations are agricultural stations managed by NIBIO. The latitude, longitude, altitude, distance to the ocean, the start and end date of the time series, and the percentage of flagged data are given for each station in the Supplement (Table S3). The number of days of data used in the validation varies from 5.6 years for Gjengedal to more than 17 years for Bergen.

Most stations measure global radiation with a Kipp & Zonen CM11 thermoelectric pyranometer. The estimated uncertainty of hourly and daily totals of CM11 may be as low as 3 % in optimal conditions (Grini2015). Daily global SW measurements from Bergen station (UiB) are included in the World Radiation Data Centre (WRDC) and are quality-controlled by the data provider (UiB, A. Olseth). The daily estimates have an uncertainty of 3.5 % . Measurement errors and uncertainty may depend on sensor calibration, placement (e.g., sky-view), the temporal resolution of the measurements, cleaning of the pyranometer, and local weather conditions.

For stations other than Bergen, quality control procedures follow the methodology suggested by as outlined in Table 2. This procedure involves running rtmrun (Godøy2013); a Perl wrapper around Libradtran 1.7 ; a library for radiative heat transfer to provide solar zenith angle (SZA), extraterrestrial (SWE), and clear-sky (SWCS) incident shortwave radiation for each station location; and the Python scripts developed in to screen and flag the data based on automatic quality control tests. Measurements exceeding the upper and lower bounds given in Table 2 were flagged. Additionally, all station time series were visually inspected at hourly, daily, and monthly levels in order to flag erroneous data not captured by the automatic routines, with emphasis on data points marked as suspicious due to large hourly increments or very high or low variation in the ratio of observed to extraterrestrial radiation. Figure 2 shows the mean monthly values of SZA, SWE, SWCS, the raw measured values (SWraw), and values passing the quality control routines (SWQC) for Løken station.

Figure 2Estimated and measured shortwave radiation (W m−2) at Løken station (61.1 N, 9.1 E) for the period January 1991 to December 1999. Mean monthly solar zenith angle (SZA), (global) shortwave incident radiation at the top of the atmosphere (SWE), modeled clear-sky incident shortwave radiation SWCS), and station measurements of incident shortwave radiation before (SWraw) and after (SWQC) quality control are shown.

Table 2An overview of the automatic quality control tests, based on the relative values of the solar zenith angle (SZA), measured (SWraw), extraterrestrial (SWE), and clear-sky (SWCS) incident global radiation. The table is adapted from Table 4.1.1 in .

In the calculation of daily means, values were flagged as erroneous and subsequently excluded from the validation if more than two hourly data points were flagged or missing during daytime. The number of discarded days varied from 4 % at Kise to 29 % at Tromsø.

Bergen station and Voll station (Trondheim), denoted with orange markers in Fig. 1g, have observations of incident longwave radiation available for the time period considered. The lack of LW measurements is not an uncommon challenge (see, e.g., Carrer et al.2012). The stations' latitude, longitude, distance to the coast, and the start and end date of the data used are listed in the Supplement (Table S3). Both measurement stations are located within 5 km of the coast (see Fig. 1). The measurements from Bergen are managed and quality-controlled by UiB. In the first part of the period they are from a Schulze radiation balance meter, while later in the period they are from an Eppley pyrgeometer. The sensors are placed on the roof of UiB. The observation station at Voll, Trondheim, was managed by MET Norway from March 1996, until it was shut down. The Trondheim measurements are from a Kipp & Zonen CG 1 pyrgeometer located on the ground. At both stations unshaded sensors were used, possibly leading to slight overestimation due to solar near-infrared radiation contamination (overestimations of 10 % on cloud-free days were found in ). The data were quality-controlled by visual inspection for spikes and jumps and by comparing the consistency between the two time series. The Stefan–Boltzmann blackbody longwave radiation was set as an upper limit of the measurements, using the air temperature from the station. If more than 2 h were missing or flagged during a day, observations from that day were omitted in the subsequent validation.

5 Evaluation methods

Daily estimated values are compared to station observations. The nearest model grid cell is selected from the data sets without interpolation, to avoid introducing spatial or temporal smoothing of the meteorological fields . This study specifically looks into the altitudinal dependence of the humidity and surface incident radiation estimates, and as a starting point the estimates without adjustment to the observation stations' altitudes are used.

## 5.1 Vertical adjustment to station altitude

Prior to the comparison of the model estimates with the station observations, the observations and model estimates of VP and SW were analyzed using multiple linear regression with geographical features as predictors, in order to find vertical gradients so that the model estimates could be adjusted to station altitude. The geographical predictors used were altitude (either the stations' altitude or, for the models, the altitude of the nearest-neighbor grid cell to the stations), latitude above 57 N, and distance to the coast, which was calculated in Python using the Haversine distance from the station to the coastline, extracted from a coastline data set that is available via the Matplotlib Basemap Toolkit, implemented at a coarse resolution to not include large inland lakes. The limited number of SW measurement stations and the varying temporal availability of high-quality observational data made an evaluation of the altitudinal dependence of the SW more demanding. The SW data were first converted to clearness index (CI), which describes the daily incident shortwave radiation fraction of the potential extraterrestrial radiation at the local position and time (SW$↓/$ SWE), and then daily data of over 1000 concurrent measurements from eight stations were used in the regression.

The model estimates were adjusted to station altitude by multiplying their grid cell values with the difference in altitude to the observation station and a vertical adjustment gradient. For each model the vertical adjustment gradients were computed as the mean of coefficients found for the model in question and those found for the observational data, linearly interpolated from a seasonal to a daily frequency. A similar regression model was constructed to find vertical gradients in LW using a well-performing data set in lieu of observations due to the limited number of LW observation stations.

## 5.2 Evaluation metrics

Seasonality and aggregated means are assessed by plotting the mean monthly station values for the observations and models. The differences between the model estimates and observations at individual stations are displayed in heat maps. For each variable a table is provided listing several metrics. The tables list the following variables for each model:

1. $\mathrm{\Delta }=\stackrel{\mathrm{‾}}{{\mathit{\mu }}_{\text{station,model}}-{\mathit{\mu }}_{\text{station,observation}}}$, the mean (μ) of the station differences;

2. $|\mathrm{\Delta }|=\stackrel{\mathrm{‾}}{|\left({\mathit{\mu }}_{\text{station,model}}-{\mathit{\mu }}_{\text{station,observation}}\right)|}$, the mean of the station absolute differences;

3. $|\mathit{\delta }{|}_{max}=max\left(|{\mathit{\mu }}_{\text{station,model}}-{\mathit{\mu }}_{\text{station,observation}}|\right)$, the largest absolute difference at any station;

4. $|{\mathit{\delta }}^{\mathrm{s}}{|}_{max}=max\left(|{\mathit{\mu }}_{\text{season,station,model}}-{\mathit{\mu }}_{\text{season,station,observation}}|\right)$, the largest absolute difference found at any station in any season.

Also listed are the mean daily anomaly correlation coefficient (ACC), i.e., the daily Pearson correlation coefficient of the time series where the observed day-of-year mean is subtracted and the number of stations where the cumulative distribution of daily mean estimates passes (p>0.001), and finally the Kolmogrov–Smirnov test of similarity with the cumulative distribution of the observations. The Kolmogrov–Smirnov test returns the probability that the underlying one-dimensional probability distributions are the same (H0). The similarity between the models and observations on a daily frequency is visualized in Taylor plots, where the normalized standard deviation, the root-mean-square error, and the correlation coefficient of the de-seasonalized time series are displayed. The time series are de-seasonalized by subtracting the observed day-of-year climatology. The correlation coefficient thus corresponds to a non-centered version of the anomaly correlation coefficient (ACC).

## 5.3 Evaluation of geographical gradients

In order to see if the geographical dependencies of the model estimates of humidity and shortwave radiation differ significantly from those seen in the observational data, similar multiple linear regression models as those previously constructed to find vertical gradients are used. The predictors are the season, altitude (z), latitude above 57 N, and distance to the coast (C). The regression is first performed separately for each model and for the observations. However, a second iteration of regression is performed for each of the models, where the input data are composed of the observational data and model data are appended together with the data sources indicated. The data source is then used as a categorical predictor allowed to interact with any of the model coefficients. Significance of the interaction term, e.g., between latitude and model source, will indicate that the model's latitudinal gradient is significantly different from the gradient seen in the observations.

## 5.4 Air mass type sensitivity

The differences between the model estimates and observations are also inspected for an air mass type dependence. found significant decreases in the frequencies of dry moderate and dry polar air mass types at Bergen (Flesland) between 1974 and 2000. If the precision of the model estimates is dependent on air mass type, the derived changes in the variables with time may be less robust if the frequencies of the air mass types also change with time. The spatial, synoptic air mass type classification has been constructed for 48 stations in Europe and 7 stations in Norway (Sola, Flesland, Fornebu, Ørlandet, Bodø, Tromsø, and Slettnes) by , according to the methods developed in and . The categorization is done by using sub-daily surface observations of temperature, dew point, wind, pressure, and cloud cover at individual stations (often airports). The synoptic weather-typing classifies the local air mass conditions into the categories DP (dry polar), DM (dry moderate), DT (dry tropical), MP (moist polar), MM (moist moderate), MT (moist tropical), and TR (transitional). The dry weather types are associated with clearer conditions, while the moist weather types are associated with clouds and higher humidity. TR days are defined by large shifts in the synoptic variables, i.e., days where the weather type is changing. The MT weather type is often found in the warm sectors of cyclones, while the MP and MM type may be found in the vicinity of a front or in air transported inland from a cool ocean.

## 5.5 Comparison of trends

The year 1985 is considered the start of the SW brightening period in Europe after a period of SW dimming due to aerosol emissions (see, e.g., Wild et al., 2005). For time series of a sufficient length and quality, the observational data and the model data are inspected for trends. Stations that have less than 10 % missing daily data between January 1985 and December 1999 are considered, and trends are calculated for each calendar month using the Mann–Kendall test and by calculating the Sen slope . The analysis is done using the R software and functions within the “trend” package (Pohlert2018). A total of 59 humidity stations meet the criteria and are grouped into five geographical regions, southwest (SW, stations with  $<\mathrm{61.8}{}^{\circ }$ N, $<\mathrm{8}{}^{\circ }$ E), southeast (SE, stations with  $<\mathrm{61.8}{}^{\circ }$ N,  $>\mathrm{8}{}^{\circ }$ E), central (C, $\mathrm{61.8}>{}^{\circ }$ N$<\mathrm{64}{}^{\circ }$ N), northwest (NW, $>\mathrm{64}{}^{\circ }$ N, $<\mathrm{20.6}{}^{\circ }$ E), and northeast (NE, >64 N, >20.6 E) when assessing trends. For SW and LW only the station in Bergen (UiB) meets the criteria. For consistency the humidity observations from Bergen-Florida are inspected for trends as well.

6 Comparison between existing long-term estimates and the new hybrid approach, HySN

Daily estimates of near-surface humidity and SW and LW from MERRA2, ERA-Interim, PGMFDv2, WFDEI, VFDv1, VFDv2, and HySN (see Table 1) from 1982 to 1999 are first inspected for vertical gradients in order to adjust the model estimates to station altitude in the following. Further, the estimates' quality on a multi-annual timescale is assessed by considering their mean and absolute deviations from station measurements. The model estimate's distribution is assessed by comparing their daily cumulative distribution to that of the station observations. The estimate's similarity to the observations on a daily timescale is considered by inspecting the anomaly correlation coefficient, i.e., their daily correlation with the measurements after the seasonal cycle has been subtracted, and by considering if their differences to station measurements show sensitivity to the local daily air mass type, which has been classified for seven Norwegian stations by . For humidity and SW, the geographical gradients in the models are compared with those in the observations using multiple linear regression. A separate subsection presents modeled and observed trends in each calendar month from 1985 to 1999. Humidity trends are compared after grouping the observations and model estimates into five regions, while SW and LW are computed for Bergen, the only location where long-term measurements of SW and LW are available within the time period with little missing data.

## 6.1 Humidity

The model estimates of near-surface humidity are compared against humidity observations from 84 stations. The observation stations used in the validation of humidity are located on average 200 m below the coarse-scale grid cell elevation, i.e., in the fjords and on the coast rather than in the surrounding terrain.

### 6.1.1 Vertical gradients in humidity

Multiple linear regression of seasonal mean humidity at the location of the humidity stations shows that altitude is a significant predictor of humidity in the observations and all models (see Sect. S2 of the Supplement). The vertical gradient in the observations is close to the moist adiabatic lapse rate but varies considerably with season and distance to the coast (C). On average, dew point temperature decreases by 5.2 C km−1 increase in altitude in summer and freeze point temperature decreases by 4.4 C km−1in winter. Regression based on vapor pressure is found to give smaller relative errors than regression based on dew point temperature. This is because dew point temperature has a higher sensitivity to temperature at low temperatures. The observed vertical gradient in vapor pressure is −0.39hPa per 100 m in summer and −0.24hPa per 100 m in winter, and the vertical gradient is weakened by 0.11hPa per 100 m every 100 km away from the coast.

The vertical gradients in the humidity data differ depending on the data source; e.g., the estimates from PGMFDv2 and WATCH show a weaker decrease with altitude than the observations and other models. For each model the vertical gradients are computed as the mean of the seasonal coefficients found in the regression analyses of the model in question and the observations, linearly interpolated from a seasonal to a daily frequency. The altitudinal adjustment results in a mean difference in humidity, expressed as dew point temperature (Td) of about 1 C for the coarse-scale models and about 0.06 C for the estimates with a 1 km×1 km resolution. The largest adjustment is an increase in MERRA2's Td of 7.3 C at Tafjord station, where the model's orography is 1154 m above the station altitude.

### 6.1.2 Differences of humidity estimates to station observations

The seasonal cycle of the observations and models (adjusted to station altitude) is shown in Fig. 3a. The largest deviations are seen in the period of highest humidity, i.e., during summer. The signs of the average deviations are consistent throughout the year. PGMFDv2 (denoted with P), MERRA2 (M), and to some degree ERA-Interim (E) and WFDEI (W) show larger estimates than the observations, whereas VFDv1 (V1) and VFDv2 (V2) generally show lower estimates. The HySN (H) estimates follow the mean monthly values of the observations closely. This is also evident in Table 3, where summary of statistics for the humidity estimates presented, and HySN shows a mean station error in Td of just 0.1 C. An aggregated mean similar to the observations does not ensure small deviations from the measurements at individual stations.The VFD estimates have the second smallest deviation in aggregated mean (Δ); however, when considering the average absolute deviation ($|\mathrm{\Delta }|$) HySN, WFDEI, and ERA-Interim perform better than VFD.

Figure 3The seasonal cycles of monthly 2 m vapor pressure (a), incident shortwave radiation (b), and incident longwave radiation (c), averaged over the location observations are available. The month of the year is denoted on the horizontal axis. The observations (O) are plotted with a thick, continuous black line. MERRA2 (M, solid) and ERA-Interim (E, dashed) are plotted in red, PGMFDv2 (P, solid) and WFDEI (W, dashed) in blue, VFDv1 (V1, solid) and VFDv2 (V2, dashed) in orange, and HySN (H) with a dashed lilac line.

Table 3Summary of metrics showing the humidity estimates' similarity to station observations. Differences (Δ) are given in dew point temperature in degrees Celsius. Δ is the mean station difference, $|\mathrm{\Delta }|$ is the mean absolute station difference, $|{\mathit{\delta }}_{max}|$ is the largest absolute difference at any station, while $|{\mathit{\delta }}^{\mathrm{s}}{|}_{max}$ is the largest seasonal difference at any station. ACC is the anomaly (de-seasonalized) daily correlation coefficient, while K-S indicates the number of stations where the daily mean cumulative distribution passes the Kolmogrov–Smirnov test of similarity (p>0.001). The best scores are shown in bold.

Differences between the model estimates and the station measurements of humidity, expressed as dew point temperature, are depicted for each station, sorted from south (upper y axis) to north (lower y axis) in Fig. 4. The mean absolute difference ($|\mathrm{\Delta }|$ or MAE) varies from 0.7 C for the HySN estimates to 1.8 C for the PGMFDv2 estimates. The largest deviation occurs at an inland station, Fagernes, where PGMFDv2 Td estimates a 5.4 C higher Td than observed. The figure suggests a latitudinally dependent bias for certain models, and this is further explored in the following subsection by evaluating the models' geographical gradients.

Figure 4For each station, sorted from south to north (y axis), and each model (x axis) the differences between the modeled and observed station mean dew point temperature (a), incident shortwave radiation (b), and incident longwave radiation (c) are shown.

The humidity estimates are evaluated on a daily basis by de-seasonalizing the time series (subtracting the observed day-of-year mean). Figure 5 shows, for each model, the de-seasonalized time series of humidity, the mean temporal correlation coefficient (now equivalent to the anomaly correlation coefficient, ACC), the mean normalized root-mean-square error, and the mean standard deviation in a Taylor plot. Figure 5a visualizes the mean station metrics for the de-seasonalized time series of humidity. The estimates from ERA-Interim and post-processed ERA-Interim (HySN and WFDEI) are closest to the observations and show similar results. MERRA2 also shows a high ACC. PGMFDv2, which is based on an older reanalysis with lower spatial resolution, and the VIC type estimates show slightly poorer results, with an ACC ranging between 0.5 and 0.7 (see also Table 3).

Figure 5Taylor plots depicting the standard deviation ratio and correlation coefficients (ACCs) for the de-seasonalized time series of vapor pressure (a), incident shortwave radiation (b), and incident longwave radiation (c).

### 6.1.3 Evaluation of geographical humidity gradients

Multiple linear regression models are fitted to seasonal mean humidity with the four seasons as categorical predictors, where fall (autumn) is the baseline season in the model. The geographical predictors considered are altitude (z given in kilometers), latitude (above 57 N), and distance to the coast (C, given in per 100 km increments). Further, interactions between altitude and season and between altitude and continentality are included. Each model is paired with the observational data in a common regression model where the data source is included as a categorical predictor.

Figure 6 displays the regression coefficients for the observations and the coefficients for the models if they are significantly different (p<0.01) from those of the observations. Higher significance is marked with a darker color. The HySN estimates have similar coefficients to the observations. The regression shows (Fig. 6) that MERRA2 and PGMFDv2 have significantly higher intercepts (higher fall mean values at the coast of southern Norway) than the observations. MERRA2 further shows a stronger latitudinal gradient and a much weaker decrease in humidity with distance from the coast than the observations. In addition to having a higher intercept than the observations, PGMFDv2 shows a more pronounced seasonal dependency, a weaker continental gradient, and a 50 % stronger latitudinal gradient than the observations. VFDv1 and VFDv2 show a more than 60 % more pronounced decrease in humidity with continentality than the observations. VFDv2 also shows a weaker increase in humidity in summer than the observations.

Figure 6Seasonal and geographical dependencies of seasonal humidity (a) and daily clearness index (b) are depicted. The row names are the names of the coefficients, including the intercept (I), of the multiple linear regression model. The regression coefficients of the observational data are shown in the leftmost column of each plot, while the coefficients found for the model estimates are only shown if they are significantly different from those of the observations (using a limit of p<0.01 for humidity and p<0.05 for CI). Lower p values are indicated with darker colors using a logarithmic color scale (log 10(p)).

### 6.1.4 Air mass type sensitivity of humidity deviations

In Fig. 7a the daily deviations of the humidity estimates are grouped according to the location's daily air mass type classification (SSC type; see Sect. 5). The classification is available for Sola, Fornebu, Flesland, Bodø, Tromsø, and Slettnes stations and the humidity observations considered are from Saerheim, Aas, Bergen, Bodø, Tromsø, and Kirkenes stations. All the estimates are too humid in dry weather types. The PGMFDv2 and VFDv2 estimates show considerable overestimations of humidity in dry weather types and underestimations in moist weather types. The lack of range is consistent with the lower normalized standard deviation seen in the Taylor plot (Fig. 5a). The ERA-Interim, WFDEI, and HySN differences also show a slight sensitivity to air mass type but much less than the VFDv1, VFDv2, MERRA2, and PGMFDv2.

Figure 7Differences between model estimates and observed values binned according to the daily air mass type, classified for nearby stations within Norway . The air mass types are dry polar (DP), dry moderate (DM), dry tropical (DT), moist polar (MP), moist moderate (MM), moist tropical (MT), and transitional (T). Panel (a) shows the mean difference in vapor pressure for Saerheim, Aas, Bergen, Bodø, Tromsø, and Maze stations. Panel (b) shows differences between model estimates and observed incident shortwave radiation at Saerheim (Sola airport), Aas (Fornebu airport), Bergen (Flesland airport), and Trondheim (Ørland airport). Panel (c) depicts differences in incident longwave radiation binned according to air mass type in Bergen (Flesland airport) and Trondheim (Ørland airport).

## 6.2 Incident global shortwave radiation (SW↓)

SW observations from 10 sites on the Norwegian mainland are considered. At most locations the coarse-scale models' corresponding grid cells have an altitude 300–400 m above station altitudes.

### 6.2.1 Vertical gradients in clearness index (CI)

Multiple linear regression was used to provide a vertical gradient in SW, expressed as clearness index (CI, i.e., the fraction of SW of the extraterrestrial incoming radiation, SWE), in order to adjust the estimates to the stations' altitudes. Multiple linear regressions, including both continentality and altitude as predictors resulted in altitudinal coefficients varying in both magnitude and sign for the different models and observations (not shown). This was likely because the correlation between altitude and continentality varies between 0.56 and 0.86 depending on the data source. Excluding continentality from the predictors provided vertical CI gradients with a consistent sign. The observations show vertical CI gradients of 0.020∕100 m in winter, 0.013∕100 m in spring, 0.005∕100 m in summer and 0.003∕100 m in fall (see Fig. 6b). The observed SW thus increases, on average, with altitude in all seasons.

The effect of adjusting the model estimates to station altitude is an average reduction in SW of 0.7–1.5 W m−2 for the coarse-scale models (MERRA2, ERA-Interim, PGMFDv2, and WFDEI) and a reduction of merely 0.1–0.3 W m−2 for the models with a 1 km×1 km grid (VFDv1, VFDv2, HySN). The largest adjustment is a mean reduction of the PGMFDv2 SW estimate of 4 W m−2 at a station in southeastern Norway (Gjengedal).

### 6.2.2 Differences of SW↓ estimates to station observations

The mean monthly model estimates of SW averaged over all 10 stations, after adjustment to station altitude, are visualized in Fig. 3b. In winter the deviations are small, but in spring and summer all models except VFDv1 overestimate SW. The MERRA2 SW is on average 35 W m−2 higher than the observations in July, and the VFDv2 SW is 29 W m−2 higher than the observations in both June and July. ERA-Interim, WFDEI, and HySN show the largest overestimations in May, a month when solar radiation is high and snow cover is variable.

Figure 4b depicts the mean difference between the model estimate and the observations of SW at individual stations. At half of the stations the mean difference between the WFDEI and HySN and the observations is lower than the measurement uncertainty of newer pyranometers in optimal conditions (Sect. 4). The figure further shows that most models consistently overestimate SW. This is not true for VFDv1. While VFDv1 has the second lowest mean monthly deviation (Fig. 3), its mean absolute difference is larger, on average 10 W m−2. This is also evident in Table 4 where summary statistics for the SW estimates are presented.

Table 4As in Table 3 but with metrics listed for SW. Except for ACC and K-S, which are dimensionless, the units are given in W m−2.

Table 4 shows that at individual stations seasonal deviations in model estimates from station observations are as large as −62W m−2. The large underestimation is found in VFDv1 and not VFDv2 at a coastal station in northern Norway, Bodø. Also listed in the table is the percentage of stations where the daily model estimate, adjusted to station altitude, passes the Kolmogorov–Smirnov test of similarity of their cumulative distribution with the observations, which is zero cases for PGMFDv2 and 70 % for HySN.

The similarity of the model estimates to the observations at a daily frequency is visualized in a Taylor plot (Fig. 5b). As also seen for the humidity estimates, PGMFDv2, VFDv1, and VFDv2 have lower ACCs (31 %–48 %) than the estimates based on newer reanalysis data (73 %–78 %). PGMFDv2 in particular shows a variance at a daily frequency that is considerably smaller than the observations.

Figure 8Mean daily clearness index (CI) during winter (a) and summer (b) is depicted in violin plots, where the kernel density distribution of the observations and each model is shown, mirrored across the y axis. The observed median value is drawn with a blue solid line.

### 6.2.3 Evaluation of geographical gradients in clearness index

Similar to what has been done for humidity, the observations and the corresponding vertically adjusted model estimates of daily CI are compared using multiple linear regression. The seasons, latitude, and altitude are used as predictors, including interaction between season and latitude and between season and altitude. Figure 6b shows the regression coefficients of the observational data in the leftmost column. The coefficients found for the model estimates are only displayed if they are significantly different (p<0.05) from those of the observations. Larger differences (log 10(p)) are marked with a darker color. The estimated intercept of PGMFDv2 stands out in the plot. It is 40 % higher than the estimated intercept of the observations. The second most evident difference is the latitudinal gradient in both VFDv1 and VFDv2, which is several times stronger than observed. PGMFDv2 also shows a stronger latitudinal gradient than observed. Other notable differences are the estimated summer CI values of MERRA2, which are considerably higher than those seen in the observations.

### 6.2.4 Air mass type sensitivity of SW↓ deviations

Figure 7b shows the differences between the models' SW estimates and observations at Aas, Saerheim, Bergen-GFI, Bodø, and Tromsø stations, grouped according to the weather type at nearby weather stations (Fornebu, Sola, Flesland, Bodø, and Tromsø, respectively). All models except VFDv1 show positive SW deviations during weather types classified as moist, which occur most frequently. In the less prevalent dry weather types all models except MERRA2 show slightly lower estimates than observed. The largest underestimations are seen for the VFDv1 estimates. Further, the deviations of PGMFDv2, VFDv1, and VFDv2 show a stronger dependency on weather type than MERRA2, ERA-Interim, WFDEI, and HySN. The MERRA2 SW estimates are, however, overestimated during all weather types. The WFDEI SW estimates show considerable deviations from the ERA-Interim estimates, with larger underestimations found during both dry polar and dry tropical weather types, and considerably lower overestimations found on days classified with a moist tropical weather type. Grouping the clearness index into either dry or moist and transitional weather types shows that the observed CI decreases on average by 0.22 in moist and transitional types. A similar decrease is seen in MERRA2, ERA-Interim, WFDEI, and HySN but not in VFDv1 and VFDv2 (−0.12) or PGMFDv2 (−0.05). The summer and winter distributions of clearness index at the stations considered are depicted in Fig. 8. It is evident that PGMFDv2 spans a much smaller range of transmissivity than observed in both summer and winter and that the VIC type estimates have a bias towards low estimates and show less variability than observed in winter.

Since both WFDEI and HySN are based on ERA-Interim, and ERA-Interim shows overestimations of SW in summer, where observations were available, the differences in estimated SW in ERA-Interim and the observations were inspected for dependencies on differences in modeled and observed cloud cover, near-surface humidity, and snow cover using regression. The results varied with both season and location but for the aggregated data significant dependence on differences in observed and modeled 2 m humidity and snow cover was found (with higher snow cover in the model associated with higher SW estimates in the model), and in the warm season larger overestimations were seen in ERA-Interim when the model produced high clouds.

## 6.3 Incident longwave radiation (LW↓)

Only two stations have LW observations available during the validation period, Bergen-GFI (western Norway) and Trondheim-Voll (central Norway); both are located near the coast (see Fig. 1 and the Supplement). More than 17 years of daily measurements are available from the Bergen-GFI station, while at Trondheim-Voll only about 2 years of observations are available.

### 6.3.2 Differences of the LW↓ estimates to station observations

Figure 3c depicts the mean monthly LW at the two stations. Summary statistics for the LW estimates after adjustment to station altitude are also presented in Table 5. At the two stations all models except WFDEI estimate lower values than observed in all months. WFDEI, however, estimates on average 10–15 W m−2 more LW than is observed from October through January. The largest absolute differences are found in MERRA2, where LW is underestimated at 8 W m−2 in winter and at 25 W m−2 midsummer. The remaining models estimate 11 to 17 W m−2 less LW than observed in summer, and show smaller underestimations in winter.

Table 5As in Table 3 but with metrics listed for LW. Except for ACC and KS, which are dimensionless, the units are in W m−2.

The skill of the model estimates in capturing the day-to-day variability in LW is visualized in Fig. 5c, indicating the correlation and normalized standard deviation of the de-seasonalized time series. The estimates based on newer reanalysis data, MERRA2, ERA-Interim, WFDEI, and HySN have anomaly correlation coefficients around 0.8, while PGMFDv2, VFDv1, and VFDv2 have lower ACCs (46 %–60 %).

### 6.3.3 Air mass type sensitivity of LW↓ deviations

Figure 7c depicts the differences between the model's LW estimates and the observed values at Bergen and Trondheim stations, grouped according to the daily weather type (classified at Flesland and Ørlandet stations). On days classified with moist or transitional weather types, all models except WFDEI underestimate LW. PGMFDv2, VFDv1, and VFDv2 clearly have weather-type-dependent deviations from the observations, with underestimations in moist weather types and smaller underestimations or even overestimations compared to the observations in dry weather types. The MERRA2, ERA-Interim, WFDEI, and HySN estimates largely show similar differences to the observed values in all weather types. An additional comparison of the ERA-Interim estimates at Bergen, where cloud observations are available, showed no difference in average deviation in the estimates of incident longwave radiation on common clear-sky days compared to the remaining days (not shown). The lower estimates of incident longwave radiation in ERA-Interim are thus not likely to be primarily related to differences in cloud properties.

## 6.4 Modeled and observed trends in near-surface humidity, SW↓, and LW↓ from 1985 to 1999

January 1985 is considered to be the start of the brightening period in Europe after a period of SW dimming due to aerosol emissions (see, e.g., Wild et al.2005). In the following, where available, observations and co-located model estimates are inspected for trends in near-surface humidity, SW, and LW from 1985 to 1999.

After screening the observational time series 59 humidity stations are grouped into five geographical regions (southwest (SW), southeast (SE), central (C), northwest (NW), and northeast (NE); see Sect. 5). Table 6 lists the results of the trend tests for each calendar month and region, listing only the Sen slope if the Mann–Kendall trend test is significant at a 5 % or 10 % level, with the latter denoted in italics. In all regions except the northeastern part of Norway significant increases in observed Td occurred in September from 1985 to 1999. The observations further show an increase in Td in April in southern Norway, an increase July in the northeastern part of Norway, and a decrease in May Td in central Norway. For southern and central Norway all of the models capture the increase in September Td. The models reproduce the observed trends in spring and in northern Norway to a lesser degree.

Table 6Linear, decadal trends in monthly mean Td (C) between 1985 and 1999, significant at a 5 % or 10 % (denoted in italics) level in the observations (O) and the model estimates, listed with the month and region denoted on the left (month, region).

Bergen is the only location in Norway where long-term records of incident shortwave or longwave radiation are available with little missing data within the time range. Between January 1985 and December 1999, observations from the University in Bergen show trends in annual SW of 1.7 W m−2 per decade (p<0.1), while LW decreases with −8.4W m−2 per decade (p<0.001). At the nearly co-located measurement station of the Norwegian Meteorological Institute, at Bergen-Florida, annual dew point temperature has a trend of 1.2 C per decade (p<0.0001).

In individual calendar months larger trends were found. Observed August mean SW increased with 51 W m−2 per decade (Table 7). The observations also show modest, but significant, increases in October and December. Apart from WFDEI and VFDv1, which show no significant SW trends, the models reproduce a significant increase in SW in August, and no models show trends of the opposite sign during the time period considered.

Monthly mean LW in Bergen shows a significant decrease during several months of the year: the largest is found in May, with −21W m−2per decade (Table 8). In August, October, and December, months when concurrent increases in shortwave radiation were found (Table 7), the observations show significant decreases in incident longwave radiation. None of the models show equally strong negative trends in monthly mean LW in their corresponding grid cells. MERRA2 and ERA-Interim show no significant trends during the time period, whereas PGMFDv2, WFDEI, and HySN exhibit one single month with a negative trend. Meanwhile, VFDv1 and VFDv2 show weakly positive trends in September. The increasing trend in incident longwave radiation in September in the VIC type estimates may be related to the concurrent increase in humidity. All the models also show an increasing trend in humidity in the grid cell covering Bergen in September; however, the models generally show weaker trends than observed at Bergen-Florida station (see Table 9).

Table 7Linear, decadal changes in monthly mean SW (W m−2) at Bergen between January 1985 and December 1999, significant at a 5 % or 10 % (denoted in italics) level.

Table 8Linear, decadal changes in monthly mean LW (W m−2) at Bergen between January 1985 and December 1999 significant at a 5 % or 10 % (denoted in italics) level.

Table 9Linear, decadal changes in monthly mean Td (C) for Bergen-Florida station and for the co-located grid cells of the model estimates between January 1985 and December 1999, significant at a 5 % or 10 % (italics) level.

7 Discussion

Historical estimates of humidity and incident shortwave and longwave radiation have been compared to station observations from mainland Norway from 1982 through 1999. A total of 84 stations provide vapor pressure (VP) observations, 9 stations provided SW observations, while only 2 stations provided LW observations. The estimates evaluated are from two reanalysis data sets, MERRA2 and ERA-Interim; three data sets composed of reanalysis data blended with gridded observational data, PGMFDv2, WFDEI, and HySN; and two versions of the VIC type forcing data, estimates based on gridded observational data combined with empirical algorithms.

Differences between the estimates and observations are not necessarily due to errors in the estimates, as a vertical adjustment to station altitude is not a sufficient reason to require that the model grid cell estimates should equate to the observed values. The numerical model estimates may still differ from the observations for valid reasons, such as differences in snow cover, differences in land cover type (the observations are from sensors usually located over grass or, in some cases, on top of buildings), and the averaging out of sub-grid variability in the models (see, e.g., Göber et al.2008). The uncertainty in the observations may also contribute to the differences. However, large differences may suggest biases in the estimates.

Since two LW stations located more than 400 km apart could not provide an observation-based vertical gradient in LW, ERA-Interim was consulted instead. The gradients were, on average, −4.5W m−2 per 100 m in winter and −1.8Wm−2 per 100 m in summer. found vertical gradients in incident longwave radiation of −2.8Wm−2 per 100 m in winter and −3.1Wm−2 per 100 m in summer for the Alps and of −4.1W m−2 per 100 m in winter and −2.6W m−2 per 100 m in summer, when considering a subset of observation stations in Switzerland. The different vertical gradients found may be explained by differences in temperature and humidity gradients, different climatological distributions of clouds, and the difference in initial temperature, as LW is a function of temperature to the power of 4. The regression-based vertical adjustment of ERA-Interim LW estimates resulted in a larger correction of LW than the clear-sky adjustment implemented in HySN, alluding to the fact that the clear-sky altitudinal adjustment implemented in similar data products might be too low, especially for locations with a maritime climate, like Bergen.

## 7.2 Differences to station observations

### 7.2.1 Humidity estimates

The empirically based model estimates, VFDv1 and VFDv2, show, on average, slightly lower estimates of humidity than observed. Both VFD type estimates are found to show a 50 % stronger decrease in humidity with continentality than the observations (see Sect. 6.1.3). The modified version of the Magnus type formula, based on , used in MTCLIM to generate the VFD humidity estimates is likely not appropriate for Norway. Previous studies, e.g., in the development of gridded climate variables by and in the application of the MTCLIM model over complex terrain in Australia and in the western US , found that the method did not result in overall improved humidity estimates. Indeed, in the method is found to give improved estimates of humidity in locations where the ratio of potential evaporation to annual precipitation is larger than 2.5. In most regions of Norway this ratio is well below unity. The more conventional method of using daily minimum temperature as a proxy for dew point temperature will likely give relatively small overestimations of humidity compared to the underestimations resulting from using the method.

The reanalysis-based estimates all overestimate humidity, and the overestimations are generally higher in weather types classified as dry according to the methodology of . MERRA2 and PGFMDv2 particularly overestimate humidity in dry conditions. The same two models also show a significantly stronger decrease in humidity with latitude than observed. MERRA2 also shows a weaker decrease in humidity with continentality. The weaker decrease in humidity with continentality seen in MERRA2 may perhaps be partly explained by the model's coarse resolution and land mask (see Fig. 1), and MERRA2's exaggerated latitudinal gradient in humidity in Norway may perhaps be associated with MERRA2's larger latitudinal gradient in SW.

The humidity estimates from HySN match the observations best, considering all metrics except from the anomaly correlation coefficient (ACC). The ACC of ERA-Interim estimates is marginally higher (0.02) than in HySN. This is likely due to the capping of relative humidity at 100 % when applying the SeNorge2 temperature in the development of HySN. Combining the methods outlined in , which for humidity relies on the assumption of constant relative humidity with altitude, a high-quality reanalysis data set (ERA-Interim), and a high-resolution, national, observation-based temperature data set is found to provide high-quality daily estimates of humidity in the current study region. The coarser, reanalysis-based data sets generally show higher ACCs than the VFD estimates. Numerical weather prediction (NWP) models are skilled at capturing synoptic events, i.e., weather or climatological patterns on a spatial order of 1000 km, and a temporal order of days or weeks, such as cold air outbreaks and the changing sources of air masses during the passage of warm and cold fronts. Though the NWPs may have systematic biases and a much lower spatial resolution than empirically based estimates, it is not surprising that they are useful in representing daily weather variability.

Shortwave incident radiation is, on average, overestimated for all model estimates except VFDv1. HySN, ERA-Interim, and WFDEI vary in obtaining the highest ranking depending on the metric considered. For instance, WFDEI shows a slightly lower average deviation from the observations than ERA-Interim and HySN. On the other hand, WFDEI shows larger underestimations in dry weather types than ERA-Interim and HySN (Fig. 7). Overall, the three models provide vertically adjusted estimates of incident SW close to the observations, with average deviations from station measurements below 4 W m−2 and ACCs above 0.76.

The average difference between the ERA-Interim estimates and the observations is smaller than in , where an average overestimation of 12 W m−2 was found when comparing ERA-Interim SW estimates to station measurements in Europe between 2010 and 2014. The smaller difference seen in the current study may in part be explained by the relatively small amount of solar radiation reaching Norway, the different time periods considered, and the vertical adjustment included in the current study. also found that MERRA2 shows poorer results than ERA-Interim, with average overestimations of 18 W m−2. This is consistent with our findings, where MERRA2 has the highest mean deviation from the observations of any of the considered estimates. Overestimations of incident shortwave radiation over land are not only an issue of reanalysis data sets covering Europe but have been a long-standing issue in global and regional climate models .

Two versions of VIC type forcing data are evaluated in the current study. The two versions differ in their input data and in the version of VIC preprocessor used. The oldest version of the VFD data sets is partly based on a 11 km national reanalysis (NORA10) to provide maximum and minimum temperature. The older version showed large underestimations of incident shortwave radiation at several stations, particularly near the coast in northern Norway (Fig. 4). These findings are in line with , where the MTCLIM algorithms were found to underestimate incident SW radiation by 26 %, on average, at coastal sites. The MTCLIM algorithms implemented in VFD rely in part on the diurnal temperature range to estimate cloud cover, using a low range as an indication of cloud cover. Near the coast, the diurnal temperature range may be low due to the moderating influence of the nearby ocean, due to its high heat capacity. The more recently compiled version of VFD data, VFDv2, which is based on a newly developed, high-resolution gridded data set of Tmin and Tmax, does not show similar underestimations near the coast of northern Norway. The different estimates produced indicate that great care must be taken to make sure the VIC style forcing data have consistent input data and algorithm versions if the data are used in, for example, climate change impact studies.

The newer and higher-resolution input data used in VFDv2 did not result in a lower mean absolute station deviation, as its SW estimates were consistently overestimated. Both VFD versions show a much stronger latitudinal gradient than observed and a too strong altitudinal gradient in summer. The latter finding is in line with , where VFD type estimates for the Colorado River basin showed increasing overestimations of SW with increasing altitude. The exaggerated latitudinal gradient in SW may be connected to the use of the diurnal temperature range in the algorithm. found that the relationship between cloud cover and the diurnal temperature range breaks down for ranges below 5 C. Further, states that the relationship between diurnal temperature range and cloud cover is weak at around 60 N in winter and, further, becomes positive in the Arctic.

Binning the estimates according to air mass type shows that the PGFv2 and VIC type estimates show less sensitivity to the prevailing weather type than the observations. The observations and the remaining model estimates show a decrease in clearness index of about 0.22 on days classified as moist or transitional weather types rather than dry, while the VIC type estimates and PGFv2 show reductions of 0.12 and 0.05, respectively. On average, the VIC type estimates and PGFv2 underestimate incident radiation in dry weather types (see Fig. 7). The similarity between the PGMFDv2 and VFD estimates of SW may be explained by the fact that the PGMFD SW is bias-corrected based on gridded cloud cover from CRU using the relationship between SW and cloud cover, which is also used in VFD. Further, the gridded CRU cloud cover data set is a secondary or derived observational data set, which is, similar to VFD, in part based on regression using diurnal temperature range as a predictor . The lower sensitivity to air mass type found in PGMFDv2 and the VIC type forcing data might contribute to the lower ACC found for these estimates.

The evaluation of incident longwave radiation is compromised by the lack of observational data. Only two sites observe incident longwave radiation in the considered time period. The difference between the annual mean of the model estimates and observations are for the two stations considered larger for incident longwave radiation than for incident shortwave radiation. The annual deviations ranges from −16 to +7W m−2. Underestimations of monthly means are found throughout the year for all models except WFDEI. The deviations from the station observations for WFDEI, MERRA2, ERA-Interim, and HySN are largely similar in all weather types; i.e., the underestimations are also found on days classified with dry weather types. An additional evaluation of the ERA-Interim LW estimates for Bergen, where cloud observations are available, showed that the deviations from station observations were similar on days where clouds were present in the observations, the model, and the remaining days.

While overestimation of incident shortwave radiation has been a long-standing issue in many climate models and reanalyses, incident longwave radiation is typically underestimated . The causes of the underestimation are, however, debated. points to an improper representation of interactions between radiation and suspended frozen water particles in the atmosphere (solid hydrometeors) as a culprit, while speculate that errors in simulated aerosols, water vapor content, and cloud properties (rather than cloud amounts) are the cause. Local issues such as longwave emissions from nearby terrain may also contribute to the deviations . Lastly, observational uncertainty confounds the picture further, particularly given that the two sensors were unshaded.

The anomaly correlation coefficients are, as also seen for humidity and incident shortwave radiation, considerably lower for PGFMDv2 and the VFD estimates than the estimates based on newer reanalysis data (0.46–0.60 vs. 0.79–0.82). This may be caused by the representation of clouds in the models. As discussed for the PGFMDv2 and the VFD SW estimates, the use of diurnal temperature range as a proxy for cloud cover may not be suitable for the current maritime, high-latitude study region.

## 7.3 Trends

The analysis of observed humidity trends from 1985 to 1999 showed significant increases in April in southern Norway, a decrease in May in central regions of Norway, significant increases in July in the northeastern part of Norway, and increases in all regions except the northeastern part of Norway in September. All the data sets, both the reanalysis-based estimates and the more empirically based VFD estimates, capture the increase in humidity seen in September. The significant increases found in humidity when averaging over the stations in southeastern and southwestern Norway in April are not seen in any of the models. The VFD estimates do, however, capture some of the increases in humidity that were seen in the measurements from Bergen-Florida (Table 9). A recent study by found that changes in large-scale weather patterns can, in part, explain the significant increases in 2 m temperature between 1981 and 2010 seen in Scandinavia in September but not most of the increases seen in April. Another inquiry by found that the increasing temperature trends seen in May in many parts of Norway showed a strong correlation with a concurrent decrease in snow cover. The decline in snow cover in May found in was particularly strong in low-lying areas. If the changes in temperature and humidity are connected to local changes in snow cover, it is possible that the coarser-scale reanalysis data, which often have a mean grid cell altitude above the measurement station elevation, do not capture the measured changes.

Surface incident radiation was inspected for trends from 1985 to 1999 at the one station where measurements are available in the time period with little missing data: Bergen. A hardly significant (p<0.1) annual trend in SW was found in the observations, 1.7 W m−2 per decade. However, in individual calendar months larger trends were found. The largest trend, 51 W m−2 per decade, was found in the observations in May. The observed August trend was reproduced fairly well in ERA-Interim, PGMFDv2, and HySN, and a weaker but still significant trend was seen in MERRA2 and VFDv2. While ERA-Interim largely reproduces the trend, WFDEI shows no significant trends. For the considered location, the post-processing of ERA-Interim radiances, based on CRU cloud cover and interannual aerosol loading conducted in the production of WFDEI, has a negative effect on its ability to reproduce the observed trend. The clear-sky type post-processing of ERA-Interim implemented in HySN estimates left the trend close to its original value. The two versions of VFD also differed in their ability to capture the SW trend. This might be due to the maritime location of Bergen and coarser VFDv1 input data for Tmin and Tmax. A previous study by showed that circulation type changes could account for a large part of the dimming that was observed in Bergen before around 1980 but a lesser fraction of the subsequent brightening. The fact that ERA-Interim, which does not explicitly account for interannual aerosol changes, picks up the trend while WFDEI, where a correction for interannual aerosol loading has been applied does not, implies that a considerable part of the trend before 2000 must be included in the indirect effects of aerosol changes, which ERA-Interim assimilates, or other factors. On the other hand, MERRA2 accounts for interannual aerosol loading in the time period considered and captures a positive, albeit weaker trend in SW.

The annual trends in LW during the same period in Bergen were larger in magnitude than those found for SW, −8.4W m−2 per decade. The observed trend in any calendar month was larger for SW, while the LW trend showed more consistency throughout the year. More of the models reproduced the SW trend than the LW trend. Both versions of the VIC type forcing data, VFDv1 and VFDv2, simulated a weak increasing trend in September. Given that the VFD estimates did not produce changes in SW in the same month, the increase is likely due to the clear-sky parametrization and the concurrent simulated increase in September Td. WFDEI and HySN reproduce the decrease in LW seen in August, while ERA-Interim does not. This points to the fact that changes in near-surface temperature, which are used as a scaling factor and to adjust near-surface humidity in WFDEI and HySN, capture the signal that contributes to the decrease in LW. A larger sample of stations measuring incident radiation with a high quality is needed to evaluate how well the models capture trends within the region, particularly given the uncertainty in the observational data.

8 Code and data availability

The HySN data product is available archived in Zenodo (https://doi.org/10.5281/zenodo.1970170, ). The code used to produce the HySN estimates is written in Python and is available at https://github.com/helene-b-e/HySN.git (last access: 10 June 2019). Further, the particular version of the software code used to produce the HySN estimates validated here is archived in Zenodo (https://doi.org/10.5281/zenodo.1435555, ). The remaining data sets are available from the various named data providers.

9 Conclusions

Hydrological, ecological, and crop modellers seek landscape-scale data. Norway has a long coastline with mountains, fjords, and small islands. Strong land–sea contrast, high mountains, and a seasonal snow cover that is highly dependent on continentality and altitude results in a fine-scale variability difficult for coarse-scale models to represent. A Python script to downscale and consolidate reanalysis data with high-resolution national gridded temperature data has been developed, which, leaning on previously well-tested empirical relationships, provides estimates of humidity and incident radiation on a fine-scale grid. The downscaled humidity ensures that relative humidity is constrained at 100 %, so that, for example, reasonable evaporation estimates can be sought. The new estimates, HySN, provide humidity estimates with the overall highest quality given for the metrics considered here, also surpassing those based on estimating humidity from temperature alone, such as for the VIC type forcing data. The new estimates outperform the VIC type forcing data and the MERRA2 estimates of incident radiation; however, it is not clear that the new estimates have an added value compared to ERA-Interim and WFDEI. The lack of high-quality historical observations, particularly of incident longwave radiation, hinders a proper evaluation of the data sets.

• Additionally, this study has shown that (a) altitude is a significant predictor of humidity, SW, and LW in the domain. The coarse-scale estimates of Td increased on average by 1 C, SW by 0.7–1.5 W m−2, and LW increased by as much as 8.6 W m−2, when adjusted to station altitude.

• Further, the results have shown that a high resolution does not necessarily indicate high-quality estimates. The added value of the high horizontal resolution of the more empirically based estimates does not outweigh the added value of relying on estimates from coarser-resolution numerical weather prediction reanalyses (b). Not only is a higher daily temporal correlation (ACC) seen in the estimates based on newer reanalysis data compared to the VIC type forcing data but also a lower mean absolute station bias is seen for several reanalysis-based products (ERA, WFDEI, HySN). VFDv1 and VFDv2 show a 60 % stronger decrease in humidity with distance from the coast than the observations, alluding to the fact that the modified version of the Magnus type formula based on , implemented in VFD to estimate humidity from daily minimum temperature, is not appropriate for the Norwegian domain. Both VFDv1 and VFDv2 also show a decrease that is several times stronger in solar radiation with latitude than the observations, likely a result of using diurnal temperature range as a proxy for cloud cover, an assumption likely not appropriate in coastal environments and at high latitudes.

To our knowledge reanalysis-based estimates have not been compared with VIC type forcing data for regions within Europe (or Norway specifically). The comparison of model estimates may assist impact modellers that have not yet selected data to use. Some of the findings might help explain persistent errors, for instance found in the timing of snowmelt in a hydrological model. The findings provide emphasis for climate researchers to not only downscale T2 and precipitation from climate projections, and later use these to estimate humidity and incident radiation, but to utilize the climate model estimates of near-surface humidity and incident radiation. This is already done, for example, in , where the impact of climate change on surface hydrology is examined based on, depending on the hydrological model's structure, bias-corrected climate model output of precipitation, temperature, humidity, and incident radiation from the ISI-MIP project . Similarly, we envisage that further work would involve applying the HySN as an input to a hydrological model. Such a model exercise would imply modulating the model's code to accommodate humidity and radiation as input variables. Once the model includes more physically based parameterizations, the sensitivity in simulated runoff to the choice of forcing data can be assessed, including the impact of errors or perturbations in each of the forcing data variables.

The source code for computing HySN has been made available and may easily be configured to use other reanalysis data or other national data sets as input. The compilation of HySN requires merely half a day on a modern desktop computer. Part of the code might also be implemented in a model preprocessor or in the calculation of various indices, so that the variables do not need to be stored for long time spans. Future work entails calculating indices, such as reference evaporation, and updating the input data to ERA5 and a new version of SeNorge once the full historical time series of the two are available. Additionally, sub-daily estimates, the inclusion of terrain features such as slope and aspect, and adding a correction based on the lack of coupling between the land surface and the atmosphere at times when the differences in the local snow cover and snow cover modeled by the reanalysis are large might be promising, as initial results showed that differences in ground snow conditions between the reanalysis and the observations were significant in predicting the difference between ERA-Interim estimates of SW and the observations.

Supplement
Supplement.

Author contributions
Author contributions.

All authors discussed the results and contributed to the manuscript. HBE derived the HySN data set, acquired and quality-controlled data from external sources, analyzed the results, and drafted the manuscript.

Competing interests
Competing interests.

The authors declare that they have no conflict of interest.

Acknowledgements
Acknowledgements.

We would like to thank Sigbjørn Grini for providing the scripts for quality control of the SW. We further thank all data providers: MET Norway, NIBIO, UiB (where radiation measurements were provided by Jan Asle Olseth), NASA, and ECMWF. We also thank Ingjerd Haddeland, Jan Magnusson, and Shaochun Huang for compiling the VIC forcing data while working at NVE. The PGMFDv2 was downloaded from the ISI-MIP node of the ESGF server (https://esg.pik-potsdam.de/projects/isimip/, last access: 10 June 2019). The study forms a contribution to LATICE, which is a strategic research area founded by the Faculty of Mathematics and Natural Sciences at the University of Oslo. Helene Birkelund Erlandsen was funded by NVE.

Financial support
Financial support.

This research has been supported by the Norwegian Water Resources and Energy Directorate (grant no. 81077).

Review statement
Review statement.

This paper was edited by Thomas Blunier and reviewed by Graham Weedon and Emma Robinson.

References

Abatzoglou, J. T.: Development of gridded surface meteorological data for ecological applications and modelling, Int. J. Climatol., 33, 121–131, https://doi.org/10.1002/joc.3413, 2013. a

Almeida, A. C. and Landsberg, J. J.: Evaluating methods of estimating global radiation and vapor pressure deficit using a dense network of automatic weather stations in coastal Brazil, Agr. Forest Meteorol., 118, 237–250, https://doi.org/10.1016/S0168-1923(03)00122-9, 2003. a

Bohn, T. J., Livneh, B., Oyler, J. W., Running, S. W., Nijssen, B., and Lettenmaier, D. P.: Global evaluation of MTCLIM and related algorithms for forcing of ecological and hydrological models, Agr. Forest Meteorol., 176, 38–49, https://doi.org/10.1016/j.agrformet.2013.03.003, 2013. a, b, c, d, e, f, g, h

Bosilovich, M. G., Akella, S., Coy, L., Cullather, R., Draper, C., Gelaro, R., Kovach, R., Liu, Q., Molod, A., Norris, P., Wargan, K., Chao, W., Reichle, R., Takacs, L., Vikhliaev, Y., Bloom, S., Collow, A., Firth, S., Labow, G., Partyka, G., Pawson, S., Reale, O., Schubert, S. D., and Suarez, M.: MERRA-2: Initial evaluation of the climate Technical Report Series on Global Modeling and Data Assimilation, Tech. rep., NASA/TM–2015-104606, 2015. a, b

Bosilovich, M. G., Robertson, F. R., Takacs, L., Molod, A., and Mocko, D.: Atmospheric water balance and variability in the MERRA-2 reanalysis, J. Climate, 30, 1177–1196, https://doi.org/10.1175/JCLI-D-16-0338.1, 2017. a

Bower, D., McGregor, G. R., Hannah, D. M., and Sheridan, S. C.: Development of a spatial synoptic classification scheme for western Europe, Int. J. Climatol., 27, 2017–2040, https://doi.org/10.1002/joc.1501, 2007. a, b, c, d, e

Bras, R. L.: Hydrology: an introduction to hydrologic science, Addison Wesley Publishing Company, 1990. a

Brinckmann, S., Krähenmann, S., and Bissolli, P.: High-resolution daily gridded data sets of air temperature and wind speed for Europe, Earth Syst. Sci. Data, 8, 491–516, https://doi.org/10.5194/essd-8-491-2016, 2016. a

Bristow, K. L. and Campbell, G. S.: On the relationship between incoming solar radiation and daily maximum and minimum temperature, Agr. Forest Meteorol., 31, 159–166, https://doi.org/10.1016/0168-1923(84)90017-0, 1984. a, b

Bromwich, D. H., Wilson, A. B., Bai, L. S., Moore, G. W., and Bauer, P.: A comparison of the regional Arctic System Reanalysis and the global ERA-Interim Reanalysis for the Arctic, Q. J. Roy. Meteor. Soc., 142, 644–658, https://doi.org/10.1002/qj.2527, 2016. a

Bugmann, H., Cordonnier, T., Truhetz, H., and Lexer, M. J.: Impacts of business-as-usual management on ecosystem services in European mountain ranges under climate change, Reg. Environ. Change, 17, 3–16, https://doi.org/10.1007/s10113-016-1074-4, 2017. a

Bureau of Reclamation: Downscaled CMIP3 an CMIP5 Climate and Hydrology Projections: Release of Downscaled CMIP5 Climate Projections, Comparison with preceding Information, and Summary of User Needs. Prepared by the U.S. Department of the Interior, Bureau of Reclamation, Technic, Tech. rep., available at: https://gdo-dcp.ucllnl.org/downscaled_cmip_projections/techmemo/BCSD5HydrologyMemo.pdf (last access: 10 June 2019), 2013. a

Carrer, D., Lafont, S., Roujean, J.-L., Calvet, J.-C., Meurey, C., Le Moigne, P., and Trigo, I. F.: Incoming Solar and Infrared Radiation Derived from METEOSAT: Impact on the Modeled Land Water and Energy Budget over France, J. Hydrometeorol., 13, 504–520, https://doi.org/10.1175/JHM-D-11-059.1, 2012. a

Cosgrove, B. A.: Real-time and retrospective forcing in the North American Land Data Assimilation System (NLDAS) project, J. Geophys. Res., 108, 8842, https://doi.org/10.1029/2002JD003118, 2003. a, b, c, d, e, f, g

Deardorff, J. W.: Efficient prediction of ground surface temperature and moisture, with inclusion of a layer of vegetation, J. Geophys. Res., 83, 1889, https://doi.org/10.1029/JC083iC04p01889, 1978. a

Decker, M., Brunke, M. A., Wang, Z., Sakaguchi, K., Zeng, X., and Bosilovich, M. G.: Evaluation of the reanalysis products from GSFC, NCEP, and ECMWF using flux tower observations, J. Climate, 25, 1916–1944, https://doi.org/10.1175/JCLI-D-11-00004.1, 2012. a

Dee, D. P., Uppala, S. M., Simmons, A. J., Berrisford, P., Poli, P., Kobayashi, S., Andrae, U., Balmaseda, M. A., Balsamo, G., Bauer, P., Bechtold, P., Beljaars, A. C., van de Berg, L., Bidlot, J., Bormann, N., Delsol, C., Dragani, R., Fuentes, M., Geer, A. J., Haimberger, L., Healy, S. B., Hersbach, H., Hólm, E. V., Isaksen, L., Kållberg, P., Köhler, M., Matricardi, M., Mcnally, A. P., Monge-Sanz, B. M., Morcrette, J. J., Park, B. K., Peubey, C., de Rosnay, P., Tavolato, C., Thépaut, J. N., and Vitart, F.: The ERA-Interim reanalysis: Configuration and performance of the data assimilation system, Q. J. Roy. Meteor. Soc., 137, 553–597, https://doi.org/10.1002/qj.828, 2011. a, b

de Oliveira, A. P., Soares, J., Božnar, M. Z., Mlakar, P., and Escobedo, J. F.: An application of neural network technique to correct the dome temperature effects on pyrgeometer measurements, J. Atmos. Ocean. Tech., 23, 80–89, https://doi.org/10.1175/JTECH1829.1, 2006. a

Erlandsen, Helene B.: HySN Data set (Version v1.1) [Data set], Zenodo, https://doi.org/10.5281/zenodo.1970170, 2018a. a

Erlandsen, Helene B.: helene-b-e/HySN: HySN v1.0 (Version v1.0), Zenodo, https://doi.org/10.5281/zenodo.1435555, 2018b. a

Erlandsen, H. B., Haddeland, I., Tallaksen, L. M., and Kristiansen, J.: The Sensitivity of the Terrestrial Surface Energy and Water Balance Estimates in the WRF Model to Lower Surface Boundary Representations: A South Norway Case Study, J. Hydrometeorol., 18, 265–284, https://doi.org/10.1175/JHM-D-15-0146.1, 2017. a

Feld, S. I., Cristea, N. C., and Lundquist, J. D.: Representing atmospheric moisture content along mountain slopes: Examination using distributed sensors in the Sierra Nevada, California, Water Resour. Res., 49, 4424–4441, https://doi.org/10.1002/wrcr.20318, 2013. a

Göber, M., Zsótér, E., and Richardson, D. S.: Could a perfect model ever satisfy a naïve forecaster? On grid box mean versus point verification, Meteorol. Appl., 15, 359–365, https://doi.org/10.1002/met.78, 2008. a

Godøy, Ø.: rtmrun, a Perl wrapper around libRadtran, available at: https://github.com/steingod/rtmrun (last access: 6 June 2019), 2013. a

Grini, S.: Quality Control of Global Solar Irradiation Measured at Four Stations in Eastern Norway, PhD thesis, Norwegian University of Life Science, Ås, 2015. a, b, c, d

Gutmann, E. D., Rasmussen, R. M., Liu, C., Ikeda, K., Gochis, D. J., Clark, M. P., Dudhia, J., and Thompson, G.: A comparison of statistical and dynamical downscaling of winter precipitation over complex terrain, J. Climate, 25, 262–281, https://doi.org/10.1175/2011JCLI4109.1, 2012. a

Haddeland, I., Lettenmaier, D. P., and Skaugen, T.: Reconciling Simulated Moisture Fluxes Resulting from Alternate Hydrologic Model Time Steps and Energy Budget Closure Assumptions, J. Hydrometeorol., 7, 355–370, https://doi.org/10.1175/JHM496.1, 2006. a

Haddeland, I., Heinke, J., Voß, F., Eisner, S., Chen, C., Hagemann, S., and Ludwig, F.: Effects of climate model radiation, humidity and wind estimates on hydrological simulations, Hydrol. Earth Syst. Sci., 16, 305–318, https://doi.org/10.5194/hess-16-305-2012, 2012. a

Hanssen-Bauer, I., Drange, H., Førland, E., Roald, L. A., Børsheim, K. Y., Hisdal, H., Lawrence, D., Nesje, A., Sandven, S., Sorteberg, A., Sndby, S., Vasskog, K., and Ådlandsvik, B.: Klima i Norge 2100 Bakgrunnsmateriale til NOU Klimatilpasning (Climate in Norway 2100 background material for NOU climate adaptation), p. 148, 2009. a

Hanssen-Bauer, I., Førland, E., Haddeland, I., Hisdal, H., Mayer, S., Nesje, A., Nilsen, J., Sandven, S., Sandø, A., Sorteberg, A., and Ådlandsvik, B.: Climate in Norway 2100 – a knowledge base for climate adaptation, Tech. rep., The Norwegian Centre for Climate Services (NCCS), available at: https://www.miljodirektoratet.no/globalassets/publikasjoner/m741/m741.pdf (last access: 10 June 2019), 2017. a

Harpold, A. A., Kaplan, M. L., Klos, P. Z., Link, T., McNamara, J. P., Rajagopal, S., Schumer, R., and Steele, C. M.: Rain or snow: hydrologic processes, observations, prediction, and research needs, Hydrol. Earth Syst. Sci., 21, 1–22, https://doi.org/10.5194/hess-21-1-2017, 2017. a

Heikkilä, U., Sandvik, A., and Sorteberg, A.: Dynamical downscaling of ERA-40 in complex terrain using the WRF regional climate model, Clim. Dynam., 37, 1551–1564, https://doi.org/10.1007/s00382-010-0928-6, 2011. a

Hempel, S., Frieler, K., Warszawski, L., Schewe, J., and Piontek, F.: A trend-preserving bias correction – the ISI-MIP approach, Earth Syst. Dynam., 4, 219–236, https://doi.org/10.5194/esd-4-219-2013, 2013. a

Hirsch, R. M., Slack, J. R., and Smith, R. A.: Techniques of trend analysis for monthly water quality data, Water Resour. Res., 18, 107–121, https://doi.org/10.1029/WR018i001p00107, 1982. a

Hofstra, N., New, M., and McSweeney, C.: The influence of interpolation and station network density on the distributions and trends of climate variables in gridded daily data, Clim. Dynam., 35, 841–858, https://doi.org/10.1007/s00382-009-0698-1, 2010. a

Jerez, S., Thais, F., Tobin, I., Wild, M., Colette, A., Yiou, P., and Vautard, R.: The CLIMIX model: A tool to create and evaluate spatially-resolved scenarios of photovoltaic and wind power development, Renew. Sust. Energ. Rev., 42, 1–15, https://doi.org/10.1016/j.rser.2014.09.041, 2015. a

Kalkstein, L. S., Nichols, M. C., David Barthel, C., and Scott Greene, J.: A new spatial synoptic classification: Application to air-mass analysis, Int. J. Climatol., 16, 983–1004, https://doi.org/10.1002/(SICI)1097-0088(199609)16:9<983::AID-JOC61>3.0.CO;2-N, 1996. a

Katragkou, E., García-Díez, M., Vautard, R., Sobolowski, S., Zanis, P., Alexandri, G., Cardoso, R. M., Colette, A., Fernandez, J., Gobiet, A., Goergen, K., Karacostas, T., Knist, S., Mayer, S., Soares, P. M. M., Pytharoulis, I., Tegoulias, I., Tsikerdekis, A., and Jacob, D.: Regional climate hindcast simulations within EURO-CORDEX: evaluation of a WRF multi-physics ensemble, Geosci. Model Dev., 8, 603–618, https://doi.org/10.5194/gmd-8-603-2015, 2015. a, b

Kimball, J. S., Running, S. W., and Nemani, R.: An improved method for estimating surface humidity from daily minimum temperature, Agr. Forest Meteorol., 85, 87–98, https://doi.org/10.1016/S0168-1923(96)02366-0, 1997. a, b, c, d, e, f

Koster, R. D., Suarez, M. J., Liu, P., Jambor, U., Berg, A., Kistler, M., Reichle, R., Rodell, M., and Famiglietti, J.: Realistic Initialization of Land Surface States: Impacts on Subseasonal Forecast Skill, J. Hydrometeorol., 5, 1049–1063, https://doi.org/10.1175/JHM-387.1, 2004. a

Kotlarski, S., Paul, F., and Jacob, D.: Forcing a distributed glacier mass balance model with the regional climate model REMO. Part I: climate model evaluation, J. Climate, 23, 1589–1606, https://doi.org/10.1175/2009JCLI2711.1, 2010. a

Kristiansen, J., Bjørge, D., Edwards, J. M., and Rooney, G. G.: Soil Field Model Interoperability: Challenges and Impact on Screen Temperature Forecast Skill during the Nordic Winter, J. Hydrometeorol., 13, 1215–1232, https://doi.org/10.1175/JHM-D-11-095.1, 2012. a

Lapo, K. E., Hinkelman, L. M., Sumargo, E., Hughes, M., and Lundquist, J. D.: A critical evaluation of modeled solar irradiance over California for hydrologic and land surface modeling, J. Geophys. Res., 122, 299–317, https://doi.org/10.1002/2016JD025527, 2017. a

Li, J. L., Lee, W. L., Yu, J. Y., Hulley, G., Fetzer, E., Chen, Y. C., and Wang, Y. H.: The impacts of precipitating hydrometeors radiative effects on land surface temperature in contemporary GCMS using satellite observations, J. Geophys. Res., 121, 67–79, https://doi.org/10.1002/2015JD023776, 2016. a, b

Liang, X., Lettenmaier, D. P., Wood, E. F., and Burges, S. J.: A simple hydrologically based model of land surface water and energy fluxes for general circulation models, J. Geophys. Res., 99, 14415, https://doi.org/10.1029/94JD00483, 1994. a

Livneh, B., Rosenberg, E. A., Lin, C., Nijssen, B., Mishra, V., Andreadis, K. M., Maurer, E. P., and Lettenmaier, D. P.: A long-term hydrologically based dataset of land surface fluxes and states for the conterminous United States: Update and extensions, J. Climate, 26, 9384–9392, https://doi.org/10.1175/JCLI-D-12-00508.1, 2013. a

Lofgren, B. M., Hunter, T. S., and Wilbarger, J.: Effects of using air temperature as a proxy for potential evapotranspiration in climate change scenarios of Great Lakes basin hydrology, J. Great Lakes Res., 37, 744–752, https://doi.org/10.1016/j.jglr.2011.09.006, 2011. a

Lussana, C., Saloranta, T., Skaugen, T., Magnusson, J., Tveito, O. E., and Andersen, J.: seNorge2 daily precipitation, an observational gridded dataset over Norway from 1957 to the present day, Earth Syst. Sci. Data, 10, 235–249, https://doi.org/10.5194/essd-10-235-2018, 2018a. a, b

Lussana, C., Tveito, O. E., and Uboldi, F.: Three-dimensional spatial interpolation of 2 m temperature over Norway, Q. J. Roy. Meteor. Soc., 144, 344–364, https://doi.org/10.1002/qj.3208, 2018b. a, b, c, d

Marty, C.: Surface radiation, cloud forcing and greenhouse effect in the Alps, PhD thesis, ETH Zurich, 2000. a, b, c, d, e

Mayer, B. and Kylling, A.: Technical note: The libRadtran software package for radiative transfer calculations – description and examples of use, Atmos. Chem. Phys., 5, 1855–1877, https://doi.org/10.5194/acp-5-1855-2005, 2005. a

Meloni, D., Di Biagio, C., Di Sarra, A., Monteleone, F., Pace, G., and Sferlazzo, D. M.: Accounting for the solar radiation influence on downward longwave irradiance measurements by pyrgeometers, J. Atmos. Ocean. Tech., 29, 1629–1643, https://doi.org/10.1175/JTECH-D-11-00216.1, 2012. a

MERRA-2 const_2d_ctm_Nx: Constant Model Parameters for Usage by CTM 0.5×0.625 degree V5.12.4, https://doi.org/10.5067/4Z3YUPM81GRJ, 2015a. a

MERRA-2 tavg1_2d_rad_Nx: 2d, 1-Hourly, Time-Averaged, Single-Level, Assimilation, Radiation Diagnostics V5.12.4, https://doi.org/10.5067/Q9QMY5PBNV1T, 2015b. a

MERRA-2 inst1_2d_asm_Nx: 2d, 1-Hourly, Instantaneous, Single-Level, Assimilation, Single-Level Diagnostics V5.12.4, https://doi.org/10.5067/3Z173KIE2TPD, 2015c. a

Milly, P. C. D. and Dunne, K. A.: On the hydrologic adjustment of climate-model projections: The potential pitfall of potential evapotranspiration, Earth Interact., 15, 1–14, https://doi.org/10.1175/2010EI363.1, 2011. a

Mizukami, N., Clark, M., Slater, A., Brekke, L., Elsner, M., Arnold, J., and Gangopadhyay, S.: Hydrologic Implications of Different Large-Scale Meteorological Model Forcing Datasets in Mountainous Regions, J. Hydrometeorol., 15, 474–488, https://doi.org/10.1175/JHM-D-13-036.1, 2014. a, b, c, d

Mizukami, N., Clark, M. P., Gutmann, E. D., Mendoza, P. A., Newman, A. J., Nijssen, B., Livneh, B., Hay, L. E., Arnold, J. R., and Brekke, L. D.: Implications of the Methodological Choices for Hydrologic Portrayals of Climate Change over the Contiguous United States: Statistically Downscaled Forcing Data and Hydrologic Models, J. Hydrometeorol., 17, 73–98, https://doi.org/10.1175/JHM-D-14-0187.1, 2016. a

Mohr, M.: New Routines for Gridding of Temperature and Precipitation Observations for “seNorge.no”, Met. no Report, 8, available at: ftp://ftp.met.no/projects/klimagrid/doc/NewRoutinesforGriddingofTemperature.pdf (last access: 10 June 2019), 2008. a, b

Mueller, B., Hirschi, M., Jimenez, C., Ciais, P., Dirmeyer, P. A., Dolman, A. J., Fisher, J. B., Jung, M., Ludwig, F., Maignan, F., Miralles, D. G., McCabe, M. F., Reichstein, M., Sheffield, J., Wang, K., Wood, E. F., Zhang, Y., and Seneviratne, S. I.: Benchmark products for land evapotranspiration: LandFlux-EVAL multi-data set synthesis, Hydrol. Earth Syst. Sci., 17, 3707–3720, https://doi.org/10.5194/hess-17-3707-2013, 2013. a

New, M., Hulme, M., and Jones, P.: Representing twentieth-century space-time climate variability. Part I: Development of a 1961–90 mean monthly terrestrial climatology, J. Climate, 12, 829–856, https://doi.org/10.1175/1520-0442(1999)012<0829:RTCSTC>2.0.CO;2, 1999. a, b, c

Nilsen, I. B., Stagge, J. H., and Tallaksen, L. M.: A probabilistic approach for attributing temperature changes to synoptic type frequency, Int. J. Climatol., 37, 2990–3002, https://doi.org/10.1002/joc.4894, 2017. a

Parding, K., Olseth, J. A., Liepert, B. G., and Dagestad, K.-F.: Influence of atmospheric circulation patterns on local cloud and solar variability in Bergen, Norway, Theor. Appl. Climatol., 125, 625–639, https://doi.org/10.1007/s00704-015-1517-8, 2016. a, b

Pierce, D. W., Westerling, A. L., and Oyler, J.: Future humidity trends over the western United States in the CMIP5 global climate models and variable infiltration capacity hydrological modeling system, Hydrol. Earth Syst. Sci., 17, 1833–1850, https://doi.org/10.5194/hess-17-1833-2013, 2013. a, b, c, d, e

Pohlert, T.: trend: Non-Parametric Trend Tests and Change-Point Detection, available at: https://CRAN.R-project.org/package=trend (last access: 10 June 2019), r package version 1.1.0, 2018. a

Prata, A. J.: A new long-wave formula for estimating downward clear-sky radiation at the surface, Q. J. Roy. Meteor. Soc., 122, 1127–1151, https://doi.org/10.1002/qj.49712253306, 1996. a

Raleigh, M. S., Livneh, B., Lapo, K., Lundquist, J. D., Raleigh, M. S., Livneh, B., Lapo, K., and Lundquist, J. D.: How Does Availability of Meteorological Forcing Data Impact Physically Based Snowpack Simulations?, J. Hydrometeorol., 17, 99–120, https://doi.org/10.1175/JHM-D-14-0235.1, 2016. a

Reistad, M., Breivik, b., Haakenstad, H., Aarnes, O. J., Furevik, B. R., and Bidlot, J.-R.: A high-resolution hindcast of wind and waves for the North Sea, the Norwegian Sea, and the Barents Sea, J. Geophys. Res., 116, C05019, https://doi.org/10.1029/2010JC006402, 2011. a

Rizzi, J., Nilsen, I. B., Stagge, J. H., Gisnås, K., and Tallaksen, L. M.: Five decades of warming: impacts on snow cover in Norway, Hydrol. Res., 49, nh2017051, https://doi.org/10.2166/nh.2017.051, 2017. a, b

Rodell, M., Houser, P. R., Berg, A. A., and Famiglietti, J. S.: Evaluation of 10 Methods for Initializing a Land Surface Model, J. Hydrometeorol., 6, 146–155, https://doi.org/10.1175/JHM414.1, 2005. a

Rontu, L., Wastl, C., and Niemelä, S.: Influence of the Details of Topography on Weather Forecast – Evaluation of HARMONIE Experiments in the Sochi Olympics Domain over the Caucasian Mountains, Front. Earth Sci., 4, 13 pp., https://doi.org/10.3389/feart.2016.00013, 2016. a

Rydsaa, J. H., Stordal, F., Bryn, A., and Tallaksen, L. M.: Effects of shrub and tree cover increase on the near-surface atmosphere in northern Fennoscandia, Biogeosciences, 14, 4209–4227, https://doi.org/10.5194/bg-14-4209-2017, 2017. a

Schmied, H. M., Müller, R., Sanchez-Lorenzo, A., Ahrens, B., and Wild, M.: Evaluation of radiation components in a global freshwater model with station-based observations, Water, 8, 450, https://doi.org/10.3390/w8100450, 2016. a, b

Sheffield, J., Goteti, G., and Wood, E. F.: Development of a 50-Year High-Resolution Global Dataset of Meteorological Forcings for Land Surface Modeling, J. Climate, 19, 3088–3111, https://doi.org/10.1175/JCLI3790.1, 2006. a, b

Sheridan, S. C.: The redevelopment of a weather-type classification scheme for North America, Int. J. Climatol., 22, 51–68, https://doi.org/10.1002/joc.709, 2002. a

Shi, X., Wild, M., and Lettenmaier, D. P.: Surface radiative fluxes over the pan-Arctic land region: Variability and trends, J. Geophys. Res., 115, D22104, https://doi.org/10.1029/2010JD014402, 2010. a, b

Slater, A. G.: Surface Solar Radiation in North America: A Comparison of Observations, Reanalyses, Satellite, and Derived Products, J. Hydrometeorol., 17, 401–420, https://doi.org/10.1175/JHM-D-15-0087.1, 2016. a, b

Stagge, J. H., Tallaksen, L. M., Xu, C., and Van Lanen, H.: Standardized precipitation-evapotranspiration index (SPEI): Sensitivity to potential evapotranspiration model and parameters, Proceedings of FRIEND-water, 1, 367–373, 2014. a

Szczypta, C., Calvet, J.-C., Albergel, C., Balsamo, G., Boussetta, S., Carrer, D., Lafont, S., and Meurey, C.: Verification of the new ECMWF ERA-Interim reanalysis over France, Hydrol. Earth Syst. Sci., 15, 647–666, https://doi.org/10.5194/hess-15-647-2011, 2011. a

Teklesadik, A. D., Alemayehu, T., van Griensven, A., Kumar, R., Liersch, S., Eisner, S., Tecklenburg, J., Ewunte, S., and Wang, X.: Inter-model comparison of hydrological impacts of climate change on the Upper Blue Nile basin using ensemble of hydrological models and global climate models, Climatic Change, 141, 517–532, https://doi.org/10.1007/s10584-017-1913-4, 2017. a

Thornton, P. E. and Running, S. W.: An improved algorithm for estimating incident daily solar radiation from measurements of temperature, humidity, and precipitation, Agr. Forest Meteorol., 93, 211–228, https://doi.org/10.1016/S0168-1923(98)00126-9, 1999. a, b, c, d, e, f

Thornton, P. E., Hasenauer, H., and White, M. A.: Simultaneous estimation of daily solar radiation and humidity from observed temperature and precipitation: an application over complex terrain in Austria, Agr. Forest Meteorol., 104, 255–271, https://doi.org/10.1016/S0168-1923(00)00170-2, 2000. a

Tveito, O. E. and Førland, E. J.: Mapping temperatures in Norway applying terrain information, geostatistics and GIS, Norsk Geogr. Tidsskr., 53, 202–212, https://doi.org/10.1080/002919599420794, 1999. a, b

Urraca, R., Huld, T., Gracia-Amillo, A., Martinez-de Pison, F. J., Kaspar, F., and Sanz-Garcia, A.: Evaluation of global horizontal irradiance estimates from ERA5 and COSMO-REA6 reanalyses using ground and satellite-based data, Sol. Energy, 164, 339–354, https://doi.org/10.1016/J.SOLENER.2018.02.059, 2018. a, b

Vormoor, K. and Skaugen, T.: Temporal Disaggregation of Daily Temperature and Precipitation Grid Data for Norway, J. Hydrometeorol., 14, 989–999, https://doi.org/10.1175/JHM-D-12-0139.1, 2013. a

Warszawski, L., Frieler, K., Huber, V., Piontek, F., Serdeczny, O., and Schewe, J.: The Inter-Sectoral Impact Model Intercomparison Project (ISI–MIP): Project framework, P. Natl. Acad. Sci. USA, 111, 3228–3232, https://doi.org/10.1073/pnas.1312330110, 2014. a

Weedon, G. P., Gomes, S., Viterbo, P., Shuttleworth, W. J., Blyth, E., Österle, H., Adam, J. C., Bellouin, N., Boucher, O., and Best, M.: Creation of the WATCH Forcing Data and Its Use to Assess Global and Regional Reference Crop Evaporation over Land during the Twentieth Century, J. Hydrometeorol., 12, 823–848, https://doi.org/10.1175/2011JHM1369.1, 2011. a, b

Weedon, G. P., Balsamo, G., Bellouin, N., Gomes, S., Best, M. J., and Viterbo, P.: The WFDEI meteorological forcing data set: WATCH Forcing Data methodology applied to ERA-Interim reanalysis data, Water Resour. Res., 50, 7505–7514, https://doi.org/10.1002/2014WR015638, 2014. a, b, c

Wessel, P. and Smith, W. H. F.: A global, self-consistent, hierarchical, high-resolution shoreline database, J. Geophys. Re.-Sol. Ea., 101, 8741–8743, https://doi.org/10.1029/96JB00104, 1996. a

Wild, M., Gilgen, H., Roesch, A., Ohmura, A., Long, C. N., Dutton, E. G., Forgan, B., Kallis, A., Russak, V., and Tsvetkov, A.: From dimming to brightening: decadal changes in solar radiation at Earth's surface, Science, 308, 847–850, https://doi.org/10.1126/science.1103215, 2005. a

Wild, M., Folini, D., Hakuba, M. Z., Schär, C., Seneviratne, S. I., Kato, S., Rutan, D., Ammann, C., Wood, E. F., and König-Langlo, G.: The energy balance over land and oceans: an assessment based on direct observations and CMIP5 climate models, Clim. Dynam., 44, 3393–3429, https://doi.org/10.1007/s00382-014-2430-z, 2015.  a

Wild, M., Ohmura, A., Schär, C., Müller, G., Folini, D., Schwarz, M., Hakuba, M. Z., and Sanchez-Lorenzo, A.: The Global Energy Balance Archive (GEBA) version 2017: a database for worldwide measured surface energy fluxes, Earth Syst. Sci. Data, 9, 601–613, https://doi.org/10.5194/essd-9-601-2017, 2017. a

Zib, B. J., Dong, X., Xi, B., Kennedy, A., Zib, B. J., Dong, X., Xi, B., and Kennedy, A.: Evaluation and Intercomparison of Cloud Fraction and Radiative Fluxes in Recent Reanalyses over the Arctic Using BSRN Surface Observations, J. Climate, 25, 2291–2305, https://doi.org/10.1175/JCLI-D-11-00147.1, 2012. a, b