Introduction

Climate of the Past

Clim. Past

1814-9332

Copernicus Publications

Göttingen, Germany

10.5194/cp-12-2255-2016

Assessing performance and seasonal bias of pollen-based climate reconstructions in a perfect model world

Rehfeld

Kira

krehfeld@awi.de

https://orcid.org/0000-0002-9442-5362

Trachsel

Mathias

mtrachs@umd.edu Telford

Richard J.

https://orcid.org/0000-0001-9826-3076

Laepple

Thomas

https://orcid.org/0000-0001-8108-7520

1Alfred Wegener Institute, Helmholtz Centre for Polar and Marine Research, 14473 Potsdam, Germany 2Department of Biology, University of Bergen, Postboks 7803, 5020 Bergen, Norway 3Bjerknes Center for Climate Research, Allégaten 55, 5007 Bergen, Norway apresent address: British Antarctic Survey, Cambridge, UK bpresent address: Department of Geology, University of Maryland, College Park, MD, USA

Kira Rehfeld (krehfeld@awi.de) and Mathias Trachsel (mtrachs@umd.edu)

21December2016

12 12 22552270 28January2016 18February2016 26November2016 5December2016

This work is licensed under a Creative Commons Attribution 3.0 Unported License. To view a copy of this license, visit http://creativecommons.org/licenses/by/3.0/

This article is available from https://cp.copernicus.org/articles/12/2255/2016/cp-12-2255-2016.html

The full text article is available as a PDF file from https://cp.copernicus.org/articles/12/2255/2016/cp-12-2255-2016.pdf

Reconstructions of summer, winter or annual mean temperatures based on the species composition of bio-indicators such as pollen, foraminifera or chironomids are routinely used in climate model–proxy data comparison studies. Most reconstruction algorithms exploit the joint distribution of modern spatial climate and species distribution for the development of the reconstructions. They rely on the space-for-time substitution and the specific assumption that environmental variables other than those reconstructed are not important or that their relationship with the reconstructed variable(s) should be the same in the past as in the modern spatial calibration dataset. Here we test the implications of this “correlative uniformitarianism” assumption on climate reconstructions in an ideal model world, in which climate and vegetation are known at all times. The alternate reality is a climate simulation of the last 6000 years with dynamic vegetation. Transient changes of plant functional types are considered as surrogate pollen counts and allow us to establish, apply and evaluate transfer functions in the modeled world. We find that in our model experiments the transfer function cross validation r2 is of limited use to identify reconstructible climate variables, as it only relies on the modern spatial climate–vegetation relationship. However, ordination approaches that assess the amount of fossil vegetation variance explained by the reconstructions are promising. We furthermore show that correlations between climate variables in the modern climate–vegetation relationship are systematically extended into the reconstructions. Summer temperatures, the most prominent driving variable for modeled vegetation change in the Northern Hemisphere, are accurately reconstructed. However, the amplitude of the model winter and mean annual temperature cooling between the mid-Holocene and present day is overestimated and similar to the summer trend in magnitude. This effect occurs because temporal changes of a dominant climate variable, such as summer temperatures in the model's Arctic, are imprinted on a less important variable, leading to reconstructions biased towards the dominant variable's trends. Our results, although based on a model vegetation that is inevitably simpler than reality, indicate that reconstructions of multiple climate variables based on modern spatial bio-indicator datasets should be treated with caution. Expert knowledge on the ecophysiological drivers of the proxies, as well as statistical methods that go beyond the cross validation on modern calibration datasets, are crucial to avoid misinterpretation.

Introduction

Continental-scale climate reconstructions are frequently used as a paleodata target to evaluate and benchmark climate models e.g.,. Currently, climate models and proxy data disagree on the annual mean temperature changes over the course of the Holocene . It was argued that seasonal biases in proxy-based climate reconstructions might be the root of the observed proxy–model divergence .

To arrive at quantitative assessments of past climate changes from pollen assemblages, transfer function algorithms are used to establish a link between modern climate and vegetation composition across space. The derived relationships are then applied to fossil pollen percentages, counted in sediment archives. A basic assumption underlying these transfer functions is methodological uniformitarianism , namely that modern spatial relationships between species, vegetation and environmental conditions can be applied to past conditions e.g.,.

One specific requirement is that environmental variables other than those considered in the calibration are not important or that their relationship with the reconstructed variable(s) was the same in the past as it is in the modern spatial calibration dataset . Biological proxies generally respond to a multitude of environmental variables and thus the first part of the assumption is rarely met . Therefore constancy and equivalence of the covariance of relevant parameters in space and time have to be assumed to allow the substitution of spatial gradients (in a modern calibration) for temporal changes (in the past; ). This assumption, which we name “correlative uniformitarianism”, is certainly violated in the real world. For example in the modern climate summer and winter, temperatures are highly correlated across space. In contrast, the major driving forces behind the Holocene temperature evolution, local summer and winter insolation have been anticorrelated over the past 10 000 years due to precessional forcing e.g.,.

The validity of assuming correlative uniformitarianism, specifically the effect of confounding variables on reconstructions from bio-indicators, was investigated using simulated artificial data and it was shown that this can lead to misleading reconstructions and an underestimation of the prediction error . However, without knowing the past climate evolution, it is difficult to estimate the potential implications for reconstructing the Holocene climate evolution.

Here, we use a Holocene climate model simulation with interactive vegetation as a test bed for pollen transfer function methods. In the model world, the modern spatial climate and its relationship to vegetation are known, along with the Holocene climate and vegetation evolution. Our general approach bears some similarities to previous “pseudoproxy” experiments, in which climate model simulations were used to test calibrations for temperature reconstructions of the last millennia . However, as these studies target proxy records for climate which are calibrated temporally against meteorological data (such as tree ring parameters), they largely focus on the effect of proxy noise on the reconstruction. We ignore these proxy imperfections and age uncertainty and focus on the implications of correlative uniformitarianism, which is one operational assumption behind the use of spatial calibrations to reconstruct temporal changes.

Key questions are as follows: (i) To what extent does the correlative uniformitarianism, and aspects of the estimation processes, bias reconstructions of the Holocene temperature evolution? (ii) Are there statistical indicators that can inform us about the actual reconstructability of climate variables?

To address these questions within the model world, we need to assume that model climate and vegetation changes are consistent with each other and that modeled plant functional type (PFT) and land cover type changes (desert fraction) can be used as surrogates for pollen counts in sedimentary archives.

Methods Climate model simulations

Temperature (a) and precipitation changes (b), vegetation turnover (c) and vegetation diversity as measured by the Hill's number N2 of PFTs (d) between 6k and 0k BP in the ECHAM5/MPIOM model simulation .

We use a 6000-year-long transient simulation of the coupled atmosphere–ocean climate model ECHAM5/MPIOM with a dynamic land surface and vegetation scheme provided by the JSBACH module to investigate pollen-based climate reconstruction techniques. This simulation is described in (hereafter 6k run) and is only forced by orbital changes over the last 6000 years. Environmental and atmospheric variables are available on a regular 3.75∘×3.75∘ latitude/longitude grid. The vegetation module is described in and . The modeled climate–vegetation interaction through the growth, competition and mortality of the four tree, two shrub and two grass PFTs is nontrivial: within each grid cell, plants compete for fractional cover, given their own net primary productivity, natural mortality as well as disturbance-driven mortality in response to climate (fire, heat and cold extremes, growing season length). Given a latitude, soil texture, CO2 concentration, temperature and precipitation, processes changing water balance, photosynthesis, leaf cover and respiration are simulated on a daily or monthly time step. The turnover of wood, leaves and roots, decomposition, mortality and establishment is calculated annually, and the resulting vegetation cover is fed into the next year. Table S1 in the Supplement lists the PFTs and their bioclimatic temperature limits. The Holocene climate and vegetation evolution of this model simulation have been extensively used and characterized in paleoclimate model–data comparisons . While vegetation biases have been observed against present-day conditions in some areas , the overall patterns are consistent . Climate and vegetation changes from mid-Holocene to present day are substantial (Fig. ) and differ between the seasons (Fig. , top row). We note that although the resolution of the climate model, and thus the model world calibration dataset, is coarse, its spatial and seasonal range is comparable to that of real-world calibration datasets (Fig. S1 in the Supplement).

Reconstruction methods

Quantitative climate reconstruction based on a multivariate pollen count dataset requires algorithms that translate past vegetation changes into estimates of past climate changes. Most approaches use three datasets: a paired calibration set and one downcore pollen record. The calibration set combines modern pollen and climate data from recent, or modern, conditions taken from surface samples across ecological and climatic gradients. An example from the real world would be pollen counts from lake sediment surfaces across Europe, paired with data from meteorological stations near these lakes. Several approaches for quantitative reconstructions based on ecological species counts have been established see, e.g.,for a review. Here we focus on two popular techniques: best modern analog methods (here BMA, often also called modern analogue technique) and the multivariate calibration method of weighted averaging (WA).

BMA methods directly match the species composition of fossil assemblages against the modern calibration set . To obtain a reconstruction value for a fossil sample, N analog modern samples with the lowest ecological distance (most commonly estimated using the squared-chord distance; ) are selected. Their modern reference climate variables are averaged to obtain the past climate estimate. These approaches are expected to work well on samples with a low number of taxa. In this study we use BMA with N=5 and the squared-chord distance. Multivariate calibrations, however, are based on the regression of modern vegetation onto estimates of a climate variable at many calibration sites to establish one global parametric function between them. In WA calibration, climate optima for different taxa are derived by computing a weighted average of climate variable estimates at all sites at which a taxon is present. Weights are derived from the relative abundance of the taxon. The step from past vegetation composition to estimates of past climate then relies on a second weighting step, in which the climate optima of all taxa present in the fossil sample are averaged, again weighted by their relative abundance. We employ WA here to illustrate results that are common to reconstructions based on BMA and WA-related methods, which may therefore depend on properties of the dataset, or the general approach of reconstructing climate based on modern spatial climate calibrations. In this study we use WA with square-root transformed scores and inverse deshrinking.

Estimates of reconstruction uncertainty

In a real-world situation, the true climate evolution is unknown and a root mean square error of prediction (RMSEP) is estimated in the modern calibration set. In the following we use k fold cross validation with k=10 (1/k-th of the samples are used for verification) but note that, even using leave-group-out cross validation, the RMSEP may be biased low due to autocorrelation in the modern data . As we know the true climate in the model world, we can additionally obtain the root mean square error of the reconstruction (RMSE) by comparing the reconstructed climate variable to its simulated counterpart. We employ multivariate constrained ordination methods to test which climate variables explain vegetation variance. While redundancy analysis (RDA) extends principal component analysis, canonical correspondence analysis (CCA) is the equivalent method for frequency data and allows a unimodal relationship between the species and the environment . We evaluate the similarity between trend and correlation fields using a sign test, similar to Kendall's rank correlation, defined as a fraction ν(X,Y)=S(X,Y)#reconstr. grid cells varying between -1 and +1. A grid cell counts into the sign sum S(X,Y) as +1 if the signs in field X and field Y are the same and as -1 if they are opposite. Summation goes over all grid cells where a reconstruction was performed. This sign test yields ν=1 if and only if all grid cells in field X and Y have the same sign and ν=-1 if all signs are opposing. ν=0 suggests that there are as many grid cells with opposing signs as there are with the same signs, indicating that there is no underlying similarity between the fields.

Calibration and reconstruction workflow

We perform PFT-based calibrations and climate reconstructions at each grid point on land which displays enough diversity and temporal variations in the simulated vegetation. Therefore, we select all points for the reconstruction tests with an effective number of taxa N2 larger than 2

The Hill's number N2 is defined as N2=∑i=1Npi2-1, as the reciprocal of the weighted mean of the abundances p. If all taxa are equally abundant and pi=1/N, N2 is equal to N. If only one taxon is present, and all others are zero, N2=1.

and vegetation turnover larger than 0.5. Turnover is estimated from the length of the first detrended correspondence analysis axis in standard deviation units . The simulated vegetation history through time at a grid point forms the fossil vegetation dataset. The simulated modern surrounding vegetation and climate fields, averaged over the last 30 years, yield the matrices containing modern pollen and climate information for the modern training set. We select all surrounding land points in a radius of 2500 km and subsample them such that the calibration set size is roughly equal for all sites and not latitude dependent. Pollen matrix columns contain the percentages of the nine PFTs (acronyms at the end of the paper, details in Table S1), including the desert fraction as a virtual PFT. Each column in the modern climate matrix corresponds to a climate variable and we choose the warmest month (MTWA), coldest month (MTCO), annual mean temperatures (MAT) and precipitation (MPWA, MPCO, MAP) variables.

We note that large-scale PFT-based pollen reconstructions use roughly 2–3 times the number of PFTs as, e.g., in, and raw pollen spectra contain often more than 10 times the number of taxa. However, the effective number of PFTs in the fossil record, as estimated by Hill's N2, is much lower than the number of taxa itself, and rare taxa do not have a large influence on reconstructions using BMA or WA. Our cutoff at N2=2 is well within the range of N2 for modern pollen spectra (Fig. S2), although the N2 is lower for PFTs than for taxa by construction. In general, a low number of PFTs or taxa may lead to a problem of multiple analogs, where a pollen assemblage is similar to several modern assemblages that are very different in their climatic setting . However, supporting our cutoff choice at N2=2, we do not find indications that this is a problem here. The overall high transfer function r2 (Fig. ) shows that analogs are not picked at random from the training set. To pinpoint this further, we calculate the ratio of the standard deviations of the temperatures at the analog sites and the standard deviation of the temperatures across the whole training set (Fig. S3). The ratios are generally smaller than 0.5, thus illustrating that the analog sites are not randomly drawn from the training set.

In many conventional paleoecological studies, one or two climate variables would be selected for reconstruction, which are expected to have influenced vegetation development significantly and independently . As we want to investigate, which variables can be skillfully reconstructed, we perform joint reconstructions of all six climate variables, both via BMA and WA. We note that jointly reconstructing several climate variables is done in several large-scale regional reconstructions e.g., in and come back to this later in the discussion. Figure illustrates the whole calibration and reconstruction workflow for a BMA reconstruction at an example grid point selected from the Arctic (120∘ E, 72∘ N). CCA analyses (Fig. d) suggest that summer temperature is the main climate variable driving modern vegetation around the site, whereas winter temperatures have little to no impact on the vegetation changes in the model. A summer temperature calibration based on BMA can explain considerable amounts of variance in the modern vegetation–climate relationship, and it also shows a low RMSEP of ∼ 1.15 ∘C. In the model world, we can compare reconstructed and the simulated true past model climate evolution (Fig. f) and find that summer temperatures (MTWA) are faithfully reconstructed, whereas the reconstructions of annual mean (MAT) and winter temperatures (MTCO) largely fail.

Exemplary calibration, BMA reconstruction and verification workflow for the grid point site in Siberia (120∘ E, 72∘ N) highlighted as a red square in (a). Surrounding grid points from which the modern analogs are drawn are shown as black dots, chosen analogs in blue. CCA analyses show that MTWA explains most variance in modern vegetation (b, d) and performs sufficiently well in leave-one-out cross validation (c). The jointly reconstructed climate variables show considerable shared (black) and rather little independent variance (grey) in the modern calibration (d). Past vegetation changes, as shown in the percentage PFT diagram (e), appear to be correlated with (f) simulated and reconstructed climate. PFT acronyms are listed in the appendix. Red lines show the simulated “true” past temperatures, black lines the reconstructions. (g) The MTWA reconstruction explains most fossil vegetation variance in the randomTF significance test, compared to the other temperature variables, and falls outside the confidence interval of the test (red line). The dashed line corresponds to the maximum amount of variance a single variable can explain .

Results Simulated and reconstructed Holocene temperature trends

Linear trend in the simulated (top row) vs. the reconstructed temperature evolution between 6k and present day based on BMA (middle row) and WA (bottom row). Saturated red and blue colors indicate that the grid point's trends are stronger than 1 K kyr-1.

The simulated mid–late Holocene temperature evolution shows a zonal structure characterized by warming trends around the Equator and across Asia and cooling trends in the mid-to-high latitudes (Fig. top row). The seasonal insolation forcing caused by changes of the orbital configuration results in distinct temporal trends for summer and winter temperature, which differ in their strength and in some regions also in their signs. In the Arctic regions, the trends in the model simulation are strong (∼ -0.5 K kyr-1) for summer and weaker (∼-0.1 K kyr-1) for winter and the annual mean. The warming trends around the Equator appear strongest in the coldest month. Similar patterns occur in the mean annual precipitation, with drying in the Northern and wetting in the Southern Hemisphere. We focus here on temperature and refer the reader with interest in the precipitation changes to Fig. S4. We now analyze the winter (MTCO), summer (MTWA) and annual mean (MAT) temperature patterns reconstructed using BMA and WA (Fig. , middle and bottom rows). Reconstructed winter trend patterns diverge from the simulated trends. In many regions the reconstructed trends are higher than ±1 K kyr-1 in magnitude and thus stronger than anywhere in the simulated model climate. Negative temperature trends in polar regions are not consistently captured, and an east-to-west warm-to-cold gradient appears for both reconstruction techniques WA and BMA. In contrast, the reconstructed summer trends show broad similarities to the simulated temperature changes. Equatorial warming and polar cooling are captured by both WA and BMA. Differences exist in the magnitude of the changes, rather than the sign, except for in the Middle East, where warming is suggested by BMA and WA, and the true simulation trends showed a cooling, in particular around present-day Turkey.

Amongst the climate variables, MTWA appears to be most consistent between simulations and reconstructions. This is also supported by the results of the sign test (described in Sect. ), which yields ν≈0.5 for WA and BMA. MTCO is least consistent (ν≈0.3). Between WA and BMA, results appear more patchy for BMA than for WA (i.e., sign or magnitude vary less gradually across space), but this does not imply that either method captures correct degrees of change. This is further underlined by the temperature standard deviations taken across the trend fields, which are much larger for WA (sd‾=1.8 K, bottom row in Fig. ) and BMA (sd‾=2.9 K, middle row) than for the simulation (sd‾=1.2 K, top row). Thus, for both reconstruction methods reconstructed trends are spatially more heterogeneous than the simulated trends.

The spatial patterns and magnitudes of the reconstructed trends are very similar across all three seasons (compare panels across rows in Fig. ). Visually, they show a stronger similarity than the spatial patterns of the simulated seasonal trends (compare panels of the top row). This is due to the fact that grid cells with large positive or negative trends appear in the same positions across the seasons (i.e., row-wise) but not necessarily across methods (i.e., column-wise). The sign test shows slightly larger correspondences within each row/across seasons for the same method (ν‾=0.59) than for the columns/same season across methods (ν‾=0.47). Due to the influence of the strong trends in the same places, this discrepancy is stronger for Pearson correlations across the fields of Fig. (by method, ρ‾=0.79; by season, ρ‾=0.46). One explanation for this observation could be that all seasonal reconstructions are biased towards a single specific season.

Correlation of coldest and warmest month temperatures. The correlation patterns across modern calibration space (a) are similar to the temporal correlation pattern estimated from WA reconstructions (b). The correlations at the sites picked as modern analogs (c) are similar to those obtained in the final BMA reconstructions (d). In contrast, the “true” temporal correlation pattern from the model temperatures differs considerably from the reconstructed temporal correlation fields. This demonstrates that the correlation in the reconstructions mainly depends on the modern calibration and not, as one would hope for, on the correlation of the Holocene temperature evolution. Crosses in (b) and (d) indicate grid boxes with a r2<0.5 in cross validation.

Seasonal bias of temperature reconstructions

To further investigate this finding, we analyze the correlation between the different seasons in the simulations across modern space and across time and contrast them with the correlation through time between the reconstructed seasonal time series (Fig. ). Ideally, the temporal correlation of the reconstructions should equal the temporal correlation of our “true” (model-simulated) climate evolution. Correlations across modern space are calculated over all the grid points relevant in the calibration and reconstruction process; thus for WA these are all grid boxes in a radius of 2500 km, whereas for BMA only the sites picked as modern analog in the reconstruction are used (see Fig. a for an example). For simplicity, we perform the analysis for winter (MTCO) against summer (MTWA) temperature, but other variable combinations (e.g., temperature against precipitation) would lead to similar results.

Across modern space MTCO and MTWA are mostly positively correlated (Fig. a), as towards the poles temperatures get colder in summers as well as in winter. Exceptions are found around eastern Russia and equatorial regions in Africa, where summer and winter temperatures are anticorrelated across space.

The temporal correlations of the WA-reconstructed MTCO and MTWA (Fig. b) show a very similar pattern of the correlation sign, although with stronger amplitudes of the correlation values. Indeed, the sign test yields ν=0.76, indicating that the large majority of the grid cells in Fig. a and Fig. b share the same sign. In contrast, the “true” temporal MTCO–MTWA correlation over the late Holocene (Fig. e), which should ideally be similar to the reconstructed temporal correlation (Fig. b), shows a different picture (ν=0.26). This suggests that the modern spatial covariance has been directly propagated to the temporal covariance of the reconstructions. Here, and in Fig. 5, we mask grid points for fossil reconstructions with low transfer function performance as measured by the cross validation r2, as we expect them to return less reliable results.

The same observation holds for the BMA-based results (Fig. d). The modern spatial MTCO–MTWA covariances at the sites picked as modern analogs, shown in Fig. c, are noisier than the covariances calculated over all grid boxes but show a similar pattern. The seasonal correlation in the BMA reconstructions again directly follows the modern spatial MTCO–MTWA correlation (ν=0.68). In contrast, the similarity to the actual temporal covariance (Fig. e) is low, as the sign test underlines (ν=0.03).

Reconstruction skill

We showed that the ability to reconstruct Holocene temperature trends in our model world strongly depends on the analyzed season and region (Fig. ). It is also important to quantify the reconstruction skill for the full Holocene evolution, including millennial variability and absolute temperature estimates. We analyze two metrics: (i) the temporal Pearson correlation between the “true” past changes and the climate variable reconstructions (“correlation skill”, Fig. ) and (ii) the RMSE deviation of the reconstructed from the “true” climate. Consistently high correlation skill values for the BMA reconstruction can be found across the Arctic for MTWA and in the Sahel for MAP. Simulated MAT changes are correlated with MTWA changes in the high latitudes, which explains the relatively weaker but positive correlation there. Winter precipitation reconstructions do not show good skill anywhere. Most regions with high positive correlation skill show comparably low temporal RMSE (Fig. S5), whereas many regions with low RMSE do not show high correlation skill. In a real-world situation, the true past climate evolution is unknown and an RMSEP is estimated from the modern calibration set (see Sect. ). In our model world, the RMSEP is below 3 ∘C for MTWA and MAT, whereas it is generally higher for winter temperature, in particular for North America. The low correlation skill for winter temperatures in the Arctic is also reflected by the temporal RMSE and the modern RMSEP (Figs. S5 and S6). A comparison of summer temperature downcore RMSE and modern spatial RMSEP, given in Fig. S7, shows that modern RMSEP is higher than the actual reconstruction error in many places, but there is little resemblance to the patterns of the estimated downcore RMSEP. If the calibration radius is reduced, the modern calibration error decreases (results not shown).

Performance of the BMA calibration models as evaluated by the correlation between the reconstructed and simulated climate variables (a–f) at each grid point. Crosses mask grid boxes with cross validation r2< 0.5.

Testing for the predictability of reconstruction skill

Climate variables explaining most variance in modern vegetation (a), between reconstructed climate and fossil vegetation (b) and simulated climate and fossil vegetation (c). Variables explaining most variance in the modern world (a) are not necessarily those explaining vegetation changes in the “true” model past (c).

The inaccuracy of the covariance estimates (Fig. b) and the dependency of the reconstruction skill on the analyzed climate variable (Fig. ) highlight that it is important to determine which climate variables can be reconstructed in a given setting – and with which other variables they are colinear in the modern training set. We can discern two statistical approaches to identify the driving variable for climate-related vegetation changes: those relying on the modern calibration set and those involving the fossil downcore record. In both, higher variance explained should be reflecting a higher environmental relevance .

In the following, we compare the results of estimating the driving climate variable with both approaches (Fig. a, b), with the pattern of the “true” climate variable explaining most simulated fossil vegetation change in our model simulation (Fig. c). The ordination fields underlying this summary figure are given in the Figs. S8 to S10. For the modern spatial approach, we use CCA ordination of modern PFTs and climate to determine the climate variable which explains most vegetation variance across the modern calibration space (Fig. a). Temperature variables dominate the ordination results globally, except for the Sahel zone, which is dominated by precipitation changes. MTWA explains most variance in Arctic Canada and eastern Siberia, whereas MAT appears to dominate in Siberia and northern Europe. For the fossil downcore record approach, we identify which BMA-reconstructed climate variable explains most variance in the fossil vegetation set using constrained ordination (RDA). The results, as can be seen in Fig. b, are different and less smooth than those obtained for the modern spatial vegetation changes. Note that the patterns we observe here are highly similar to those identified from the ratio of the first two axes of the ordination (; Fig. S11). Finally, as we have access to the “true” past vegetation and climate changes in the model world, we can assess which climate variable explains most simulated fossil vegetation change. The RDA results, shown in Fig. c, confirm a strong summer temperature signal above the Arctic circle and the potential existence of a precipitation signal in the Middle East and the Sahel zone. Contemplating Fig. a, b and c we observe that the driving variables, identified by the fossil downcore approach (Fig. b), are closer to the true (Fig. c) driving variables than the driving variables estimated from the modern calibration dataset (Fig. a). This suggests that looking at the variance explained by downcore reconstructions may tell us more about what actually drove vegetation changes than looking at the variance explained in modern vegetation.

Spatial patterns of BMA transfer function r2 in the modern calibration set (grid points with a distance of less than 2500 km from the reconstruction site) of the six jointly reconstructed climate variables MTCO (a), MTWA (b), MAT (c), MPCO (d), MPWA (e) and MAP (f). Points with a r2<0.5 are crossed out. Transfer function performance appears good, although some variables had little impact on vegetation changes in the past.

Outcome of the significance test using randomTF. Outcome of the significance test using randomTF. All 196 grid points above 50∘ N are considered, and p values are estimated for all climate variables. Actual relevance is obtained by counting the number of times the variable is picked as the most relevant variable in the RDA of simulated climate and vegetation (Fig. ) and dividing by the number of grid cells.

randomTF: significant (p<0.1) randomTF: not significant (p>0.1) Relevance (%) RMSEP r(rec,sim) No. cells (%) RMSEP r(rec,sim) No. cells (%) MTCO (∘C) 1.5 4.16 0.17 13.8 3.31 0.08 86.2 MTWA (∘C) 84.7 0.92 0.71 68.9 2.00 0.37 31.1 MAT (∘C) 13.8 2.43 0.56 23.5 2.13 0.26 76.5 MPCO (mm yr-1) 0.0 180.80 -0.03 9.2 113.63 0.00 90.8 MPWA (mm yr-1) 0.0 237.44 0.06 16.3 184.9 0.00 83.7 MAP (mm yr-1) 0.0 150.52 0.21 15.8 123.76 0.04 84.2

Furthermore, analyzing the variance explained in the modern calibration dataset can suggest a high importance (by a high explained variance) for variables that are not necessarily relevant to vegetation development. This is due to the colinearity of the climate variables (see Fig. b). This is demonstrated in Fig. , which shows the transfer function r2 for all climate variables. In large parts of Siberia, MAT explained most variance (Fig. a). However, MTWA transfer function r2 (Fig. b) is about as high as that of MAT (Fig. c) there and dominates the rest of the Arctic. MAP appears well reconstructible in the Southern Hemisphere, in regions where MTCO also has a high transfer function r2. Seasonal precipitation transfer functions do not perform well on interregional scales outside Africa. There, they appear to perform better, which is likely due to their colinearity with MAP (see Fig. ).

For the potentially more skillful approach of using the downcore reconstruction to test for reconstruction skill, a formalized test (randomTF) has been proposed in . It relies on the comparison between the fossil variance explained by the actual reconstruction and the variance explained by reconstructions based on surrogate modern climate (but using the same modern and fossil pollen assemblages). Above 50∘ N, where temperature changes occur over the course of the 6k run, 84.7 % of the grid cell vegetation changes are identified as most strongly related to MTWA (Table ). If the randomTF test has power, it should indicate a lower p value for reconstructions of climate variables that were related to vegetation changes. Table indicates a significant p value (≤ 0.1) for MTWA in 68.9 % of grid cells. MAT, picked as most relevant in 14 % of the grid cells, appears reconstructible in 23 % of the grid cells. MTCO, MAP, MPCO and MPWA – which have no or little relevance for vegetation development in the region – show up as significant in only 14–16 % of the grid cells. Although our test approach does not meet the criteria of a formal statistical power assessment, these results suggest that randomTF may have indicative power.

Influence of the modern climate background on the reconstructed climate

Ideally, a climate reconstruction should not depend on the climate state in which the calibration set was taken. We test this in a case study by comparing the calibration to the most recent time period (the last 30 years of the model run, equivalent to 0–30 yr BP), which we use throughout the paper, to one for the first period (5970–6000 yr BP) in the simulation. We subsequently perform reconstructions for both calibration periods. Figure shows exemplary BMA results for a Siberian site.

Reconstructions are sensitive to the calibration time period. Warmest month temperature trends for reconstructions based on a calibration for the last 30 years (0 k) and first 30 years (6k) of the model run (a); 6k results are mostly warmer (b). All time series are based on 300 year running means.

Averaged across all reconstruction sites, MTWA reconstructions calibrated at 6k are 0.75 K (-3.6, 1.7 K, 90 % confidence interval) warmer than those based on calibrations at 0 k. In particular, sites across the Northern Hemisphere are reconstructed with warmer temperatures. Relative temperature variations largely match between the reconstructions. Inspection of the locations and temperatures around the analog sites chosen for the 0 and 6k calibrations suggests that the warm bias may be caused by spatial autocorrelation in the vegetation, rather than climate, in addition to other local confounding factors. The 6k analog sites tend to lie further northward (in the Northern Hemisphere) than those for the 0 k calibration. However, the 6k analog sites do not systematically cluster northward. Therefore, the northward migration of the analog sites does not compensate fully for the warmer background climate state, so that the overall reconstructed temperatures are warmer. This demonstrates that, at least in our experiment, the climatological and ecological similarity of the calibration period to the period for reconstruction influences the reconstruction outcome.

The question of whether the detected differences in Fig. are significant or not using the calibration RMSEP is not straightforward. A standard assumption in paleoclimate reconstructions is that errors in time and space are independent as assumed, e.g., in. This assumption would result in a standard error of 0.13 ∘C, thus considerably smaller than the differences we found. In the (unrealistic) extreme case of a complete dependency of errors, the differences would be not significant (standard error 3.5 ∘C). In reality the true uncertainty likely lies between the two extremes assumed here, but a more detailed analysis of the spatial and temporal covariance structure of the proxy uncertainty is required to provide better error estimates.

Discussion

Using a Holocene climate model simulation as a test bed for pollen-based climate reconstructions allowed us to analyze the reconstruction skill and to understand potential seasonal biases of pollen-based climate reconstruction methods.

Correlative uniformitarianism

Transfer function reconstructions rely on the exchangeability of spatial and temporal relationships between climatic, environmental and ecological variables and on the uniformity of correlations across space and time. We have demonstrated that spatial and temporal correlations are not equivalent on orbital timescales in our model world Holocene, which has implications for seasonal temperature reconstructions. The space-for-time substitution in transfer functions hence leads to seasonal biases in the reconstructions, as the assumption of correlative uniformitarianism is violated. This is consistent with findings of , who tested the space-for-time substitution for the prediction of biodiversity changes. They observed that while generalized dissimilarity models fitted across space could predict large-scale patterns of diversity across time through the late Quaternary, the relationship between turnover and environmental variables was different through space and through time. Furthermore, space-for-time substitution was less successful for the Holocene, which is likely due to the relatively smaller temporal climate variations compared to the spatial variations. showed that reconstructions using different modern calibration datasets differed in their means and variations around this mean. The calibration datasets had different temperature distributions. This could be a consequence of a violation of correlative uniformitarianism: the relationships between climate variables and ecological changes, which are transferred to the final reconstruction, are likely different for calibrations extending to different locations (as in ) or for different time periods (as in Sect. 3.5).

Limitations

The complexity of the vegetation representation in the model, as well as the simulated climate evolution, is a strong simplification of reality. Therefore, results on the Holocene evolution of specific PFTs, the actual spatial pattern of PFTs or the reconstructability of a certain climate variable in a certain region should not be directly translated to actual pollen-based climate reconstructions. However, conclusions on reconstruction methods and the relation of spatial calibration and downcore reconstruction only require a consistent dataset of climate and vegetation parameters in space and time and do not depend on details of the climate evolution or vegetation response, as long as the dataset is realistic enough that we can apply the PFT-based reconstruction workflow. The major factor shaping our results is that the modern spatial relationships between climate variables is different from the changes in the relationships over time, which is a robust feature related to the transient insolation forcing .

One might be concerned that the low number of simulated PFTs, or the low spatial resolution of the model, might bias our reconstruction efforts. However, we showed that the actual information contained in the plant functional types and the spatial climate field is not fundamentally different from that in the PFTs (or taxa) and the climate calibration datasets used in real-world reconstructions (Figs. S1 and S2). Note that it is likely, but not proven, that a larger Hill's N2 ensures more meaningful reconstructions. What constitutes a too low number, or a significant difference in N2, is as yet unknown.

Given the design of our study, we have limited our analyses to identifying general features of the calibration vs. reconstruction relationship rather than interpreting the actual numbers of temperature changes or reconstruction biases. Furthermore, we assumed perfect proxy recording and did not add any non-climatic noise. If these were added, tests which rely on the downcore record, such as randomTF, may become less powerful, and downcore RMSE could become higher.

Our main study region – the Northern Hemisphere and Arctic Russia in particular – is characterized by cold temperatures and is particularly sensitive to the orbital changes in the model simulation. Hence MTWA is the predominant driving variable. Multivariate analyses suggest that this is not the case everywhere (see Fig. ). Given the conceptual nature of our study – and the simplicity of the vegetation model – we have limited our discussion to the identifiability of a single driving variable. This does not exclude that in other regions multiple climatic controls on vegetation may be more important. Thus, other regions may be better suited to test the ability of transfer functions to disentangle changes in multiple climate variables.

Identification of climate variables driving vegetation evolution through time

Our study shows that in our model world, regardless of the reconstruction technique, the reconstructed climate evolution is very similar between the variables (Fig. ). This strong covariance between the variables is determined by the modern spatial covariance and not, as one would hope, the temporal covariance of local climate (Fig. ). This finding can be understood in a simple thought experiment. Let us assume that the vegetation evolution at every grid point would be driven by one single variable. This single variable could be one of the analyzed variables (e.g., summer temperature) or any other variable, such as the length of the growing season, cloudiness or soil moisture. All other variables have no direct influence on the vegetation and are merely covarying with the driving variable. In this case, the reconstructed covariability is implicit in the transfer function and fully determined from the modern spatial relationship, regardless of the true past relationship between the variables, and this is similar to what we found (Figs. and ). Reconstruction skill will consequently depend on whether we reconstruct the driving variable or, in the case that we reconstruct a secondary variable, on whether the relationship with the driving variable is the same across space and in time. The example of our model world Arctic shows that the latter is not always the case. Past vegetation changes there, as Fig. shows, were predominantly driven by summer temperature and mean annual temperature change, yet the modern transfer function r2 for MTCO is acceptable in most grid boxes (Fig. ). Skill for winter temperature reconstructions is, however, low (Fig. a), particularly in regions where the modern spatial covariance between summer and winter temperatures (Fig. a, c) is negative, whereas the temporal covariance is positive (Fig. e).

Simulated (red) and BMA-reconstructed (black) extratropical mean temperature changes over the 6k run (BMA). The amplitude of the summer temperature trends (a) agree well, whereas the amplitude for the simulated mean annual temperature change (b) is overestimated in the reconstructions.

Therefore, an important question is whether we can determine the variable driving vegetation changes. This would increase our confidence in the reconstruction. In the simplest case, vegetation patterns across modern space are only determined by the current climate. In this case, the climate variable maximizing the modern spatial correlation, information accessible in the real world, would be the driving variable (Fig. a). However, the variable explaining most of the modern spatial vegetation variance was, in our evaluation, not necessarily the one explaining most of the temporal vegetation evolution (compare Fig. a vs. c). Therefore, either other parameters beyond modern climate play a role or the driving variable was not included in our set of six variables. In the model world, and likely in reality, both occur. Evolving parameters such as soil properties partly determine the spatial vegetation distribution, but they are constant over time in the model world. However, the chances of identifying the correct driving variable are also small, as, for example, the length of the growing season might have a stronger influence than summer temperature. What follows from this is that methods that rely only on the modern spatial climate–vegetation relationship are insufficient to identify the driving variables across time. Here, inverse modeling reconstruction techniques which do not rely on modern spatial calibration sets may provide useful additional information. In addition to the downcore tests outlined in Sect. 3.4 a priori expert knowledge on regional ecology is helpful to identify variables of climatic and ecological relevance.

Seasonal bias on reconstructed trends in non-driving variables

In the northern hemispheric extratropics of our model world, summer temperature is the variable driving vegetation change across the mid-to-late Holocene. The modern spatial correlation between summer, winter and consequently also mean annual temperatures is positive. Since the modern spatial information determines the downcore temporal reconstruction for all variables, the reconstructions of winter–annual mean temperature changes are biased towards the trend in summer temperatures. What are the implications of such a bias on reconstructions of climate variables which are not primarily influencing vegetation? Figure shows the simulated and BMA-reconstructed summer and annual mean temperature for the northern hemispheric extratropics (all grid boxes north of 50∘ N). Patterns and magnitudes are highly similar for WA, as well as when only grid boxes with summer–annual mean temperature as dominant variables are picked (not shown). Mid-to-late Holocene summer temperatures are slightly overestimated, but the trend and magnitude are correct. In contrast, the annual mean cooling has the same magnitude as the reconstructed (and simulated) summer cooling – it is exaggerated due to the summer bias in the reconstruction. This could affect the reconstruction of the annual mean temperature evolution of the past 11 000 years . The reconstructed cooling trend in the mid–late Holocene was stronger than the cooling simulated by climate models. This mismatch is potentially related to a seasonal bias of the reconstruction and insolation changes as latent and unconsidered variables. Seasonal insolation changes are likely to have direct effects on vegetation by changing the season length, and thus the number of days for growth, and indirect effects by changing local temperatures and their seasonality. Another example is the comparison between pollen-proxy-based and climate-model-simulated winter temperature changes between the Last Glacial Maximum and present day, which are stronger in the reconstructions than in the model simulations . Such a correlation bias on jointly reconstructed climate variables is hard to detect and prove for real-world data. However, the above considerations suggest that for non-driving variables, physically implausible temperature reconstructions may arise due to correlations across modern space. Consequently, estimated temperature trends based on proxy data may appear larger than in the model world or may have a different shape. Given our above results, such findings could potentially be explained as changes that are overestimated in the proxy data due to confounding effects of third variables, for example summer length or precipitation changes.

Implications and Outlook

We have focused our analysis on the seasonal evolution of temperatures. However, it is likely that similar biases also affect pollen-assemblage-based reconstructions of other climate variables, such as precipitation. In this light, the result of larger pollen-derived than model-simulated precipitation changes between the mid-Holocene and present day might be influenced by a reconstruction bias, as the linkage between temperature and precipitation may differ across space, time and timescales . Similarly, that modern spatial relationships differ from past temporal relationships might also affect other assemblage-based climate reconstructions. Examples include planktonic foraminifera counts, which are used to reconstruct marine temperature changes; in this case, the climate variables include water temperature at different seasons and water depths . Similar effects might also be in place for other environmental or climate proxies such as chironomids, diatoms and dinoflagellates , which all rely on modern spatial calibration approaches. Consequently, it would be interesting to study ecological, geographical and climatic effects on reconstruction results in other ecological models e.g., FORAMCLIM;. In the vegetation model used, the simulated PFTs have broad climatic tolerances (Table S1). This might exaggerate the seasonal bias problem, as the winter sensitivity of the simulated vegetation might too be low. While this would strengthen our general conclusion that transfer function diagnostics based on modern calibration data alone are not sufficient to characterize reconstructability, it asks for a cautious interpretation of the magnitude of the reconstruction bias.

More work is needed to quantify the impact of seasonality and other secondary variables on temperature estimates based on biomarker proxies and to develop methods that acknowledge and account for confounding variables in the reconstruction. Repeating this study with a dynamic vegetation model that simulates a larger number of PFTs e.g., LPJ-GUESS; or with models for marine ecology e.g., FORAMCLIM; could provide more insight. Transient paleoclimate model experiments with more complex land surface and biosphere schemes (i.e., with a larger number of PFTs) would be particularly useful to test whether assemblage-based climate reconstruction methods allow for the accurate joint reconstruction of several climate variables.

Future work, extending the conceptual approach in this study, could test the following:

The reconstructability of multiple climate parameters could be assessed in an idealized setting. This could be done using artificial vegetation and climate or a coupled climate model with a vegetation model of higher complexity (than JSBACH) and/or with larger climatic changes. It could also allow in-depth tests for the predictability of reconstruction skill for one or more climate parameters e.g., using methods described in.

The impact of species richness on the model error is unknown. Tests could employ vegetation models of different complexity run for the same climate forcing (e.g., by contrasting JSBACH results with LPJ-GUESS results) or random datasets e.g.,. Here, it is particularly important to exploit the independence of the modern validation statistics.

Adding proxy noise and age uncertainty would allow a more in-depth comparison of spatial and temporal errors, and a more representative test of the randomTF algorithm.

A first estimate of potential biases in model–data comparison of multiple climate variables can be obtained through the comparison of simulated spatial and temporal covariances. If they are very different, caution is called for in the interpretation of joint proxy reconstructions of these variables.

Conclusions

Using a Holocene climate model simulation with interactive vegetation as a test bed, we analyzed the skill and potential biases in pollen-based climate reconstructions. We find that in our model experiments, transfer function reconstruction methods pull the spatial covariances between climate variables through into the downcore temporal reconstructions. As a consequence, temporal changes of a dominant climate variable (for the Northern Hemisphere, often summer temperature) are imprinted on a less important variable (here often winter temperature), leading to reconstructions biased towards the dominant variable's trends. Given the conceptual nature of our study, we consider these results as primarily illustrative and have limited ourselves to testing the reconstructability of individual parameters. More work is needed to develop and test methods for the reconstruction of multiple climate parameters and for the predictability of reconstruction skill.

One assumption underpinning transfer function climate reconstructions is that environmental variables other than those considered in the calibration are not important or that their relationship with the reconstructed variable(s) is the same in the past as in the modern spatial calibration dataset. In our model world, we have clearly shown that this assumed correlative uniformitarianism is violated, as the modern spatial relationship between climate variables, such as winter and summer temperatures, and the past temporal relationship often differs. Translating this to real-world reconstructions would imply that large-scale reconstructions of multiple climate variables need to be carefully considered, as reconstructions of climate variables which are not primarily influencing vegetation can be biased. It would also imply that the driving climate variables cannot be reliably determined by only analyzing the modern spatial climate–vegetation relationship. Therefore, climate variables which actually drove vegetation variability in the past are likely better identified using expert knowledge on ecology and with statistical analyses involving the fossil vegetation record.

Acronyms

PFT

Plant functional type teT

PFT: tropical evergreen trees

tdT

PFT: tropical deciduous trees

eteT

PFT: extratropical evergreen trees

etdT

PFT: extratropical deciduous trees

PFT: raingreen shrubs

PFT: cold shrubs

PFT: C3 grass

PFT: C4 grass

surrogate PFT: bare soil

BMA

Best modern analog method (in literature also known as modern analog technique)

Weighted averaging

RDA

Redundancy analysis

CCA

Canonical correspondence analysis

RMSE(P)

Root mean square error (of prediction)

MAT

Mean annual temperature

MTWA

Mean temperature warmest month

MTCO

Mean temperature coldest month

PANN

Mean annual precipitation

MPCO

Mean precipitation coldest month

MPWA

Mean precipitation warmest month

Code availability

All analyses were carried out in the open source environment R, version 3.2.2. Reconstructions were performed using the rioja package (v. 0.9-5), paleosig (v. 1.1-3) and the vegan library (v. 2.3-0). The code is available on request.

Data availability

The output of the ECHAM5/MPIOM model was provided by Anne Dallmeyer and Johann Jungclaus. Model data are available at 10.1594/PANGAEA.773607 .

The Supplement related to this article is available online at doi:10.5194/cp-12-2255-2016-supplement.

Acknowledgements

We gratefully acknowledge Anne Dallmeyer and Johann Jungclaus for providing the ECHAM5/MPIOM model output, Ulrike Herzschuh for discussion and John W. Williams and two anonymous reviewers helped us to improve the manuscript. Andrew Dolman is thanked for proofreading. We thank the Initiative and Networking Fund of the Helmholtz Association (grant VH-NG-900) for funding and the DAAD-PPP program (contract 57160457) for travel support. The article processing charges for this open-access publication were covered by a Research Centre of the Helmholtz Association. Edited by: E. Zorita Reviewed by: three anonymous referees

References Bartlein et al.(2011)

Bartlein, P. J., Harrison, S. P., Brewer, S., Connor, S., Davis, B. A. S., Gajewski, K., Guiot, J., Harrison-Prentice, T. I., Henderson, A., Peyron, O., Prentice, I. C., Scholze, M., Seppä, H., Shuman, B., Sugita, S., Thompson, R. S., Viau, A. E., Williams, J., and Wu, H.: Pollen-based continental climate reconstructions at 6 and 21 ka: A global synthesis, Clim. Dynam., 37, 775–802, 10.1007/s00382-010-0904-1, 2011.

Birks et al.(2010)

Birks, H. J. B., Heiri, O., Seppä, H., and Bjune, A. E.: Strengths and Weaknesses of Quantitative Climate Reconstructions Based on Late-Quaternary Biological Proxies, Open Ecol. J., 3, 68–110, 10.2174/1874213001003020068, 2011.

Birks and Seppä(2005)

Birks, H. J. B. and Seppä, H.: Pollen-based reconstructions of late-Quaternary climate in Europe – progress, problems, and pitfalls, Acta Palaeobot., 44, 317–334, 2005.

Blois et al.(2013)

Blois, J. L., Williams, J. W., Fitzpatrick, M. C., Jackson, S. T., and Ferrier, S.: Space can substitute for time in predicting climate-change effects on biodiversity, P. Natl. Acad. Sci. USA, 110, 9374–9379, 2013.

Borcard et al.(2011)

Borcard, D., Gillet, F., and Legendre, P.: Numerical Ecology with R, Springer, New York, 301 pp., 10.1007/978-1-4419-7976-6, 2011.

Braconnot et al.(2012)

Braconnot, P., Harrison, S. P., Kageyama, M., Bartlein, P. J., Masson-Delmotte, V., Abe-Ouchi, A., Otto-Bliesner, B., and Zhao, Y.: Evaluation of climate models using palaeoclimatic data, Nature Climate Change, 2, 417–424, 10.1038/nclimate1456, 2012.

Brovkin et al.(2009)

Brovkin, V., Raddatz, T., Reick, C. H., Claussen, M., and Gayler, V.: Global biogeophysical interactions between forest and climate, Geophys. Res. Lett., 36, L07405, 10.1029/2009GL037543, 2009.

Dallmeyer et al.(2011)

Dallmeyer, A., Claussen, M., Herzschuh, U., and Fischer, N.: Holocene vegetation and biomass changes on the Tibetan Plateau – a model-pollen data comparison, Clim. Past, 7, 881–901, 10.5194/cp-7-881-2011, 2011.

Dallmeyer et al.(2013)

Dallmeyer, A., Claussen, M., Wang, Y., and Herzschuh, U.: Spatial variability of Holocene changes in the annual precipitation pattern: A model-data synthesis for the Asian monsoon region, Clim. Dynam., 40, 2919–2936, 10.1007/s00382-012-1550-6, 2013.

Dallmeyer et al.(2015)

Dallmeyer, A., Claussen, M., Fischer, N., Haberkorn, K., Wagner, S., Pfeiffer, M., Jin, L., Khon, V., Wang, Y., and Herzschuh, U.: The evolution of sub-monsoon systems in the Afro-Asian monsoon region during the Holocene- comparison of different transient climate model simulations, Clim. Past, 11, 305–326, 10.5194/cp-11-305-2015, 2015.

Davis et al.(2003)

Davis, B. A. S., Brewer, S., Stevenson, A., and Guiot, J.: The temperature of Europe during the Holocene reconstructed from pollen data, Quaternary Sci. Rev., 22, 1701–1716, 10.1016/S0277-3791(03)00173-2, 2003.

Fischer and Jungclaus(2011)

Fischer, N. and Jungclaus, J. H.: Evolution of the seasonal temperature cycle in a transient Holocene simulation: orbital forcing and sea-ice, Clim. Past, 7, 1139–1148, 10.5194/cp-7-1139-2011, 2011.

Fischer and Jungclaus(2012)

Fischer, N. and Jungclaus, J. H.: Holocene experiment with coupled atmosphere-ocean-model ECHAM5/MPI-OM, 10.1594/PANGAEA.773607, 2012.

Gould(1965)

Gould, S. J..: Is uniformitarianism necessary?, Am. J. Sci., 263, 223–228, 1965.

Guiot et al.(2009)

Guiot, J., Wu, H. B., Garreta, V., Hatté, C., and Magny, M.: A few prospective ideas on climate reconstruction: from a statistical single proxy approach towards a multi-proxy and dynamical approach, Clim. Past, 5, 571–583, 10.5194/cp-5-571-2009, 2009.

Harrison et al.(2014)

Harrison, S. P., Bartlein, P. J., Brewer, S., Prentice, I. C., Boyd, M., Hessler, I., Holmgren, K., Izumi, K., and Willis, K.: Climate model benchmarking with glacial and mid-Holocene climates, Clim. Dynam., 43, 671–688, 10.1007/s00382-013-1922-6, 2014.

Hijmans et al.(2005)

Hijmans, R. J., Cameron, S. E., Parra, J. L., Jones, P. G., and Jarvis, A.: Very high resolution interpolated climate surfaces for global land areas, Int. J. Climatol., 25, 1965–1978, 2005.

Hill(1973)

Hill, M. O.: Diversity and evenness: a unifying notation and its consequences, Ecology, 54, 427–432, 10.2307/1934352, 1973.

Hill and Gauch(1980)

Hill, M. O. and Gauch, H. G.: Detrended correspondence analysis: an improved ordination technique, Vegetatio, 42, 47–58, 1980.

Juggins(2013)

Juggins, S.: Quantitative reconstructions in palaeolimnology: new paradigm or sick science?, Quaternary Sci. Rev., 64, 20–32, 10.1016/j.quascirev.2012.12.014, 2013.

Juggins and Birks(2012)

Juggins, S. and Birks, H. J. B.: Data handling and numerical techniques, in: Dev. Paleoenviron. Res. Track. Environ. Chang., Using Lake Sediments, edited by: Birks, H. J. B., Lotter, A. F., Juggins, S., and Smol, J. P., 14, 745 pp., Springer, Berlin/Heidelberg, 2012.

Jungclaus et al.(2006)

Jungclaus, J. H., Keenlyside, N., Botzet, M., Haak, H., Luo, J.-J., Latif, M., Marotzke, J., Mikolajewicz, U., and Roeckner, E.: Ocean Circulation and Tropical Variability in the Coupled Model ECHAM5/MPI-OM, J. Climate, 19, 3952–3972, 10.1175/JCLI3827.1, 2006.

Küttel et al.(2007)

Küttel, M., Luterbacher, J., Zorita, E., Xoplaki, E., Riedwyl, N., and Wanner, H.: Testing a European winter surface temperature reconstruction in a surrogate climate, Geophys. Res. Lett., 34, 2–7, 10.1029/2006GL027907, 2007.

Laepple and Lohmann(2009)

Laepple, T. and Lohmann, G.: Seasonal cycle as template for climate variability on astronomical timescales, Paleoceanography, 24, PA4201, 10.1029/2008PA001674, 2009.

Laepple and Huybers(2014)

Laepple, T. and Huybers, P.: Ocean surface temperature variability: Large model-data differences at decadal and longer periods, P. Natl. Acad. Sci. USA, 111, 16682–16687, 10.1073/pnas.1412077111, 2014.

Liu et al.(2014)

Liu, Z., Zhu, J., Rosenthal, Y., Zhang, X., Otto-Bliesner, B. L., Timmermann, A., Smith, R. S., Lohmann, G., Zheng, W., and Elison Timm, O.: The Holocene temperature conundrum, P. Natl. Acad. Sci. USA, 111, E3501–E3505 10.1073/pnas.1407229111, 2014.

Lombard et al.(2011)

Lombard, F., Labeyrie, L., Michel, E., Bopp, L., Cortijo, E., Retailleau, S., Howa, H., and Jorissen, F.: Modelling planktic foraminifer growth and distribution using an ecophysiological multi-species approach, Biogeosciences, 8, 853–873, 10.5194/bg-8-853-2011, 2011.

Mann et al.(2005)

Mann, M. E., Rutherford, S., Wahl, E., and Ammann, C.: Testing the fidelity of methods used in proxy-based reconstructions of past climate, J. Climate, 18, 4097–4107, 2005.

Marcott et al.(2013)

Marcott, S. A., Shakun, J. D., Clark, P. U., and Mix, A. C.: A reconstruction of regional and global temperature for the past 11 300 years, Science, 339, 1198–201, 10.1126/science.1228026, 2013.

Mauri et al.(2014)

Mauri, A., Davis, B. A. S., Collins, P. M., and Kaplan, J. O.: The influence of atmospheric circulation on the mid-Holocene climate of Europe: a data-model comparison, Clim. Past, 10, 1925–1938, 10.5194/cp-10-1925-2014, 2014.

Meyer et al.(2015)

Meyer, H., Opel, T., Laepple, T., Dereviagin, A. Y., Hoffmann, K., and Werner, M.: Long-term winter warming trend in the Siberian Arctic during the mid- to late Holocene, Nat. Geosci., 8, 122–125, 10.1038/ngeo2349, 2015.

Overpeck et al.(1985)

Overpeck, J., Webb, T., and Prentice, I. C.: Quantitative interpretation of fossil pollen spectra: Dissimilarity coefficients and the method of modern analogs, 10.1016/0033-5894(85)90074-2, 1985.

Raddatz et al.(2007)

Raddatz, T. J., Reick, C. H., Knorr, W., Kattge, J., Roeckner, E., Schnur, R., Schnitzler, K. G., Wetzel, P., and Jungclaus, J.: Will the tropical land biosphere dominate the climate-carbon cycle feedback during the twenty-first century?, Clim. Dynam., 29, 565–574, 10.1007/s00382-007-0247-8, 2007.

Rehfeld and Laepple(2016)

Rehfeld, K. and Laepple, T.: Warmer and wetter or warmer and dryer? Observed versus simulated covariability of Holocene temperature and rainfall in Asia, Earth Planet. Sc. Lett., 436, 1–9, 10.1016/j.epsl.2015.12.020, 2016.

Salonen et al.(2013)

Salonen, J. S., Helmens, K. F., and Seppä, H., and Birks, H. J. B.: Pollen-based palaeoclimate reconstructions over long glacial-interglacial timescales: Methodological tests based on the Holocene and MIS 5d-c deposits at Sokli, northern Finland, J. Quaternary Sci., 3, 271–282, 10.1002/jqs.2611, 2013.

Scott(1963)

Scott, G. H.: Uniformitarianism, the uniformity of nature, and paleoecology, New Zeal. J. Geol. Geophys., 6, 510–527, 10.1080/00288306.1963.10420063, 1963.

Sitch et al.(2003)

Sitch, S., Smith, B., Prentice, I. C., Arneth, A., Bondeau, A., Cramer, W., Kaplan, J. O., Levis, S., Lucht, W., Sykes, M. T., Thonicke, K., and Venevsky, S.: Evaluation of ecosystem dynamics, plant geography and terrestrial carbon cycling in the LPJ dynamic global vegetation model, Glob. Change Biol., 9, 161–185, 2003.

Telford and Birks(2005)

Telford, R. J. and Birks, H. J. B.: The secret assumption of transfer functions: Problems with spatial autocorrelation in evaluating model performance, Quaternary Sci. Rev., 24, 2173–2179, 10.1016/j.quascirev.2005.05.001, 2005.

Telford and Birks(2009)

Telford, R. J. and Birks, H. J. B.: Evaluation of transfer functions in spatially structured environments, Quaternary Sci. Rev., 28, 1309–1316, 10.1016/j.quascirev.2008.12.020, 2009.

Telford and Birks(2011)

Telford, R. J. and Birks, H. J. B.: A novel method for assessing the statistical significance of quantitative reconstructions inferred from biotic assemblages, Quaternary Sci. Rev., 30, 1272–1278, 10.1016/j.quascirev.2011.03.002, 2011.

Telford et al.(2013)Telford, Li, and Kucera

Telford, R. J., Li, C., and Kucera, M.: Mismatch between the depth habitat of planktonic foraminifera and the calibration depth of SST transfer functions may bias reconstructions, Clim. Past, 9, 859–870, 10.5194/cp-9-859-2013, 2013.

ter Braak et al.(1996)ter Braak, van Dobben, and di Bella

Ter Braak, C. J., van Dobben, H., and di Bella, G.: On inferring past environmental change from species composition data by nonlinear reduced rank models, in: Invited Papers, the XIIIth International Biometric Conference, The Biometric Society, edited by: van Houwelingen H. C., Amsterdam, 65–70, 1996.

Trenberth(2005)

Trenberth, K. E.: Relationships between precipitation and surface temperature, Geophys. Res. Lett., 32, 2–5, 10.1029/2005GL022760, 2005.

von Storch et al.(2004)

von Storch, H., Zorita, E., Jones, J. M., Dimitriev, Y., González-Rouco, F., and Tett, S. F. B.: Reconstructing past climate from noisy data, Science, 306, 679–82, 10.1126/science.1096109, 2004.

Wanner et al.(2008)

Wanner, H., Beer, J., Bütikofer, J., Crowley, T. J., Cubasch, U., Flückiger, J., Goosse, H., Grosjean, M., Joos, F., Kaplan, J. O., Küttel, M., Müller, S. A., Prentice, I. C., Solomina, O., Stocker, T. F., Tarasov, P., Wagner, M., and Widmann, M.: Mid- to Late Holocene climate change: an overview, Quaternary Sci. Rev., 27, 1791–1828, 10.1016/j.quascirev.2008.06.013, 2008.

Yu(2013)

Yu, S.-Y.: Quantitative reconstruction of mid- to late-Holocene climate in NE China from peat cellulose stable oxygen and carbon isotope records and mechanistic models, Holocene, 23, 1507–1516, 10.1177/0959683613496292, 2013.

</app></app-group></back> </article>