Articles | Volume 20, issue 10
https://doi.org/10.5194/cp-20-2373-2024
https://doi.org/10.5194/cp-20-2373-2024
Research article
 | 
24 Oct 2024
Research article |  | 24 Oct 2024

Can machine-learning algorithms improve upon classical palaeoenvironmental reconstruction models?

Peng Sun, Philip B. Holden, and H. John B. Birks

Model code and software

Rioja: analysis of Quaternary science data S. Juggins https://github.com/nsj3/rioja

palaeoSig: Significance Tests for Palaeoenvironmental Reconstructions R. J. Telford and M. Trachsel https://github.com/richardjtelford/palaeoSig

Schimasuperbra/MEMLM: Multi Ensemble Machine Learning Model (v1.0.0) P. Sun https://doi.org/10.5281/zenodo.13138593

Download
Short summary
We develop the Multi Ensemble Machine Learning Model (MEMLM) for reconstructing palaeoenvironments from microfossil assemblages. The machine-learning approaches, which include random tree and natural language processing techniques, substantially outperform classical approaches under cross-validation, but they can fail when applied to reconstruct past environments. Statistical significance testing is found sufficient to identify these unreliable reconstructions.