Summary and Outline

This thesis addresses my final project for the 2019/20 edition of the Master in High-Performance Computing at SISSA and ICTP.

Earth-system and environmental models calibration is a complex, computationally intensive task. At present, there is no general theory of model calibration, but instead a large collection of methods, algorithms and case studies. As a result, calibration is often more an art than a science: one must make several discretionary choices, guided more by his own experience and intuition than by the scientific method.

One of the challenges is the large number of parameters involved. For this reason, preliminary sensitivity analysis may be used to reduce this number and select the relevant parameters. Still, the computational load of sensitivity analysis and calibration is high.

In this work I used High-Performance Computing solutions to calibrate GEOtop [RBO06,EGDallAmicoR14], a complex, over parameterized hydrological model. I used the derivative-free optimization algorithms implemented in the Facebook Nevergrad Python library [RT18], and run them on the Ulysses v2 HPC cluster, thanks to the Dask framework [Tea16].

GEOtop has been used to simulate the time evolution of variables as soil water content and evapotranspiration of mountain agricultural sites in South Tyrol with different elevations, land cover (pasture, meadow, orchard), and soil types. In these simulations GEOtop solved the energy and water budget equations on a one-dimensional domain, i.e. on a thin column of soil and neglecting the lateral fluxes. Even in the simplified case of homogeneous soil, one has tens of parameters. These parameters control the soil and vegetation properties, but only a few of them are experimentally available, hence the need for calibration.

The computational aspects of GEOtop calibration have been examined, and the important issue of robustness against model convergence failures has been addressed. Finally, the scaling of calibration time has been measured up to 1024 cores.

The outline of the thesis is the following:

  1. Introduction and motivations. Where I introduce relevant information about the GEOtop model. I also discuss the problem of GEOtop calibration, and the need for High-Performance Computing.

  2. Problem, methodology and implementation. Where I state the problem in mathematical terms, but without mathematical rigour. Afterwards, I discuss the tools and implementation details of calibration.

  3. Results and conclusions. Finally, I present the results and scaling of calibration, focusing on the HPC content.

EGDallAmicoR14

S. Endrizzi, S. Gruber, M. Dall'Amico, and R. Rigon. Geotop 2.0: simulating the combined energy and water balance at and below the land surface accounting for soil freezing, snow cover and terrain effects. Geoscientific Model Development, 7(6):2831–2857, 2014. URL: https://gmd.copernicus.org/articles/7/2831/2014/, doi:10.5194/gmd-7-2831-2014.

RT18

J. Rapin and O. Teytaud. Nevergrad — a gradient-free optimization platform. 2018. Last accessed on 2021-2-17. URL: https://GitHub.com/FacebookResearch/Nevergrad.

RBO06

Riccardo Rigon, Giacomo Bertoldi, and Thomas M. Over. Geotop: a distributed hydrological model with coupled water and energy budgets. Journal of Hydrometeorology, 7(3):371 – 388, 01 Jun. 2006. URL: https://journals.ametsoc.org/view/journals/hydr/7/3/jhm497_1.xml, doi:10.1175/JHM497.1.

Tea16

Dask Development Team. Dask: Library for dynamic task scheduling. 2016. URL: https://dask.org.