Rajabi-Kiasari et al. (2026) Forecasting sea level maxima using Machine learning with explainability and extreme value analysis
Identification
- Journal: International Journal of Applied Earth Observation and Geoinformation
- Year: 2026
- Date: 2026-01-06
- Authors: Saeed Rajabi-Kiasari, Delpeche-Ellmann Nicole, Artu Ellmann, Tarmo Soomere
- DOI: 10.1016/j.jag.2025.105064
Research Groups
- Department of Civil Engineering and Architecture, School of Engineering, Tallinn University of Technology, Estonia
- Department of Cybernetics, School of Science, Tallinn University of Technology, Estonia
Short Summary
This study develops a two-fold framework combining machine learning (ML)/deep learning (DL) for short-term daily sea level maxima (SLM) forecasting and extreme value theory (EVT) for long-term extreme sea level analysis in the Baltic Sea. The framework demonstrates that hybrid neural networks (CNN-GRU, MLP) achieve the best short-term forecasting performance (RMSE 7–15 cm), accurately capturing most peaks up to 150 cm, while EVT provides long-term risk assessment for rarer extremes.
Objective
- Characterize sea level maxima (extremes) by magnitude, frequency, and duration.
- Identify the most effective ML/DL models for short-term one-day-ahead forecasting.
- Compare models for peaks/storm event detection.
- Determine the key contributors via Explainable ML.
- Estimate return levels and periods of extreme sea levels for long-term prediction (<100 years).
Study Configuration
- Spatial Scale: Baltic Sea, focusing on six tide gauge stations: Narva-Jõesuu (Gulf of Finland), Ristna (north-eastern Baltic proper), Oulu (Bothnian Bay), Greifswald (south-western Baltic), Władysławowo (southern Baltic Sea), and Kungsholmsfort (southern Baltic proper).
- Temporal Scale: Data collected from 1971 to 2022 (52 years). Short-term forecasting is one-day-ahead. Long-term extreme value analysis estimates return periods up to 100 years.
Methodology and Data
- Models used:
- Machine Learning (ML): Random Forest (RF), Extreme Gradient Boosting (XGB), Multilayer Perceptron (MLP).
- Deep Learning (DL): CNN-LSTM, CNN-GRU (hybrid models).
- Extreme Value Theory (EVT): Block maxima method with Generalized Extreme Value (GEV) distribution.
- Optimization: Bayesian optimization (BO) for hyperparameter tuning.
- Explainability: SHapley Additive exPlanations (SHAP).
- Data sources:
- Hourly relative sea level data from six tide gauge stations (1971–2022), converted to Baltic Sea Chart Datum (BSCD2000). Sources: envir.ee (Estonia), ilmatieteenlaitos.fi (Finland), smhi.se (Sweden), wsv.bund.de (Germany), imgw.pl (Poland).
- Meteorological variables (wind components, surface pressure, evaporation, precipitation, river runoff) from ERA5 re-analysis dataset.
- Baltic Sea Index (BSI) from ERA5.
- Significant Wave Height (SWH) data from SWAN wave model (1971–2005) and WAM wave model (2006–2022).
Main Results
- The CNN-GRU model consistently outperformed other approaches, achieving test Root Mean Square Error (RMSE) values ranging from 7 cm (Kungsholmsfort) to 14.9 cm (Oulu), and R-squared (R2) values from 0.61 (Greifswald) to 0.86 (Ristna).
- The Multilayer Perceptron (MLP) model also performed strongly, particularly at Oulu (RMSE 14.5 cm, R2 0.78) and Władysławowo (RMSE 8 cm, R2 0.85).
- Tree-based models (Random Forest and Extreme Gradient Boosting) frequently exhibited overfitting.
- All models accurately captured most sea level maxima (SLM) events up to 150 cm, but consistently underestimated rarer extreme peaks exceeding 150 cm.
- Explainability analysis (SHAP) of the CNN-GRU model revealed that the previous day’s SLM (prefilling) was the most influential feature across all stations (~40% importance).
- Regional variations in meteorological drivers were observed: eastern stations (Narva, Ristna) were primarily influenced by local pressure and wind, while western and northern stations (Oulu, Kungsholmsfort, Greifswald) were more affected by large-scale atmospheric forcing represented by the Baltic Sea Index (BSI), pressure, and wind.
- Extreme Value Theory (EVT) analysis indicated that winter extremes of approximately 150 cm correspond to return periods of about 5 years at Narva and 7 years at Oulu.
Contributions
- First study to integrate Bayesian Optimization with hybrid CNN-LSTM and CNN-GRU models for sea level forecasting.
- Developed a transparent and explainable framework for daily SLM forecasting using SHAP analysis.
- Demonstrated an improved effective forecasting range for SLM from approximately 100 cm to 150 cm compared to previous regional studies.
- Provided a robust and interpretable framework for extreme sea level prediction by integrating ML/DL-based short-term forecasting with EVT for long-term risk assessment.
- Conducted comprehensive exploratory data analysis, feature selection, model optimization, and evaluation using multiple statistical metrics, including storm event detection.
Funding
- Estonian Research Council grants PRG1129 and PRG1785 DYNAREF.
Citation
@article{RajabiKiasari2026Forecasting,
author = {Rajabi-Kiasari, Saeed and Nicole, Delpeche-Ellmann and Ellmann, Artu and Soomere, Tarmo},
title = {Forecasting sea level maxima using Machine learning with explainability and extreme value analysis},
journal = {International Journal of Applied Earth Observation and Geoinformation},
year = {2026},
doi = {10.1016/j.jag.2025.105064},
url = {https://doi.org/10.1016/j.jag.2025.105064}
}
Original Source: https://doi.org/10.1016/j.jag.2025.105064