Oceans are changing rapidly under the effect of climate change. Both biophysical and socio-ecological impacts are already being felt 1 and are expected to increase considerably in the coming decades 2 as a result of the increasing frequency of sea surface temperature extremes 3,4 and of sea level rise 5,6, which are likely to compound the effects of deoxygenation 7 and acidification 8,9, leading to an increased risk of ‘perfect storm’ or compound-event conditions 10. The ability to forecast future oceanographic conditions is thus crucial for addressing both long-term trends and short-lived extreme events, and for protecting marine and coastal environments and the ecosystems and people that depend on them 11. This work contributes to that effort with a specific focus on forecasting sea surface temperature (SST), which in many regions exhibits both a long-term trend and increased variability in the form of extreme events known as marine heatwaves 3. We focus on the seasonal time scale, as this is the horizon most useful to many marine decision makers 12,13.
There is considerable interest in SST forecasts, as temperature is readily observed from space and has been shown to influence the distribution and abundance of many marine species 11,12. Such forecasts have been attempted with several methods, including mechanistic modelling, statistical modelling, expert judgement and coupled dynamical models. In the last few years, machine learning (ML) techniques have been added to the forecasting toolkit 14–16. In a significant contribution to the literature 17, a convolutional neural network (CNN) trained on an ensemble of CMIP numerical models (see Methods) and on Simple Ocean Data Assimilation (SODA) 18 ocean reanalysis data was shown to outperform individual numerical models in forecasting the ENSO3.4 index at lead times of up to 18 months. This demonstrates that the output of numerical models can be used effectively, in addition to observations, to train an ML model. Following 19, we refer to this ML approach as H19.
One weakness of the H19 method is that a different ML model is needed for each combination of target forecast date and lead time, and each ML model is trained individually. As a result, when the ML models are used in forecasting mode, forecasts (or hindcasts) of consecutive months are produced by independent models and can thus display undesirable high-frequency fluctuations. To address this limitation, in 19 the authors train a single ML model on longer time series. Again following 19, we refer to this latter ML approach as H21.
Here we propose an alternative approach (summarised in Fig. 1) in which hindcasts produced by ML models with different lead times, as well as information about persistence and seasonal periodicity, are combined. We use the same ML architecture as in H19. However, unlike H19, we train our CNN with CMIP models only (see below for which sets of CMIP models have been used in different experiments) and do not use SODA reanalysis data for training. As in H19, this training provides a set of ML models, one for each target forecast date and lead time. In addition, from the output of the CMIP models, we extract information about persistence (system inertia) and seasonal periodicity (see step 1 ‘Skill Assessment’ in Fig. 1). Next, for each target forecast date, we assess the prediction skills of i) the ML models with different leads, ii) the persistence for different leads and iii) the contribution of periodicity (see step 2 ‘Skill Assessment’ in Fig. 1). These values are then combined (see step 3 ‘Skill-based Weights’) to generate weights proportional to the skill of each method. These weights reflect how well the different forecast methods performed on the training data set. We then use the same architecture in forecasting mode. First, we feed the data from the GODAS reanalysis 20 to the trained ML models and generate a set of SST forecasts for different target dates and leads. We also compute the persistence for different leads and the contribution of periodicity (‘Forecasts’ in Fig. 1). Next, we use the skill-based weights previously computed on the CMIP models at step 3 to combine the forecasts and generate the final forecast for each target forecast date (‘Skill-Weighed Forecast’ in Fig. 1). Hereafter, we refer to this approach as Skill-Weighed Forecast (SWF).
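The forecasting-mode combination described above can be sketched as follows. This is a minimal illustration, not the authors' code: the function name, the argument layout and the assumption that the pre-computed weights sum to one are ours, and how periodicity enters the combination is simplified to a single weighted term.

```python
import numpy as np

def swf_forecast(ml_forecasts, persistence, periodicity, weights):
    """Combine per-lead ML and persistence forecasts with the seasonal
    contribution, using pre-computed skill-based weights (step 3).

    ml_forecasts : ML forecasts for one target date, one value per lead
    persistence  : persistence forecasts for the same date, one per lead
    periodicity  : contribution of the seasonal cycle (scalar)
    weights      : dict with per-lead arrays 'ml' and 'p' and scalar
                   'seas'; assumed (here) to sum to 1
    """
    # Weighted sum over all leads plus the seasonal term: each component
    # contributes in proportion to its skill on the CMIP training output.
    return (np.dot(weights["ml"], ml_forecasts)
            + np.dot(weights["p"], persistence)
            + weights["seas"] * periodicity)
```

With identical component forecasts and weights summing to one, the combined forecast reproduces that common value, which is a quick sanity check on any weighting scheme of this form.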
The skill assessment and generation of the skill-based weights are described in detail in the Methods (Section 4.3) and summarised in Fig. 2. For a given target date, and for different lead times, the green and the red lines show the mean squared error (mse) of the ML forecasts (EML) and of the persistence (Ep), respectively. The dashed line shows the mse of the ‘null’ forecast of zero anomaly at each time step (EN, see Section 4.3). We define the skill of an ML forecast as EN-EML, shown as SML in Fig. 2. Similarly, Sp shows the skill of the persistence forecast. To generate a final forecast for the target date, the ML and persistence forecasts are weighted proportionally to their respective skills (see Section 4.3, Eq. 2). Finally, the difference Sp-SML gives the predictability gain of forecasting via ML compared to persistence.
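One plausible reading of this skill-proportional weighting (the exact form is given by Eq. 2 in the Methods, which we do not reproduce here) is sketched below. The clipping of negative skills to zero and the equal-weight fallback are our assumptions, not taken from the source.

```python
def skill_weights(e_ml, e_p, e_n):
    """Skill-proportional weights for the ML and persistence forecasts.

    e_ml, e_p : mse of the ML and persistence forecasts at a given lead
    e_n       : mse of the 'null' (zero-anomaly) forecast
    """
    # Skill = improvement over the null forecast; clipped at zero so a
    # method that is worse than the null forecast receives no weight
    # (our assumption).
    s_ml = max(e_n - e_ml, 0.0)
    s_p = max(e_n - e_p, 0.0)
    total = s_ml + s_p
    if total == 0.0:
        # Neither method beats the null forecast: fall back to equal
        # weights (also our assumption).
        return 0.5, 0.5
    return s_ml / total, s_p / total

def combine(f_ml, f_p, e_ml, e_p, e_n):
    """Skill-weighted combination of an ML and a persistence forecast."""
    w_ml, w_p = skill_weights(e_ml, e_p, e_n)
    return w_ml * f_ml + w_p * f_p
```

For example, with EN = 1.0, EML = 0.2 and Ep = 0.4, the skills are 0.8 and 0.6, so the ML forecast receives weight 4/7 and persistence 3/7.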
We define the cumulative prediction skill CPS as the sum of a method’s predictive skill over all lead times. The cumulative prediction skill gives an indication of the method’s overall prediction ability. For a completely unpredictable signal, we could do no better than using the null forecast at each lead time, and the cumulative prediction skill would then be zero. For a deterministic, perfectly predictable signal, we would have mse = 0 for each lead time (see Methods). The red and green areas in Fig. 3a and b show the cumulative prediction skill of persistence and ML, respectively. The difference between these two areas represents the cumulative gain of predicting via ML vs persistence (Fig. 3c).
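In code, the cumulative prediction skill and the cumulative gain reduce to simple sums over lead times; the sketch below assumes the per-lead mse values are supplied as equal-length sequences (function names are ours).

```python
def cumulative_prediction_skill(e_method, e_null):
    """CPS = sum over leads of (EN(lead) - E_method(lead)).

    e_method : per-lead mse of the forecast method
    e_null   : per-lead mse of the null (zero-anomaly) forecast
    """
    return sum(en - em for em, en in zip(e_method, e_null))

def cumulative_gain(e_ml, e_p, e_null):
    """Cumulative gain of ML over persistence (the area difference
    illustrated in Fig. 3c): CPS_ML - CPS_p."""
    return (cumulative_prediction_skill(e_ml, e_null)
            - cumulative_prediction_skill(e_p, e_null))
```

For a completely unpredictable signal, e_method equals e_null at every lead and the CPS is zero; for a perfectly predictable one, e_method is zero at every lead and the CPS reaches its maximum, the sum of the null-forecast errors.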