<<

https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

A Bayesian network approach to modelling rip and shore-break wave injuries Elias de Korte1,2, Bruno Castelle3,4, and Eric Tellier5,6,7 1Utrecht University, Faculty of Geosciences, Utrecht, The Netherlands 2TU Delft, Delft, The Netherlands 3CNRS, UMR EPOC, Univ. , , 4Univ. Bordeaux, UMR EPOC, CNRS, Pessac, France 5INSERM, ISPED, Centre INSERM U1219 Bordeaux Population Health Research, Univ. Bordeaux, Bordeaux, France 6ISPED, Centre INSERM U1219 Bordeaux Population Health Research, Univ. Bordeaux, Bordeaux, France 7Pôle Urgences Adultes, CHU de Bordeaux, SAMU-SMUR, Bordeaux, France Correspondence: Bruno Castelle ([email protected])

Abstract. A Bayesian network (BN) approach is used to model and predict shore-break related injuries and rip-current drown- ing incidents based on detailed environmental conditions (wave, , weather, beach morphology) on the high-energy coast, southwest France. Six years (2011-2017) of boreal summer (June 15 - September 15) surf zone injuries (SZIs) were analysed, comprising 442 (fatal and non-fatal) drownings caused by rip currents and 715 injuries caused by shore-break waves. 5 Environmental conditions at the time of the SZIs were used to train two separate Bayesian networks (BNs), one for rip cur- rent drownings and the other one for shore-break wave injuries, each one with a hidden and exposure variables. Both BNs were tested for varying complexity using K-fold cross-validation based on multiple performance metrics. Validation (prediction) results slightly improve predictions of SZIs with a poor to fair skill based on a combination of different metrics. Shore-break related injuries appear more predictable than incidents as the shore-break BN systematically 10 performed better than the rip current BN. Sensitivity and scenario analyses were performed to address the influence of envi- ronmental data variables and their interactions on exposure, hazard and resulting life risk. Most of our findings are in line with earlier SZI and physical hazard-based work, that is, that more SZIs are observed for warm sunny days with light winds, long- period waves, with specifically more shore-break related injuries at high tide and for steep beach profiles, and more rip current drownings near low tide with near shore-normal wave incidence and strongly alongshore non-uniform surf zone morphology. 15 The BNs also provided fresh insight, showing that rip current drowning risk is approximately equally distributed between exposure (variance reduction V r = 14.4%) and hazard (V r = 17.4%), while exposure of water user to shore-break waves is much more important (V r = 23.5%) than the hazard (V r = 10.9%). Large surf is found to decrease beachgoer exposure to shore-break hazard, while this is not observed for rip currents. Rapid change in tide elevation during days with large tidal range was also found to result in more drowning incidents, presumably because it favors the rapid onset of rip current activity and 20 can therefore surprise unsuspecting bathers. We advocate that such BNs, providing a better understanding of hazard, exposure and life risk, can be developed to improve public safety awareness campaigns, in parallel with the development of more skillful risk predictors to anticipate high life-risk days.

1 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

1 Introduction

Wave-dominated beaches offer a playground for a variety of activities, but at the same time they pose a threat to water users. 25 There are two primary causes of surf zone injuries (SZIs), which can sometimes co-exit at the same beach (Castelle et al., 2018): i) rip currents resulting in drowning incidents and ii) shore-break waves which can result in e.g. spine and shoulder dislocations. Rip currents are intense seaward-flowing narrow currents which can form through different driving mechanisms related to breaking waves (Dalrymple et al., 2011; Castelle et al., 2018). They form close to the shoreline and often extend beyond the surf zone. Therefore they can transport unsuspecting bathers offshore, who potentially and drown (Drozdzewski et al., 30 2012; Brighton et al., 2013). The shore-break wave hazard has received little attention in the literature compared to rip current hazard. However, shore-break waves can cause a large amount of injuries (Puleo et al., 2016), including severe spine injuries (Robbles, 2006). At certain beaches, shore-break waves can even be the primary cause of SZIs, e.g. up to 88% at Ocean City, Maryland (Muller, 2018). Rip flow speed, which is a proxy of rip current hazard, has been addressed on many beaches through both field measurements 35 and numerical modelling (see Castelle et al., 2016, for a review). In brief, rip flow speed generally increases with increasing wave height and period (e.g. MacMahan et al., 2006), more shore-normal incidence (e.g. MacMahan et al., 2005), generally lower tide levels (e.g. Brander and Short, 2001; Austin et al., 2010; Bruneau et al., 2011; Houser et al., 2013) and more alongshore-variable surf zone morphology (Moulton et al., 2017). It is also well known that shore-break waves are associated with steep beaches and longer period waves (Battjes, 1974; Balsillie, 1985). In addition, the number of SZIs is also greatly 40 influenced by the number of beachgoers exposing themselves to surf zone . Given that warm sunny days with low winds typically result in increased beach attendance (Ibarra, 2011; Dwight et al., 2007), it is expected that during such days the life risk, and thus the number of SZIs, are increased. Prominent environmental controls on SZIs were identified by comparing the frequency distribution of an environmental

variable (e.g. significant wave height Hs, tide elevation η) during an injury, with the background frequency distribution of that 45 variable (Scott et al., 2014; Castelle et al., 2019). The difference between two frequency distributions shows the disproportion- ate amount of conditions that are associated with SZIs. At two different beaches along the Atlantic coast of Europe, Scott et al. (2014) and Castelle et al. (2019) showed that the number of drowning incidents occur during warm sunny days with light wind, maximizing beach attendance, and shore-normally incident long-period waves, maximizing rip current activity. Although such analysis provides an indication of the prominent environmental controls, it does not uncover the interplay between variables, 50 and the relative magnitude of each variable. A related challenge based on current research is filtering the effect of how water

users choices are influenced by environmental conditions (e.g. wave height Hs). For instance, it is expected that high surf and heavy shore-break waves discourage an amount of the beachgoers from entering the water, even on warm sunny days, resulting in less exposure. Finally, the respective contributions of hazard and exposure to the overall life risk for shore-break waves and rip current are virtually unknown. 55 Prediction of SZIs together with a better understanding of the interplay between weather and marine conditions and effect on life risk at the beach, could help a better anticipation of high risk, mass rescue days and further improve public safety

2 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

awareness campaigns on surf zone hazards. This requires a high order statistical approach like a Bayesian Network (BN). BNs are probabilistic graphical models that are based on a joint probability distribution of a set of variables with a possible mutual causal relationship. BNs have been previously successfully used in coastal science, estimating morphological changes, changes 60 in wave parameters in the surf-zone and coastal flood risks (Gutierrez et al., 2011; Plant and Holland, 2011; Fienen et al., 2013; Pearson et al., 2017). Stokes et al. (2017) compared a BN to a multiple linear regression approach to model exposure, hazard and, in turn, life risk to beach users at 113 lifeguarded beaches in UK. Even though the multiple linear regression method moderately outperformed the BN, Stokes et al. (2017) acknowledged the benefits of a BN approach to identify of the characteristics of high risk beaches from a large data set. More recently, Doelp et al. (2019) used a BN to predict SZIs on the 65 Delaware Coast, which are primarily caused by shore-break waves (Puleo et al., 2016). They showed that a BN approach can improve predictions 69.7 % of the time, but also acknowledged limitations in predicting anomalous injuries. A BN approach has the potential both to show good prediction skill to assist decision-making and to provide a better understanding of rip current and shore-break hazards. In this paper, a data set (2011-2017) of 442 drowning injuries (fatal and non-fatal) and 715 shore-break injuries occurring in 70 boreal summer (June 15 - September 15) and corresponding environmental conditions along the Gironde coast in south-west France are used to create BNs for rip current related drownings and shore-break injuries. The study area and SZI dataset are described in Section 2. Section 3 presents the development of the BNs and the method used to train them and address their performance. Results are shown in Section 4 and are further discussed in Section 5 before conclusions are drawn.

2 Environmental and SZI dataset along the Gironde Coast

75 2.1 Study area

The Gironde coast is located in southwest of France and stretches approximately 140 km from the La Salie Beach (La Teste) in the south to the Gironde Estuary in the north, and is interrupted by the tidal inlet (Figure 1a). It is a meso-macrotidal environment with spring tidal range reaching 5 m. Wave conditions vary seasonally with a 99.5% exceedance significant wave

height Hs of 5.6 m, and occasional severe storms with Hs> 8 m. Summers are associated with smaller waves with a mean Hs 80 of around 1.2 m and a dominant W-NW incidence (Castelle et al., 2019). The coast is composed of high-energy sandy beaches backed by high and wide coastal dunes. Beaches are intermediate double barred, with deep and more or less regular rip channels incising the intertidal inner bar with an average spacing of approximately 400 m (Figure 1a and b). Intense rip currents can flow through the rip channels, with rip flow intensity potentially

exceeding 1 m/s even for low-energy (Hs < 1 m) long-period waves (Castelle et al., 2016). Rip current flow is strongly 85 modulated by the tide level, with maximum rip current activity occurring between low and mid tide in typical summer wave conditions (Bruneau et al., 2011). In winter, more energetic wave conditions drive a more dissipative gently sloping beach face. In contrast, the upper part of the beach is steeper in summer due to smaller waves. Beach slope and rip channel morphology also show a large interannual variability enforced by large interannual variability in the wave climate (Dodet et al., 2019). Overall, beach states are similar along the coast but with increasingly steep beach face, deeper and more spaced rip channels

3 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

90 southwards. Noteworthy, beach morphology dramatically changes along sectors adjacent to the Arcachon lagoon and Gironde estuary where rip current activity decreases, but tide-driven currents become substantial (> 0.2 m/s during ebb and flood). The Gironde coast is known for a large population of tourists visiting the beaches, which results in large amounts of injuries sustained by beachgoers and surfers of all levels (Figure 1c) (Castelle et al., 2018; Tellier et al., 2019). Beaches are patrolled by lifeguards during the summer months of July and August. Patrolled periods are extended approximately from the 15th of 95 June to the 15th of September at the busiest beaches. A designated and supervised bathing zone is delimited by two blue flags. However, many remote beach access paths through coastal dune tracks and many access points are situated on unpatrolled sections of beaches, kilometres away from any lifeguard presence (Castelle et al., 2019).

Soulac Gironde 45.5° (a) estuary (b) (c) Grayan l’hôpital Naujac 45.25°

Carcans 20 km 45°

Latitude Le PorgeArcachon Truc Vert lagoon 44.75° Tide gauge Wave Cap Ferret La Teste 44.5° buoy

44.25° -1.25° -0.75° -0.25° Longitude

Figure 1. (a) Location map of the Gironde Coast, southwest France. Black circles indicate municipalities where injuries were reported. Locations of Truc Vert beach and wave and tide data used in this study are also indicated; (b) aerial photograph of Truc Vert beach at low tide, exposing rip channels (Ph. V. Marieu); (c) crowded Lacanau beach at summer during a high tide (Ph. J. Lestage).

2.1.1 SZI data

The SZI dataset used herein is detailed in Tellier et al. (in revision, i). In short, SZIs were recorded by the medical emergency 100 call center SAMU (Service d’Aide Médicale d’Urgence) of Bordeaux for the Gironde department. Calls from beachgoers and lifeguards dealing with drowning or rescues received between January 2011 and November 2017 were used here. Excluding training calls, duplicates and calls lacking victims, a total of 5022 injuries were collected. Table 2 shows that the discrepancy between the total number of injuries and the combined shore-break and rip current related SZIs is due to the large number of calls with insufficient information collected. Noteworthy, the 916 surfing-related injuries (Table 1) occurring during this period 105 were also disregarded for analysis. The reason for this being that a large number of surfer injuries involve collision with other surfers and are likely influenced by other factors (e.g. surf break quality, surf school activity) which are not related to physical hazards.

4 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

A SZI was classified as shore-break when the medical file stated explicitly: "shore-break" or a French equivalent. Given that along this coast approximately 80% of the drowning incidents involving bathers are caused by rip currents (Castelle et al., 110 2018), rip-related SZIs (drownings) were determined if a drowning stage (according to standardized medical classification) was reported in the medical file, with two notable exceptions. Drownings that were related to shore-break waves were classified as shore-break, because they were presumably not associated with rip currents. Similarly, surfing related drownings were excluded as there is no evidence that most of the drownings of surfers are related to rip currents. Six mutually exclusive classes were found based on activity (see Table 2). Even though the activity was unknown for 1943 of the SZIs, drowning stage and the 115 shore-break classifier provided the information to classify some of these SZIs as shore-break or rip related drowning. Amongst the population, 45% was male, 33% female and for 21% the sex was not recorded. The population is relatively young with 43% between 6 and 19 years old. A slightly elevated number of SZIs was found for the age group between 36-45 years old (see Figure 2). This demographics is in line with another, shorter, dataset described in Castelle et al. (2018). By far most SZIs occurred at Lacanau beaches (26%), which is one of the most popular beaches that consequently attracts crowds in summer 120 (see Figure 2).

Table 1. Activity distribution SZIs as indicated on the medical Table 2. Post-processed categories to distinguish between rip files between 08-Jan-2011 and 18-Nov-2017 related drownings and shore-break injuries

Activity nb SZIs Class nb SZIs

swimming 1229 swimming (non-drowning) 282 surfing 827 rip related drowning 575 body-boarding 89 shore-break 750 beach-related 898 surfing/body-boarding 916 skim-boarding 36 beach-related 934 unknown activity 1943 unknown 1565 total 5022 total 5022

For the purpose of this study, only summer periods between June 15 and September 15 were taken for each year because outside of this period SZIs become extremely rare events, which poses challenges for BN training. In the summer periods 442 drowning SZIs and 715 shore-break SZIs where found. This is the final population that was used in the Bayesian network.

2.2 Environmental data

125 Environmental conditions were estimated at the time of each SZI by using a data set comprising tide, wave and weather data. The dataset is described in detail in Castelle et al. (2019). Hourly weather data was collected at the Météo-France weather sta- tion Cap Ferret (Figure 1a) from the RADOME (Réseau d’Acquisition de Données d’Observations Météorologiques Etendu). A tidal component analysis of a 3-month time series of continuous, storm-free, Eyrac tide gauge data (Fig. 1) was performed to reconstruct a tide level time series at 10-min interval. The average phase lag between the Eyrac tide gauge and beaches of

5 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

Age 0.3 (a) 0.2 % 0.1

0 0-5 6-12 13-19 20-25 26-35 36-45 46-65 >65 unknown

Location 0.3 (b) c) 0.2 % 0.1

0 Lège Hourtin Soulac Naujac La Teste Montalivet Le Porge Lacanau Grayan lHopital

Figure 2. Distribution of (a) age and (b) location of SZIs between 08-Jan-2011 and 18-Nov 2017. Beaches are ordered from North (left) to South (right)

130 the study area was estimated using tide charts from the Service Hydrographique et Océanographique de la Marine (France), resulting in a maximum tide elevation error of 0.3 m at all sites. A wave model hindcast was used to provide continuous wave conditions at the coast. The WaveWatch 3 (Tolman, 2014) hindcast was performed on an unstructured grid with a resolution increasing from 10 km offshore to 200 m near the coast (Boudière et al., 2013). Wave conditions were extracted at an in situ directional wave buoy location c. 10 km offshore of Truc Vert at ca. 50-m depth, and have been extensively validated with 135 field data (e.g. Castelle et al., 2020). The primary metocean variables used are: tidal elevation (η), significant wave height

(Hs), mean wave period (T02), wave direction (θ), (T ), wind speed (U), Insolation (I). From tidal elevation, tidal range (TR) and tidal gradient (dη) where derived. Maximum, minimum, mean and standard deviation summer statistics are summarized in Table 3. Previous work along this stretch of coast showed, qualitatively, the importance of beach upper beach slope and rip channel 140 development on shore-break related injuries and drowning incidents, respectively (Castelle et al., 2019). To further quantita- tively address this link with the longer dataset used herein, we used monthly to bimonthly topographic surveys performed at Truc Vert beach since 2003 (the reader is referred to Castelle et al., 2020, for a detailed description of this beach monitoring program). This dataset was used to derive two morphological metrics. First, the inverse foreshore slope (IFS) was calculated

6 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

Table 3. Statistics of environmental conditions of SZI records during summers between June 15 - September 15

Environmental variable maximum minimum mean standard deviation

Significant wave height Hs (m) 3.13 0.23 1.17 0.38

Mean wave period T02 (s) 11.64 2.66 6.19 1.94

Wave direction θ (◦) 340.10 247.30 291.67 7.82 Tidal elevation η (m) 2.21 -2.26 -0.03 1.09 Tidal range TR (m) 4.52 1.64 3.12 0.69 1 Tidal gradient dη (m H− ) 0.51 -0.51 0.05 0.26

Temperature T (C◦) 36.23 15.47 25.07 3.36 1 Insolation I (min H− ) 60 0 48.17 14.77 1 Wind speed U (m s− ) 12.65 1.2 4.56 1.26

as: 1 145 IFS = (1) tan(β)

where tan(β) is the beach slope between 1 m and 3 m above mean sea level (amsl) from a linear regression. To filter out extreme alongshore variations in IFS, the slope was averaged over four cross-shore transects that were systematically surveyed during the monitoring program (Figure ). Sinuosity (S) of the mean sea level iso-contour line was used to provide a measure rip channel development. It was defined 150 as: L S = t (2) Ls

where Lt is the true length and Ls is the shortest Euclidean distance between the first and last point of the contour line (Figure 4). A value larger than 1 indicates a high degree of sinuosity, whereas values close to 1 indicate a low degree of sinuosity. Before calculating the sinuosity of the shoreline, a high pass filter was used to remove sinuous signals larger than 400 m. This 155 was done to filter out larger-scale undulating patterns that are not enforced by the inner-bar rip channels (Castelle et al., 2015). The metocean and topographic data collected at Cap Ferret or near Truc Vert are both located approximately in the centre of the study area. This data was assumed to be representative of wave and weather conditions along the entire study area. When constructing a BN (see next section), probabilities of a SZI occurrence must be compared to a probability of non-SZI. Therefore, a discretization in time is needed. A 1-hour time window was chosen to count the number of SZIs. To avoid a 160 spurious distribution of non-SZIs, only daily hours between 7h and 21h were used.

7 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

elevation a.m.s.l. (m) 240 (a) transects 6 0 200 0 0

0 0 0 160 0 4

120 2 cross-shore (m) 80 0 40 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 1600 alongshore (m)

50 40 (b) summers 30 20

IFS (slope) 10 0 2012-01 2013-01 2014-01 2015-01 2016-01 2017-01 date 150 (c) summers 100

50

0 2012-01 2013-01 2014-01 2015-01 2016-01 2017-01 date Nb Shore-break related SZIs

Figure 3. (a) Example of a digital elevation model of Truc Vert beach on 29-Apr-2013, with the colorbar showing elevation amsl. The four cross-shore transects used to compute the inverse beach slope IFS between 1 m and 3 m amsl are indicated by the dashed black lines. (b) Time series of IFS and (c) summer shore-break related injuries.

8 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

elevation a.m.s.l. (m) 240 (a) Ls (shortest Eucledian) Lt (MSL-contour) 6 0 200 0 0

0 0 0 160 0 4

120 2 cross-shore (m) 80 0 40 400 500 600 700 800 900 1000 1100 1200 1300 1400 1500 1600 alongshore (m)

1.4 (b) summers

1.2 Sin (Lt/Ls) 1 2012-01 2013-01 2014-01 2015-01 2016-01 2017-01 date 100 (c) summers

50

0 Nb Rip-related SZIs 2012-01 2013-01 2014-01 2015-01 2016-01 2017-01 date

Figure 4. (a) Example of a digital elevation model of Truc Vert beach on 29-Apr-2013, with the colorbar showing elevation amsl. The red line indicates the shortest Euclidean distance. Time series of (b) sinuosity S and (c) number of summer rip-current related drownings.

3 Bayesian networks

3.1 Bayes’ Theorem and BN structure

Bayesian networks (BN) are based on Bayes Theorem (Korb and Nicholson, 2010). This theorem states that probabilities of a certain event can be updated, given new evidence and can be stated as (Bayes, 1763): 9

P (e h)P (h) 165 P (h e) = | (3) | P (e) https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

where P (h e) is the probability for a hypothesis h, given the evidence e. In Eq. (3), P (e h) resembles the likelihood and | | P (h) corresponds to the prior probability of h before any evidence was given. Dividing the numerator by P (e) is a means of normalizing, so that conditional probabilities sum to 1. For example P (h e) could be the probability of a rip-current hazard, | given that the tide was low. 170 A BN is a graphical representation of the probabilistic relations between a set of variables, using Bayes’ Theorem to describe the relation between variables (Korb and Nicholson, 2010). The links between nodes represent the direct dependency between variables (or nodes). A constraint on linking variables is that links cannot return to the beginning node, completing the cycle. Therefore the graphical representation of a BN is often referred to as a directed acyclic graph. If there is an arc from variable A to B, variable A is termed the parent variable and variable B the child variable. The relation between variables is often 175 assumed to be causal, but is not necessarily the case. Once the structure is established, relations between variables are quantified according to conditional probability tables (CPTs), in the case of discrete variables. The probability of a value for a child variable is calculated for each possible value that the parent variable can take. Given that this is done for all variable nodes in the BN, two types of probabilistic reasoning become possible. Firstly, predictive reasoning, where a value is specified for each input variable. This results in a predicted probability for a target variable. Secondly, diagnostic reasoning is the other type for 180 which, for example, given a SZI the BN can specify the probability that it was low tide.

3.2 of the rip-current and shore-break related BNs

Constructing a BN requires a trade-off between complexity and predictability. This is determined by the amount of variables chosen, the way variables are discretized and how the variables are linked. In general a simpler model is preferred over a complex model with the same performance, according to the principle of Ockham’s razor (Jefferys and Berger, 1992). In this 185 section we describe the choices that were made related to this trade-off, using the BN software package Netica v. 6.05 (Norsys, 1998). Based on earlier work on the environmental controls on SZIs in southwest France (Castelle et al., 2019) and some preliminary

BN tests, the rip-current BN is made of: i) a hazard component that depends on hydrodynamic forcing parameters Hs, T02, θ, η and dη and a morphological component S and ii) an exposure component that depends on the hour of the day H, temperature T 190 and insolation I (see Figure 5). The shore-break BN has a similar set-up, but the shoreline sinuosity is replaced by the inverse foreshore slope (IFS), and the tidal gradient (TG) is replaced by tidal range (TR). Such structure was motivated by the fact that sometimes a simpler and computationally less expensive network can be obtained by adding so-called hidden or latent variables that limit the amount of links between variables or the amount of variables to include in the network (Russel and Norvig, 2010, p.817). In this case, we used two hidden exposure and hazard variables which are known to control life risk and 195 the amount of SZIs (Stokes et al., 2017). In order to compare the probability of an injury with the probability of a non-injury, injuries were counted per hour for all summers. Consequently, the variable injury count was discretized as a binary variable with two possibilities: no injury or an injury. Where the amount of injuries per hour exceeded 1, the cases were duplicated proportionally. Often hidden learned variables tend to be discretized with a small amount of bins, as they do not have any prior information available. After testing

10 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

(a) Rip current BN (b) Shore-break BN TG(m/h) hidden variables (learned) TR(m) hidden variables (learned) input variables input variables output variable output variable

η (m) U (m/s) η (m) U (m/s)

Hs (m) Hour (h) Hs (m) Hour (h)

T (s) T02 (s) T (C°) 02 T (C°) Shore- Rip break Exposure Hazard Exposure Hazard θ (°) θ (°) I(min/H) I(min/H)

Injury Injury count count Sinuosity IFS

Figure 5. Bayesian Networks for (a) Rip current related injuries and (b) Shore-break injuries. Both BN are defined by input variables, hidden variables, an output variable and their linkages.

200 both BNs, 3 dummy bins were chosen for the exposure variable and 2 dummy bins for the hazard variable. The amount of bins chosen for the input variables determine the performance of the model to a larger extent. Therefore, different discretization options were tested, keeping an equal bin width. This is shown in Section 3.4.

3.3 Bayesian network training

The probability tables of the hidden variables were calculated by using an algorithm that calculates the most likely distribution 205 of the data, given the probability distributions of the other variables. The expectation maximization (EM) algorithm is widely used in BNs to determine the most likely model given the data (Russel and Norvig, 2010). In a similar manner the algorithm finds the most likely value for occasionally missing data. The algorithm proceeds in the direction of the steepest gradient to find the minimum negative log likelihood for a model, given the data. The amount of bins used to discretize variables in the BN determines how well data is described. The larger the number of bins, the better it describes the data until the point where 210 there might be a bin for each value. On the other hand, a larger number of bins degrades the prediction skill of the BN, as it becomes harder to predict the correct bin. A BN might be trained slightly differently from one run to another, because it is a probabilistic process. Therefore, we used K-fold cross-validation to eliminate any bias that single model runs might hold. After Fienen and Plant (2015); Gutierrez et al. (2015); Pearson et al. (2017) the cases were separated in k random partitions,

11 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

where n n/k cases were used for training and calibration, and n/k cases were left out for testing/validation. We used k = 10 − 215 so that test cases make up 10 % of the total data set. After 10 folds, mean values of performance metrics were taken to evaluate the performance of the BN.

3.4 BN performance metrics

Different performance metrics were used to address BN predictive performance. Here we used three relevant metrics: skill sk, log likelihood ratio LLR, area under ROC (receiver operating characteristic) AUC. 220 Skill was adapted from Fienen and Plant (2015) and was computed as:

2 σe sk = [1 2 ] 100% (4) − σo ×

where σe is the mean squared error between observations and the BN forecast and σo is the variance of observations. The skill metric in Eq. (4) expresses how close predictions of an injury match with observations of an injury, with sk = 1 meaning perfect prediction. 225 Because skill is not an optimal measure for binary output variables, the AUC (area under ROC curve) was chosen as a complementary metric (Marcot, 2012). AUC is based on the ratio between the true positive predictions of the BN and the false positive predictions. Figure 6a shows the sensitivity on the y-axis (true positive rate) and the specificity on the x-axis (false positive rate) of a typical ROC curve from one of the model runs. If the dashed random classification line is equal to the ROC curve, this indicates that the model is not able to distinguish an injury from a non-injury. This corresponds to AUC = 0.5. 230 Figure 6b shows the confusion matrix on which the sensitivity and specificity is based.

(a) Shore-break ROC-curve, AUC = 0.76 (b) Confusion matrix 100

80 prediction

60 injurynon-injury non-injury 879879 13 40 observation sensitivity injury 52 4

20 shore-break injuries random classification line 0 0 20 40 60 80 100 1-specificity

Figure 6. (a) Receiver operating characteristic curve (ROC) of the shore-break BN of one of the fold runs; (b) Confusion matrix with false positives in orange and true positives in green

12 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

The third metric is the log likelihood ratio (LLR), adapted from Plant and Holland (2011) and Fienen et al. (2013). The LLR compares the prior probability of an injury with the posterior probability of a prediction, given the evidence (the input variables), which reads:

LLR = log10(p(Fi Oj)F =O ) log10(p(Fi)F =O ) (5) | i j0 − i j0

235 where Fi is a forecast, in this case of a SZI, Oj is an independent observation that was withheld from the forecast (e.g. a tidal elevation of -2.0 m). The LLR expresses the change in likelihood due to certain evidence in the form of observations. A LLR that exceeds zero indicates that the BN offers a better forecast than the prior probability. A LLR that is lower than zero, indicates that the prior probability is a better forecast than the BN forecast. The LLR can be calculated for each predicted case and each variable and can then be summed over the entire BN ( LLR). The LLR penalizes wrong but confident predictions 240 more than wrong predictions that are uncertain (Plant and Holland,P 2011; Pearson et al., 2017). Therefore, it is a suitable metric to verify whether the BN is over fitting or not. Finally, in order to address how each input variable influences the target variable (SZI), the percentage of variance reduction V r that was caused by updating the BN based on the evidence was computed as: V (F ) V (F O) V r = − | 100% (6) V (F ) ×

245 where V (F ) is the variance of a forecast prior to any evidence, and V (F O) is the variance of the forecast, given the new | evidence. V (F ) and V (F O) are calculated as: | N V (F ) = p(f )(f E(f ))2 (7) j j − j j=1 X

N N V (F ) = p(f o )(f E(f o ))2 (8) j| i j − j| i i=1 j=1 XX

250 where p(fj) is the prior probability of the jth forecast, fj is the current value of the jth forecast, E(fj) is the expected value predicted by the BN of the jth forecast, p(f o ) is the predicted value of the jth forecast given the ith evidence case, E(f o ) j| i j| i is the predicted value of the jth forecast given the ith evidence case, M represents the number of evidence data and N the number of predictions.

4 Results

255 4.1 BN performance

To find the best BNs, a varying number of bins was tested to evaluate the trade-off between calibration and validation. For calibration, the BN was used to make predictions of an injury based on the input variables of the training cases. For validation, predictions of an injury were made based on the input variables of the 10 % left-out cases. Generally an increase in complexity

13 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

to a decrease in predictive capability and vice versa (Fienen and Plant, 2015; Fienen et al., 2013). Figure 7 and 8 show 260 performance metrics for both the shore-break and rip-current BNs, respectively, as a function of the number of bins for the input variables.

Skill AUC Σ LLR 1 250 Calibration Calibration Validation Validation 0.6 0.9 200 150 0.8 0.4 100 Skill LLR AUC 0.7 50 0.2 0.6 Calibration 0 (a) Validation (b) (c) 0 0.5 -50 3 4 5 6 7 3 4 5 6 7 3 4 5 6 7 nb bins nb bins nb bins

Figure 7. Performance metrics of shore-break BN as a function of the number of bins of the input variables and for validation and calibration : (a) skill sk, (b) area under ROC curve AUC and (c) the summed log likelihood ratio PLLR.

Skill AUC Σ LLR 1 250 Calibration Calibration Validation Validation 0.6 0.9 200 150 0.8

0.4 100

Skill LLR AUC 0.7 50 0.2 0.6 Calibration 0 (a) Validation (b) (c) 0 0.5 -50 3 4 5 6 7 3 4 5 6 7 3 4 5 6 7 nb bins nb bins nb bins

Figure 8. Performance metrics of rip-current BN as a function of the number of bins of the input variables and for validation and calibration : (a) skill sk, (b) area under ROC curve AUC and (c) the summed log likelihood ratio PLLR.

It can be observed that the shore-break BN performs slightly better than the rip-current BN, as all performance metrics score better. Calibration results of the BNs are fair with sk and AUC ranging 0.15-0.43 and 0.89-0.98, respectively. Validation sk is smaller and ranges 0.078-0.12 and 0.035-0.06 for the shore-break and rip-currents BNs, respectively. Validation AUC 265 are better, ranging 0.71-0.8 and 0.63-0.68 for the shore-break and rip-current BNs, respectively. The sum of the LLR is systematically smaller than 0 for the validation of both BNs. This is either an indication that the prior estimate is on average

14 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

better than the prediction of the model, or that there are anomalous cases where the wrong but confident prediction is heavily punished by highly negative LLR values that result in a negative or near zero LLR sum. Table 4 and 5 show that, depending on the number of bins chosen, model predictions are better than the prior probability 270 estimate 62.21% - 79.9% of the time. When 5 bins are chosen for the rip current BN, 79% of the time the model prediction is of added value. When 5 bins are chosen for the shore-break BN, 72% of time the model prediction performs better. This shows that the negative and near zero sums of the LLR displayed in Figure 7 and 8 must be caused by anomalous events (the remaining percentages) that are confidently predicted wrong.

Table 4. The percentage of LLR > 0 for rip current SZIs, in- Table 5. The percentage of LLR > 0 for shore-break SZIs, in- dicating whether the prediction of the model is better than the dicating whether the prediction of the model is better than the prior probability prior probability

Nb bins Prediction rip current SZIs (% LLR>0) Nb bins Prediction shore-break SZIs (% LLR>0)

3 bins 73.86% 3 bins 70.18% 4 bins 77.30% 4 bins 70.08% 5 bins 79.90% 5 bins 72.56% 6 bins 61.60% 6 bins 65.10% 7 bins 62.21% 7 bins 69.50%

The amount of bins was varied from 3 to 7 bins to choose the best trade-off between complexity and accuracy. Only the 275 number of input variable bins were adjusted, keeping the output variables exposure, hazard and injury count the same. In general, an increase of the number of bins leads to a better descriptive capability and a worse predictive capability (Fienen and Plant, 2015). When the number of bins is increased from 3 to 4, a small decrease in sk and AUC can be noticed. However, a further increase in the number of bins does not significantly to worse sk, AUC or LLR. AUC and sk show a small increase at 6 bins for the shore-break BN (Figure 7a,b) and at 6 / 7 bins for the rip BN (FigureP 7). Contrary to what is generally 280 observed, validation sk, AUC and LLR did not drop dramatically when complexity was increased. However, the percentage LLR > 0 did drop for both BNs, whenP increasing the number of bins, which is in line with expectations.

4.1.1 Input variable sensitivity

Figure 9a shows the sensitivity of the shore-break BN to the input variables (6 bins) and to the hidden variables exposure and shore-break hazard. For the shore-break BN, the learned variable exposure (V r =23.5%) has the strongest influence followed 285 by the shore-break hazard variable (V r = 10.9%). This suggests that exposure of water users has a more dominant control on the injury count than the shore-break hazard forcing variables. This is also reflected in Figure 9b, where the hour of the day, temperature and insolation have larger values for V r. These variables are followed by tide elevation which is the most important shore-break hazard control (V r = 0.17%). Consequently, mean wave period (V r = 0.07%), tidal range (V r = 0.06%), significant wave height (V r = 0.05%) and wind speed (V r = 0.039%) follow. Even though beach profiles were taken at only

15 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

290 one location along the coast, the inverse foreshore slope IFS still has a noticeable impact (V r = 0.065%) on the shore-break hazard. Wave direction is least sensitive to the injury count with V r = 0.025%.

Shore-break Shore-break w/o hidden Rip current Rip current w/o hidden

Exp Hour Exp Temp Shorebreak Rip Temp I Hour Temp I Hour Temp I Tide Hs I Hour Tide T02 Hs T02 T02 TR T02 Dir TR Hs Dir Tide Hs Tide U TG U TG IFS Sin IFS Sin Dir U Dir U (a) (b) (c) (d)

0 10 20 30 0 1 2 3 0 5 10 15 20 0 0.5 1 1.5 2 % variance reduction % variance reduction % variance reduction % variance reduction

Figure 9. Variance reduction V r without hidden learned variables for the shore-break BN (7 bins) (a) with hidden learned variables and (b) without learned hidden variables and for the rip-current BN (6 bins) (c) with hidden learned variables and (d) without hidden learned variables

Figure 9c shows the sensitivity of the rip-current BN to the input variable (7 bins) including the hidden variables exposure and rip-current hazard. Rip-current hazard (V r = 17.6%) and exposure (V r = 15.4%) have similar influence, even though parents of exposure (insolation, temperature and hour) are more dominant in Figure 9d. It suggests that it is the combined 295 effects of input variables that cause a rip current (e.g. tide, wave direction and wave height) that have a strong influence on the occurrence of drowning incidents. This is different from what was observed for the shore-break BN. Most sensitive input variables for the rip-current BN are insolation (V r = 1.67%), temperature(V r = 1.7%) and hour of the day (V r = 1.56%). These are followed respectively by the hazard-related variables, tidal elevation, wave direction, significant wave height, tidal

range, wave period and sinuosity of the shoreline. Hs and T02 have the highest V r with 0.36% and 0.26%, suggesting that 300 wave energy is the most important control on rip-current hazard. They are closely followed by wave direction with 0.25%. Shoreline sinuosity S reduces variance by 0.065%.

4.2 Scenario analysis

Apart from the predictive ability of a BN, probabilistic scenario analysis can be a useful tool to understand how multiple variables interact. Figure 10a shows the prior joint probability distribution of the shore-break BN without updating based on 305 any evidence. The two hidden variables, exposure (3 bins) and hazard (2 bins) do not contain any prior information and thus have equal probabilities for each bin. Figure 10b shows the joint probability distribution for a trained shore-break BN updated for the evidence that there was a shore-break SZI. In the latter, the distribution of tidal elevation shifts towards high tide.

16 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

Additionally, there is a shift towards higher mean wave periods (T02) and a slight increase in probabilities of larger wave

heights (Hs). Furthermore, temperature, hour of the day and insolation show a pronounced shift towards higher , 310 less cloud cover and the afternoon between 14h and 16h30. Noteworthy, when the BN was updated for larger wave heights, the probability of a shore-break related injury increased. However, when the BN was updated with the evidence of an injury, intermediate wave height bins 0.75-1.5 m and 1.5-2.25 m

showed increased probability. This supports our hypothesis that large shore-break waves (Hs > 2.5 m) can discourage bathers from entering the water, which results in less SZIs despite that the shore-break hazard is increased.

Tidal Range (m) Wind Speed (m/s) Tidal Range (m) 1 to 1.67 0.74 Wind Speed (m/s) 1.67 to 2.33 15.7 0 to 4.17 27.5 1 to 1.67 0.92 (b) Updated probability shore-break BN 2.33 to 3 28.5 (a) Prior probability shore-break BN 4.17 to 8.33 51.1 1.67 to 2.33 16.0 0 to 4.17 29.5 3 to 3.67 34.7 8.33 to 12.5 6.13 2.33 to 3 26.5 4.17 to 8.33 50.3 3.67 to 4.33 17.4 12.5 to 16.67 15.0 3 to 3.67 34.7 8.33 to 12.5 4.31 4.33 to 5 2.92 16.67 to 20.83 0.18 3.67 to 4.33 18.8 12.5 to 16.67 15.7 3.07 ± 0.73 20.83 to 25 .023 4.33 to 5 3.10 16.67 to 20.83 0.20 20.83 to 25 .032 6.64 ± 4.2 3.09 ± 0.75 6.54 ± 4.3

Tidal Elevation (m) Tidal Elevation (m) -2.3 to -1.53 6.24 Hour (h) -2.3 to -1.53 6.01 -1.53 to -0.77 22.1 Hour (h) -0.77 to 0 19.4 7 to 9.33 19.9 -1.53 to -0.77 19.4 0 to 0.77 21.8 9.33 to 11.67 13.4 -0.77 to 0 15.8 7 to 9.33 5.02 0.77 to 1.53 23.9 11.67 to 14 13.1 0 to 0.77 22.5 9.33 to 11.67 10.4 1.53 to 2.3 6.57 14 to 16.33 20.0 0.77 to 1.53 27.7 11.67 to 14 18.3 0.0372 ± 1.1 16.33 to 18.67 13.5 1.53 to 2.3 8.48 14 to 16.33 44.8 18.67 to 21 20.1 16.33 to 18.67 6.89 0.168 ± 1.1 18.67 to 21 14.7 14.1 ± 4.2 14.8 ± 3.1 WaveHeight (m) 0 to 0.75 14.7 WaveHeight (m) 0.75 to 1.5 61.5 Temperature (deg C) 0 to 0.75 11.9 1.5 to 2.25 19.2 0.75 to 1.5 62.9 12 to 16.5 20.6 Temperature (deg C) 2.25 to 3 3.43 Exposure 1.5 to 2.25 19.7 Shorebreak Hazard 16.5 to 21 26.5 12 to 16.5 12.9 3 to 3.75 0.92 low 33.3 2.25 to 3 4.00 Exposure true 50.0 21 to 25.5 36.2 Shorebreak Hazard 16.5 to 21 7.17 3.75 to 4.5 0.23 high 33.3 3 to 3.75 1.20 low 0 + false 50.0 25.5 to 30 10.8 true 100 21 to 25.5 46.8 medium 33.3 3.75 to 4.5 0.30 high 100 1.24 ± 0.61 false 0 + 25.5 to 30 22.4 0.5 ± 0.5 30 to 34.5 3.58 1 ± 0.82 1.28 ± 0.62 medium 0 + 34.5 to 39 2.29 6.06e-8 ± 0.00025 30 to 34.5 7.25 1 ± 0.00035 34.5 to 39 3.46 21.3 ± 5.4 23.9 ± 5.5 T02 (s) T02 (s) 2 to 3.67 3.39 3.67 to 5.33 40.0 2 to 3.67 2.99 5.33 to 7 39.0 Insolation (m/h) 3.67 to 5.33 36.0 7 to 8.67 14.1 5.33 to 7 41.9 0 to 10 44.3 Insolation (m/h) 8.67 to 10.33 3.00 7 to 8.67 15.6 10 to 20 4.03 0 to 10 20.4 10.33 to 12 0.45 Shorebreak Injury count 8.67 to 10.33 3.22 20 to 30 3.84 10 to 20 4.13 5.74 ± 1.5 10.33 to 12 0.28 Shorebreak Injury count 0 94.4 30 to 40 3.62 20 to 30 4.60 1 5.60 40 to 50 4.20 5.85 ± 1.5 0 0 30 to 40 3.37 0.752 ± 1.2 50 to 60 40.0 1 100 40 to 50 4.66 50 to 60 62.8 28.9 ± 24 5 ± 2.3 Wave Direction (deg) Wave Direction (deg) 40.6 ± 21 247 to 265.84 0.60 265.84 to 284.67 14.1 247 to 265.84 0.76 284.67 to 303.5 64.3 265.84 to 284.67 15.0 303.5 to 322.33 18.0 284.67 to 303.5 64.1 322.33 to 341.17 3.00 303.5 to 322.33 16.5 341.17 to 360 .023 322.33 to 341.17 3.57 296 ± 14 341.17 to 360 .015 295 ± 14

Inverse Slope 5 to 12.5 52.7 Inverse Slope 12.5 to 20 14.3 5 to 12.5 52.8 20 to 27.5 12.7 12.5 to 20 13.8 27.5 to 35 13.6 20 to 27.5 11.7 35 to 42.5 3.62 27.5 to 35 14.3 42.5 to 50 3.11 35 to 42.5 4.14 42.5 to 50 3.23 17 ± 11 17.2 ± 11

Figure 10. (a) Prior probability distribution of the shore-break BN; (b) Updated probability distribution where the probability of an injury occurring was set to 100%

315 Apart from an injury hindcast scenario, various other scenarios provided insight in variable interaction. For the shore-break BN an intermediate slope during a shore-break hazard is slightly more likely during low tide (18.9%) than during high tide (13%) (see Figure 11a and b). This might be explained by the hypothesis that different parts of the foreshore slope are active during different tidal elevation levels. A similar scenario analysis was performed for the rip current BN. Figure 12a shows prior probabilities of the rip-current BN. 320 The rip current BN shows that according to the prior probability a shore-break injury (5.60%) is more likely than a rip-current related injury (3.23%). Figure 12b shows the updated probability distributions for the rip current BN, given that there is a 100% chance of an injury. Larger tidal gradients (both negative and positive) show a slight increase in probability, supporting the hypothesis that a rapid change in tidal elevation can surprise water users by driving the rapid onset of rip current activity. Low become slightly more likely when there is evidence of a rip-current related drowning incident. Increased probabilities

17 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

Tidal Range (m) Tidal Range (m) Wind Speed (m/s) Wind Speed (m/s) 1 to 1.67 0.58 1 to 1.67 0.81 0 to 4.17 27.6 1.67 to 2.33 12.4 (a) Scenario high tide shore-break BN 0 to 4.17 27.6 1.67 to 2.33 17.3 (b) Scenario low tide shore-break BN 4.17 to 8.33 50.9 2.33 to 3 21.4 4.17 to 8.33 50.9 2.33 to 3 36.1 8.33 to 12.5 6.11 3 to 3.67 48.4 8.33 to 12.5 6.11 3 to 3.67 31.7 12.5 to 16.67 15.1 3.67 to 4.33 13.9 12.5 to 16.67 15.1 3.67 to 4.33 10.1 16.67 to 20.83 0.18 4.33 to 5 3.36 16.67 to 20.83 0.18 4.33 to 5 4.02 20.83 to 25 .023 20.83 to 25 .023 3.15 ± 0.68 2.97 ± 0.72 6.64 ± 4.2 6.64 ± 4.2

Tidal Elevation (m) Tidal Elevation (m) -2.3 to -1.53 0 -2.3 to -1.53 100 Hour (h) Hour (h) -1.53 to -0.77 0 -1.53 to -0.77 0 7 to 9.33 19.9 -0.77 to 0 0 7 to 9.33 19.9 -0.77 to 0 0 9.33 to 11.67 13.5 0 to 0.77 0 9.33 to 11.67 13.5 0 to 0.77 0 11.67 to 14 13.4 0.77 to 1.53 0 11.67 to 14 13.4 0.77 to 1.53 0 14 to 16.33 20.0 1.53 to 2.3 100 14 to 16.33 20.0 1.53 to 2.3 0 16.33 to 18.67 13.2 16.33 to 18.67 13.2 -1.92 ± 0.22 1.92 ± 0.22 18.67 to 21 19.9 18.67 to 21 19.9 14.1 ± 4.2 14.1 ± 4.2

WaveHeight (m) WaveHeight (m) 0 to 0.75 0 0 to 0.75 0 0.75 to 1.5 100 0.75 to 1.5 100 Temperature (deg C) Temperature (deg C) 1.5 to 2.25 0 1.5 to 2.25 0 12 to 16.5 20.4 Exposure 12 to 16.5 20.4 2.25 to 3 0 Exposure 2.25 to 3 0 Shorebreak Haza... Shorebreak Haza... 16.5 to 21 27.1 16.5 to 21 27.1 3 to 3.75 0 low 33.1 3 to 3.75 0 low 33.1 true 100 21 to 25.5 37.2 true 100 21 to 25.5 37.2 3.75 to 4.5 0 high 20.3 3.75 to 4.5 0 high 20.3 false 0 25.5 to 30 10.4 false 0 25.5 to 30 10.4 medium 46.6 1.13 ± 0.22 medium 46.6 1.13 ± 0.22 30 to 34.5 3.13 0 30 to 34.5 3.13 0 1.14 ± 0.88 1.14 ± 0.88 34.5 to 39 1.80 34.5 to 39 1.80 21.2 ± 5.2 21.2 ± 5.2 T02 (s) T02 (s) 2 to 3.67 0 2 to 3.67 0 3.67 to 5.33 0 3.67 to 5.33 0 5.33 to 7 100 5.33 to 7 100 Insolation (m/h) Insolation (m/h) 7 to 8.67 0 7 to 8.67 0 0 to 10 44.5 0 to 10 44.5 8.67 to 10.33 0 8.67 to 10.33 0 10 to 20 3.99 10 to 20 3.99 10.33 to 12 0 Shorebreak Injury c... 10.33 to 12 0 Shorebreak Injury c... 20 to 30 3.77 20 to 30 3.77 0 84.0 6.17 ± 0.48 0 84.0 30 to 40 3.44 6.17 ± 0.48 30 to 40 3.44 1 16.0 1 16.0 40 to 50 4.42 40 to 50 4.42 50 to 60 39.9 1.22 ± 1.9 50 to 60 39.9 1.22 ± 1.9 Wave Direction (deg) 28.9 ± 24 Wave Direction (deg) 28.9 ± 24 247 to 265.84 0 247 to 265.84 0 265.84 to 284.67 0 265.84 to 284.67 0 284.67 to 303.5 100 284.67 to 303.5 100 303.5 to 322.33 0 303.5 to 322.33 0 322.33 to 341.17 0 322.33 to 341.17 0 341.17 to 360 0 341.17 to 360 0 294.1 ± 5.4 294.1 ± 5.4

Inverse Slope Inverse Slope 5 to 12.5 52.0 5 to 12.5 67.5 12.5 to 20 16.3 12.5 to 20 3.74 20 to 27.5 12.5 20 to 27.5 5.26 27.5 to 35 13.0 27.5 to 35 18.9 35 to 42.5 3.50 35 to 42.5 2.47 42.5 to 50 2.68 42.5 to 50 2.06 16.8 ± 11 15.6 ± 11

Figure 11. (a) Scenario with high tide resulting in a steep slope with a high probability; (b) Scenario with low tide resulting in a steep slope with a high probability and a intermediate slope with a slightly elevated probability

325 of larger Hs and T02 suggest such drowning incidents occur for increased incident wave energy. Wave direction shows small increase in drowning probability for the NW oriented directions. More sinuous shorelines (larger S values) show increased probability of rip-current related drowning, indicating that more alongshore-variable surf zone morphology increases the rip- current hazard. Wind speed has only little influence, although low wind speeds are slightly more likely during a drowning incident. Furthermore, the hour of the day shows a distribution that corresponds with the expected beach attendance. However, 330 there is a disproportionate peak in the evening between 19h and 21h. Furthermore, the highest peak is much earlier between 13h-15h compared to shore-break injuries. This might be explained by the fact that in summer more high tides with larger tidal ranges occur during late afternoon (Castelle et al., 2019). Temperature and insolation show comparable patterns to the shore-break BN with warm sunny days between 20-28 C° having the highest probability. Other rip-current BN scenarios were tested, providing insight in variable interactions. For instance, an interaction between 335 the magnitude of shoreline sinuosity S and wave angle of incidence was explored, given average wave energy conditions and a 100% chance that there was a rip hazard (Figure 13a and b). It can be observed that a low sinuosity (1-1.06) is correlated

with higher probabilities for the shore-normal angle of wave incidence (around 279◦) and a high sinuosity is associated with a larger probability for more NW angles of incidence (295.43-311.57°).

18 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

Tidal Gradient (m/hour) Tidal Gradient (m/hour) wind Speed (m/s) -0.6 to -0.43 3.64 (a) Prior probability rip current BN -0.6 to -0.43 5.59 (b) Updated probability rip current BN 0 to 3.57 17.4 -0.43 to -0.26 20.7 wind Speed (m/s) -0.43 to -0.26 19.0 -0.26 to -0.086 19.5 3.57 to 7.14 54.3 -0.26 to -0.086 19.9 0 to 3.57 18.0 3.57 to 7.14 53.1 -0.086 to 0.086 14.1 7.14 to 10.71 13.4 -0.086 to 0.086 13.5 7.14 to 10.71 12.7 0.086 to 0.26 21.0 10.71 to 14.29 14.3 0.086 to 0.26 23.6 10.71 to 14.29 15.6 14.29 to 17.86 0.38 0.26 to 0.43 17.4 14.29 to 17.86 0.42 0.26 to 0.43 16.3 17.86 to 21.43 0.12 0.43 to 0.6 3.43 17.86 to 21.43 0.14 21.43 to 25 .035 21.43 to 25 .033 0.43 to 0.6 2.32 -0.0141 ± 0.29 6.35 ± 3.6 -0.0158 ± 0.27 6.3 ± 3.5

Hour of Day 7 to 9 5.98 Tida Elevation (m) Hour of Day (h) Tida Elevation (m) 7 to 9 13.4 9 to 11 12.9 -2.3 to -1.64 4.18 9 to 11 13.2 -2.3 to -1.64 5.35 11 to 13 16.0 -1.64 to -0.99 17.7 11 to 13 13.3 -1.64 to -0.99 18.4 13 to 15 22.0 13 to 15 13.3 -0.99 to -0.33 20.1 -0.99 to -0.33 17.4 15 to 17 18.4 15 to 17 13.5 -0.33 to 0.33 16.8 -0.33 to 0.33 17.0 17 to 19 13.3 17 to 19 9.16 19 to 21 20.1 0.33 to 0.99 19.3 19 to 21 15.6 0.33 to 0.99 21.3 14.4 ± 4.2 0.99 to 1.64 15.1 0.99 to 1.64 18.3 1.64 to 2.3 4.94 14.5 ± 3.6 1.64 to 2.3 4.16 -0.0568 ± 1.1 0.0329 ± 1.1

Temperature (deg C) Temperature (deg C) Mean Wave Period (s) Mean Wave Period (s) 12.6 to 16.37 20.2 12.6 to 16.37 10.3 2 to 3.43 2.83 2 to 3.43 4.52 16.37 to 20.14 5.68 16.37 to 20.14 19.1 3.43 to 4.86 19.2 3.43 to 4.86 26.1 20.14 to 23.91 36.2 20.14 to 23.91 34.1 4.86 to 6.29 38.1 4.86 to 6.29 40.2 23.91 to 27.69 33.5 23.91 to 27.69 16.9 6.29 to 7.71 27.3 6.29 to 7.71 22.6 27.69 to 31.46 7.59 7.71 to 9.14 8.09 27.69 to 31.46 4.91 31.46 to 35.23 4.25 7.71 to 9.14 6.60 9.14 to 10.57 2.48 31.46 to 35.23 2.76 35.23 to 39 2.50 9.14 to 10.57 1.45 10.57 to 12 0.27 35.23 to 39 2.07 10.57 to 12 0.22 23.7 ± 5 5.91 ± 1.6 Exposure Exposure 21.4 ± 5.3 Rip Hazard 5.71 ± 1.5 Rip Hazard low 0 + low 33.3 true 100 true 50.0 high 100 high 33.3 false 0 + false 50.0 medium 0 + medium 33.3 6.72e-8 ± 0.00026 Wave Height (m) 0.5 ± 0.5 Insolation (min/hour) Wave Height (m) 1 ± 0.00042 Insolation (min/hour) 1 ± 0.82 0 to 0.64 8.93 0 to 8.57 43.8 0 to 0.64 13.2 0 to 8.57 19.5 0.64 to 1.29 53.9 8.57 to 17.14 3.61 0.64 to 1.29 41.8 8.57 to 17.14 2.18 1.29 to 1.93 31.1 17.14 to 25.71 2.65 1.29 to 1.93 28.1 17.14 to 25.71 3.13 1.93 to 2.57 9.21 25.71 to 34.3 3.11 1.93 to 2.57 6.16 25.71 to 34.3 3.29 2.57 to 3.21 3.46 34.3 to 42.86 2.72 2.57 to 3.21 2.10 34.3 to 42.86 2.80 3.21 to 3.86 0.89 42.86 to 51.43 4.03 3.21 to 3.86 0.57 42.86 to 51.43 4.17 3.86 to 4.5 0.29 Injury Hourly Count 51.43 to 60 65.8 Injury Hourly Count 3.86 to 4.5 0.26 51.43 to 60 39.2 1.3 ± 0.69 0 0 42.2 ± 21 0 96.8 1 100 1.23 ± 0.6 1 3.23 28.9 ± 24 4.5 ± 2 0.629 ± 0.84

Wave Direction (deg) Wave Direction (deg) 247 to 263.14 0.44 247 to 263.14 0.86 263.14 to 279.29 4.34 263.14 to 279.29 6.87 279.29 to 295.43 48.2 279.29 to 295.43 50.7 295.43 to 311.57 29.1 295.43 to 311.57 34.1 311.57 to 327.71 12.5 311.57 to 327.71 8.96 327.71 to 343.86 2.56 327.71 to 343.86 1.38 343.86 to 360 0 + 343.86 to 360 .011 296 ± 16 296 ± 14

Sin Sin 1 to 1.06 10.9 1 to 1.06 12.6 1.06 to 1.11 47.2 1.06 to 1.11 46.6 1.11 to 1.17 20.4 1.11 to 1.17 23.7 1.17 to 1.23 9.88 1.17 to 1.23 9.27 1.23 to 1.29 6.19 1.23 to 1.29 5.46 1.29 to 1.34 4.35 1.29 to 1.34 3.44 1.34 to 1.4 0 + 1.34 to 1.4 .012 1.121 ± 0.074 1.12 ± 0.069

Figure 12. (a) Prior probability distribution of the rip-current BN; (b) updated probability distribution of the rip current BN when the probability of an injury occurring was set to 100%

5 Discussion

340 5.1 BNs as a predictive tool for SZIs

Contrary to previous works, a same BN approach was used to address shore-break and rip-current related SZIs co-existing at a given site. Two separate BNs were created for shore-break SZIs and rip current SZIs. This allowed to use different beach morphology metrics based on prior understanding of the physics of shore-break waves and rip current dynamics. In addition, in line with Stokes et al. (2017) two hidden variables (exposure and hazard) were introduced for both BNs to decrease the amount 345 of connections and increase BN efficiency. Importantly, Doelp et al. (2019) used population data to test a SZI ratio, normalized by the population, in addition to the binary injury likelihood. Although results were not dramatically improved in Doelp et al. (2019), including accurate water user data should improve BN model predictions along the Gironde Coast. However, such data does not exist and will require future research effort. Performance metrics indicate that the BNs improve prior estimates, but that BNs still have a significant percentage of wrong 350 but confident predictions. This is due to over fitting, which is a common issue with training a BN on rare events (unbalanced

19 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

Tidal Gradient (m/hour) Tidal Gradient (m/hour) wind Speed (m/s) wind Speed (m/s) -0.6 to -0.43 1.33 (a) Scenario low sinuosity rip current BN -0.6 to -0.43 2.50 (b) Scenario high sinuosity rip current BN -0.43 to -0.26 11.8 0 to 3.57 18.0 -0.43 to -0.26 24.8 0 to 3.57 18.0 -0.26 to -0.086 20.3 3.57 to 7.14 53.1 -0.26 to -0.086 34.2 3.57 to 7.14 53.1 -0.086 to 0.086 15.3 7.14 to 10.71 12.7 -0.086 to 0.086 4.99 7.14 to 10.71 12.7 0.086 to 0.26 24.3 10.71 to 14.29 15.6 0.086 to 0.26 10.3 10.71 to 14.29 15.6 0.26 to 0.43 22.5 14.29 to 17.86 0.37 0.26 to 0.43 14.7 14.29 to 17.86 0.37 0.43 to 0.6 4.54 17.86 to 21.43 0.10 0.43 to 0.6 8.51 17.86 to 21.43 0.10 21.43 to 25 .023 21.43 to 25 .023 0.0604 ± 0.26 -0.0456 ± 0.3 6.34 ± 3.6 6.34 ± 3.6

Hour of Day Hour of Day 7 to 9 13.4 7 to 9 13.4 Tida Elevation (m) Tida Elevation (m) 9 to 11 13.2 9 to 11 13.2 -2.3 to -1.64 0 11 to 13 13.3 -2.3 to -1.64 0 11 to 13 13.3 -1.64 to -0.99 100 13 to 15 13.3 -1.64 to -0.99 100 13 to 15 13.3 -0.99 to -0.33 0 15 to 17 13.5 -0.99 to -0.33 0 15 to 17 13.5 -0.33 to 0.33 0 17 to 19 13.3 -0.33 to 0.33 0 17 to 19 13.3 0.33 to 0.99 0 19 to 21 20.1 0.33 to 0.99 0 19 to 21 20.1 0.99 to 1.64 0 0.99 to 1.64 0 1.64 to 2.3 0 14.4 ± 4.2 1.64 to 2.3 0 14.4 ± 4.2 -1.31 ± 0.19 -1.31 ± 0.19

Temperature (deg C) Temperature (deg C) Mean Wave Period (s) Mean Wave Period (s) 12.6 to 16.37 20.0 12.6 to 16.37 20.0 2 to 3.43 0 2 to 3.43 0 16.37 to 20.14 19.4 16.37 to 20.14 19.4 3.43 to 4.86 0 3.43 to 4.86 0 20.14 to 23.91 35.2 20.14 to 23.91 35.2 4.86 to 6.29 0 4.86 to 6.29 0 23.91 to 27.69 17.0 23.91 to 27.69 17.0 6.29 to 7.71 100 6.29 to 7.71 100 27.69 to 31.46 4.49 27.69 to 31.46 4.49 7.71 to 9.14 0 7.71 to 9.14 0 31.46 to 35.23 2.27 31.46 to 35.23 2.27 9.14 to 10.57 0 9.14 to 10.57 0 35.23 to 39 1.57 35.23 to 39 1.57 10.57 to 12 0 10.57 to 12 0 21.3 ± 5.1 21.3 ± 5.1 7 ± 0.41 Exposure 7 ± 0.41 Exposure Rip Hazard Rip Hazard low 40.4 low 40.4 true 100 true 100 high 21.6 high 21.6 false 0 false 0 medium 38.0 medium 38.0 Wave Height (m) 0 Insolation (min/hour) Wave Height (m) 0 Insolation (min/hour) 0.976 ± 0.88 0.976 ± 0.88 0 to 0.64 0 0 to 8.57 43.8 0 to 0.64 0 0 to 8.57 43.8 0.64 to 1.29 0 8.57 to 17.14 3.60 0.64 to 1.29 0 8.57 to 17.14 3.60 1.29 to 1.93 100 17.14 to 25.71 3.12 1.29 to 1.93 100 17.14 to 25.71 3.12 1.93 to 2.57 0 25.71 to 34.3 3.28 1.93 to 2.57 0 25.71 to 34.3 3.28 2.57 to 3.21 0 34.3 to 42.86 2.80 2.57 to 3.21 0 34.3 to 42.86 2.80 3.21 to 3.86 0 42.86 to 51.43 4.16 3.21 to 3.86 0 42.86 to 51.43 4.16 3.86 to 4.5 0 Injury Hourly Co... 51.43 to 60 39.2 3.86 to 4.5 0 Injury Hourly Co... 51.43 to 60 39.2 1.61 ± 0.18 0 80.7 28.9 ± 24 1.61 ± 0.18 0 80.7 28.9 ± 24 1 19.3 1 19.3 1.27 ± 1.8 1.27 ± 1.8

Wave Direction (deg) Wave Direction (deg) 247 to 263.14 0.34 247 to 263.14 0.63 263.14 to 279.29 3.45 263.14 to 279.29 6.47 279.29 to 295.43 60.8 279.29 to 295.43 23.9 295.43 to 311.57 25.9 295.43 to 311.57 51.2 311.57 to 327.71 9.07 311.57 to 327.71 17.0 327.71 to 343.86 0.39 327.71 to 343.86 0.74 343.86 to 360 0 + 343.86 to 360 0 + 294 ± 13 300 ± 14

Sin Sin 1 to 1.06 100 1 to 1.06 0 1.06 to 1.11 0 1.06 to 1.11 0 1.11 to 1.17 0 1.11 to 1.17 0 1.17 to 1.23 0 1.17 to 1.23 100 1.23 to 1.29 0 1.23 to 1.29 0 1.29 to 1.34 0 1.29 to 1.34 0 1.34 to 1.4 0 1.34 to 1.4 0 1.03 ± 0.017 1.2 ± 0.017

Figure 13. (a) Scenario with low sinuosity and shore-normal (around 279°) wave direction; (b) Scenario with medium sinuosity and NW wave angle

dataset) (Cheon et al., 2009). When the primary objective of a BN is prediction rather than description, a synthetic data set can be created with an even distribution of events although it degrades the BN descriptive ability. A similar suggestion to cope with this problem, is to remove anomalous confident but wrong predictions (Doelp et al., 2019). Another issue limiting the BN predictive ability is that simple beach morphology parameters S and IFS were derived from a single site (Truc Vert). 355 Summer beach profiles acquired along the different beaches should help improving IFS estimation and, in turn, prediction of rip-current drowning incidents. Similarly, optical satellite images should be explored to derive beach sinuosity S at the different beaches along the coast. As indicated earlier, an estimation of beach attendance or water users should also improve BN predictive ability. Therefore, at this stage other tools should be used for life-risk prediction, like for instance models based on simple correlations between meteorological, oceanographic conditions and the incidence (Lushine, 1991; Lascody, 1998; 360 Dusek and Seim, 2013) or on the numerical modelling of rip flow speed (Austin et al., 2013). Recently, using the same SZI dataset a logistic regression model was found to predict the risk of drowning the Gironde up to three days in advance with good skill (Tellier et al., in revision). Lastly, there were 1565 unknown injuries that could not be retrieved to either a shore-break or a rip-current related injury. Theoretically, a well trained BN could estimate which of the SZI is more likely based on the environmental conditions. How-

20 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

365 ever, since the BNs are still limited in prediction this should be explored in the future using improved BNs. Such BN could help to retrospectively improve SZI statistics along surf coasts.

5.2 Environmental controls on SZIs and implications for beach safety management

In other studies frequency analysis was used to identify disproportionate environmental conditions during SZIs (Scott et al., 2014; Castelle et al., 2019). Some of the BN results are essentially in line with previous work. In short, more SZIs are observed 370 for warm sunny days with light winds. Rip-current related drowning incidents increase with increasing incident wave energy (height and period), more shore-normal incidence, and lower tide level. In contrast, shore-break related injuries are sustained at high tide levels and moderate wave height. In addition to previous work, here we proposed a method to quantify the role of beach morphology on SZIs. Beach sinuosity S, which is a measure of the alongshore variability of surf zone morphology, and inverse beach slope IFS were found to influence the occurrence of rip-current related drowning incidents and shore- 375 break-related injuries, respectively. These results are in agreement with current knowledge of rip flow intensity increasing with increasingly alongshore-variable surf zone morphology (Moulton et al., 2017), and shore-break waves occurring for steeper beach face (Battjes, 1974; Balsillie, 1985). We also found that rapid, positive or negative, change in tide level elevation (large dη) increase the probability of rip current related drownings. This can explain why in previous work large tidal ranges were found to result in more rip current drownings (Scott et al., 2014; Castelle et al., 2019). Although this essentially applies to 380 meso- to macro-tidal beaches and given that dη does not affect the hazard posed, this demonstrates that rapid changes in tidal elevation driving the rapid onset of rip current activity can surprise unsuspecting bathers and carry them offshore. To our knowledge, the rapid onset of hazardous surf zone currents for days with large tidal range is not emphasized in current public safety awareness campaigns on rip currents. In addition to the primary environmental controls on SZIs, in this study it was for the first time possible to identify the 385 interaction between multiple input variables. For instance, it was found that it is the combined effects of tide elevation, wave direction and wave height that control rip-current hazard. In other words, even if you have shore-normally incident waves near low tide, if wave height is very small, there is no hazard and consequently a low probability of rip-current related drownings. Such interactions, which were not possible to address in previous work (Scott et al., 2014; Castelle et al., 2019), are in line with the understanding of rip flow response to wave and tide conditions (Castelle et al., 2016). Similarly, evidence was found 390 that rips can be activated during various angles of wave incidence, depending on the degree of shoreline sinuosity S. This is also in line with observations and model outputs showing that, for the same obliquely incident wave conditions, rip cell circulation are transformed into an undulating, less hazardous, longshore current for weakly (small S) alongshore variable surf zone morphology, while rip cell circulation can be sustained for deep rip channels (MacMahan et al., 2008; Dalrymple et al., 2011). This shows that BNs including a wisely pre-defined hidden hazard variable can provide insight into the influence of the 395 primary input variables and their interactions on the hazard posed. Therefore, it could also be applied to other injuries, e.g. related to surfing activity, for which the causes (e.g. environmental, behavioural) and their interplay are poorly understood. Studies addressing the environmental controls on shore-break related SZIs are scarce (Puleo et al., 2016; Doelp et al., 2019) compared to drowning studies. The shore-break BN developed herein for the Gironde coast suggests that, with decreased

21 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

exposure for Hs > 2.5 m, large surf, and thus heavy shore-break waves at the shoreline, discourage the beachgoers to enter 400 the water near high tide. Importantly, this was not observed for rip-current related drownings, which have a tendency to occur at low tide with the inner surf zone located on a much more gently sloping part of the beach profile. We hypothesize that in such less adverse conditions, beachgoers are less discouraged to enter the water, as opposed to facing large shore-break waves. However, further investigation on beachgoer behaviour in the presence of shore-break waves is required to test this hypothesis. In addition, our variable sensitivity analysis indicates the shore-break related injuries are more controlled by the exposure 405 than by hazard, contrary to rip-current related drowning for which life risk is approximately equally distributed between hazard and exposure. This indicates that shore-break injuries are more likely to occur during busy days, whether moderate or heavy shore-break conditions are present. In contrast, the presence of intense rip currents is critical to drowning incidents.

6 Conclusions

A Bayesian network (BN) approach was used to model life risk and the controls and interactions of environmental (metocean 410 and morphological) data on SZIs along a high-energy meso-macrotidal coast where shore-break and rip-current hazards co- exist. In line with previous work, the BNs show limited predictive skill. Although the shore-break and rip-current BNs improves prior estimates, they still have a large percentage of wrong but confident predictions, which is not tenable for life-risk prediction on beaches. However, the BNs provide fresh insight into the different environmental controls, their interactions, and their respective contribution to hazard and exposure. For the first time, the respective contributions of exposure and hazards to 415 the overall life risk were quantified, showing the shore-break related injuries are more controlled by the exposure than by hazard, contrary to rip-current related drowning for which contributions are approximately equal. These results can guide the future development, or modification, of public education messaging, particularly on the shore-break hazard that received little attention so far compared to rip currents, despite the large number of severe injuries sustained in shore-break waves along the Gironde Coast. We advocate that such BNs should be developed in parallel with other risk predictors showing high predictive 420 skill but providing much less diagnostic Tellier et al. (in revision).

Data availability. The wave buoy data are publicly available through the French Candhis network operated by CEREMA. Weather station and tide gauge data are available from the Météo France Radome network and the SHOM, respectively. Wave hindcast is available from the MARC platform (Modelling and Analysis for Research in Coastal environment) at https://marc.ifremer.fr.The injury data collected on beaches are not publicly available due to restriction from the French National Committee for the Protection of Data Privacy.

425 Author contributions. BC and ET designed the research presented here. EdK designed the experiments, performed the modelling experiments and analysed all the results. BC and ET consulted on the experiments. EdK prepared the manuscript, and BC and ET reviewed and edited the manuscript. Funding provided by BC.

22 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

Competing interests. The authors declare that they have no conflict of interest.

Acknowledgements. We thank Dr. Bruno Simonnet who contributed to the injury data collection. Constructive suggestions to improve read- 430 ability by Rieke Santjer were much appreciated. This study includes the beach monitoring study site of Truc Vert labelled by the Service National d’Observation (SNO) Dynalit, which surveys are financially supported by Observatoire de la Côte Aquitaine (OCA) and SNO Dynalit.

23 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

References

Austin, M., Scott, T. M., Brown, J. W., Brown, J. A., MacMahan, J. H., Masselink, G., and Russell, P.: Temporal observations of rip current 435 circulation on a macro-tidal beach, Cont. Shelf. Res., 30, 1149–1165, https://doi.org/10.1016/j.csr.2010.03.005, 2010. Austin, M., Scott, T. M., Russell, P. E., and Masselink, G.: Rip Current Prediction: Development, Validation and Evaluation of an Operational Tool, J. Coast. Res., 29, 283–300, https://doi.org/10.2112/JCOASTRES-D-12-00093.1, 2013. Balsillie, J. H.: Redefinition of shore-breaker classification as a numerical continuum and a design shore-breaker, J. Coast. Res., 1, 247–254, 1985. 440 Battjes, J. A.: Surf similarity, in: 14th Int. Conf. on Coastal Eng., pp. 466–480, ASCE, 1974. Bayes, T.: An Essay Towards Solving a Problem in the Doctrine of Chances, Philos. Trans. R. Soc. Lond., 53, 370–418, 1763. Boudière, E., Maisondieu, C., Ardhuin, F., Accensi, M., Pineau-Guillou, L., and Lepesqueur, J.: A suitable metocean hindcast database for the design of Marine energy converters, Int. J. Mar. Ener., 3-4, e40 – e52, https://doi.org/10.1016/j.ijome.2013.11.010, 2013. Brander, R. W. and Short, A. D.: Flow kinematics of low-energy rip current systems, J. Coast. Res., 17, 468–481, 2001. 445 Brighton, B., Sherker, S., Brander, R., Thompson, M., and Bradstreet, A.: Rip current related drowning deaths and rescues in Australia 2004–2011, Nat. Hazards Earth Syst. Sci., 13, 1069–1075, https://doi.org/10.5194/nhess-13-1069-2013, 2013. Bruneau, N., Bonneton, P., Castelle, B., and Pedreros, R.: Modeling rip current circulations and vorticity in a high-energy mesotidal- macrotidal environment, J. Geophys. Res. Oceans, 116, https://doi.org/10.1029/2010JC006693, 2011. Castelle, B., Marieu, V., Bujan, S., Splinter, K. D., Robinet, A., Sénéchal, N., and Ferreira, S.: Impact of the winter 2013–2014 series of 450 severe Western Europe storms on a double-barred sandy coast: Beach and dune erosion and megacusp embayments, Geomorphology, 238, 135–148, https://doi.org/10.1016/j.geomorph.2015.03.006, 2015. Castelle, B., Scott, T., Brander, R., and McCarroll, R.: Rip current types, circulation and hazard, Earth Sci. Rev., 163, 1–21, https://doi.org/10.1016/j.earscirev.2016.09.008, 2016. Castelle, B., Brander, R., Tellier, E., Simonnet, B., Scott, T., McCarroll, J., Campagne, J.-M., Cavailhes, T., and Lechevrel, P.: Surf zone 455 hazards and injuries on beaches in SW France, Nat. Hazards, 93, 1317–1335, https://doi.org/10.1007/s11069-018-3354-4, 2018. Castelle, B., Scott, T., Brander, R., McCarroll, J., Robinet, A., Tellier, E., de Korte, E., Simonnet, B., and Salmi, L.-R.: Environmental controls on surf zone injuries on high-energy beaches., Nat. Hazards Earth Syst. Sci., 19, https://doi.org/10.5194/nhess-19-2183-2019, 2019. Castelle, B., Bujan, S., Marieu, V., and Ferreira, S.: 16 years of topographic surveys of rip-channelled high-energy meso-macrotidal sandy 460 beach, Sci. Data.s, 7, https://doi.org/10.1038/s41597-020-00750-5, 2020. Cheon, S.-P., Kim, S., Lee, S.-Y., and Lee, C.-B.: Bayesian networks based rare event prediction with sensor data, Knowl. Based Systs., 22, 336 – 343, https://doi.org/10.1016/j.knosys.2009.02.004, 2009. Dalrymple, R. A., MacMahan, J. H., Reniers, A. J., and Nelko, V.: Rip Currents, Annu. Rev. Fluid Mech., 43, 551–581, https://doi.org/10.1146/annurev-fluid-122109-160733, 2011. 465 Dodet, G., Castelle, B., Masselink, G., Scott, T., Davidson, M., Floc’h, F., Jackson, D., and Suanez, S.: Beach recovery from ex- treme storm activity during the 2013–14 winter along the Atlantic coast of Europe, Earth Surf. Process. Landf., 44, 393–401, https://doi.org/10.1002/esp.4500, 2019. Doelp, M. B., Puleo, J. A., and Plant, N. G.: Predicting surf zone injuries along the Delaware coast using a Bayesian network, Nat. Hazards, 98, 379–401, https://doi.org/10.1007/s11069-019-03697-y, 2019.

24 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

470 Drozdzewski, D., Shaw, W., Dominey-Howes, D., Brander, R., Walton, T., Gero, A., Sherker, S., Goff, J., and Edwick, B.: Surveying rip current survivors: preliminary insights into the experiences of being caught in rip currents, Nat. Hazards Earth Syst. Sci., 12, 1201–1211, https://doi.org/10.5194/nhess-12-1201-2012, 2012. Dusek, G. and Seim, H.: A probabilistic rip current forecast model, J. Coast. Res., 29, 909–925, https://doi.org/10.2112/JCOASTRES-D-12- 00118.1, 2013. 475 Dwight, R. H., Brinks, M. V., SharavanaKumar, G., and Semenza, J. C.: Beach attendance and bathing rates for Southern California beaches, Ocean Coast. Manage., 50, 847 – 858, https://doi.org/10.1016/j.ocecoaman.2007.04.002, 2007. Fienen, M. N. and Plant, N. G.: A cross-validation package driving Netica with python, Environ. Model. Softw., 63, 14–23, https://doi.org/10.1016/j.envsoft.2014.09.007, 2015. Fienen, M. N., Masterson, J. P., Plant, N. G., Gutierrez, B. T., and Thieler, E. R.: Bridging groundwater models and decision support with a 480 Bayesian network, Water Resour. Res., 49, 6459–6473, https://doi.org/10.1002/wrcr.20496, 2013. Gutierrez, B. T., Plant, N. G., and Thieler, E. R.: A Bayesian network to predict coastal vulnerability to sea level rise, J. Geophys. Res. Earth Surf., 116, https://doi.org/10.1029/2010JF001891, 2011. Gutierrez, B. T., Plant, N. G., Thieler, E. R., and Turecek, A.: Using a Bayesian network to predict barrier island geomorphologic character- istics, J. Geophys. Res. Earth Surf., 120, 2452–2475, https://doi.org/10.1002/2015JF003671, 2015. 485 Houser, C., Arnott, R., Ulzhofer, S., and Barrett, G.: Nearshore circulation over transverse bar and rip morphology with oblique wave forcing, Earth Surf. Process. Landf., 38, 1269–1279, https://doi.org/10.1002/esp.3413, 2013. Ibarra, E.: The use of webcam images to determine tourist-climate aptitude: Favourable weather types for sun and beach tourism on the Alicante coast (Spain), Int. J. Biometeorol., 55, 373–385, https://doi.org/10.1007/s00484-010-0347-8, 2011. Jefferys, W. H. and Berger, J. O.: Ockham’s razor and Bayesian analysis, Am. Sci., 80, 64–72, 1992. 490 Korb, K. B. and Nicholson, A. E.: Bayesian artificial intelligence, CRC press, London UK, https://doi.org/10.1201/b10391, 2010. Lascody, R. L.: East central Florida rip current program, Natl. Weather Dig., 22, 25–30, 1998. Lushine, J. B.: A study of rip current drownings and related weather factors, Natl. Weather Dig., 16, 13–19, 1991. MacMahan, J. H., Thornton, E. B., Stanton, T. P., and Reniers, A. J. H. M.: RIPEX-observations of a rip current system, Mar. Geol., 218, 113–134, https://doi.org/10.1016/j.margeo.2005.03.019, 2005. 495 MacMahan, J. H., Thornton, E. B., and Reniers, A. J. H. M.: Rip current review, Coast. Eng., 53, 191–208, https://doi.org/10.1016/j.coastaleng.2005.10.009, 2006. MacMahan, J. H., Thornton, E. B., Reniers, A. J., Stanton, T. P., and Symonds, G.: Low-Energy Rip Currents Associated With Small Bathymetric Variations, Mar. Geol., 255, 156 – 164, https://doi.org/10.1016/j.margeo.2008.08.006, 2008. Marcot, B. G.: Metrics for evaluating performance and uncertainty of Bayesian network models, Ecol. model., 230, 50–62, 500 https://doi.org/10.1016/j.ecolmodel.2012.01.013, 2012. Moulton, M., Elgar, S., Raubenheimer, B., Warner, J. C., and Kumar, N.: Rip currents and alongshore flows in single channels dredged in the surf zone, J. Geophys. Res. Oceans, 122, 3799–3816, https://doi.org/10.1002/2016JC012222, 2017. Muller, M. W.: Beach replenishment and surf-zone injuries along the coast of Delmarva, USA, Ocean & Coastal Management, 151, 127–133, https://doi.org/10.1016/j.ocecoaman.2017.10.013, 2018. 505 Norsys, N. A.: Norsys Software Corp, https://www.norsys.com, v6.05, 1998. Pearson, S., Storlazzi, C. D., van Dongeren, A., Tissier, M., and Reniers, A.: A Bayesian-based system to assess wave-driven flooding hazards on coral -lined coasts, J. Geophys. Res. Oceans, 122, 10 099–10 117, https://doi.org/10.1002/2017JC013204, 2017.

25 https://doi.org/10.5194/nhess-2021-36 Preprint. Discussion started: 28 January 2021 c Author(s) 2021. CC BY 4.0 License.

Plant, N. G. and Holland, K. T.: Prediction and assimilation of surf-zone processes using a Bayesian network: Part I: Forward models, Coast. Eng., 58, 119–130, https://doi.org/10.1016/j.coastaleng.2010.09.003, 2011. 510 Puleo, J., Hutschenreuter, K., Cowan, P., Carey, W., Arford-Granholm, M., and McKenna, K.: Delaware surf zone injuries and associated environmental conditions, Nat. Hazards, 81, 845–867, https://doi.org/10.1007/s11069-015-2108-9, 2016. Robbles, L.: Cervical spine injuries in ocean bathers: wave-related accidents, Neurosurgery, 58, 920–923, https://doi.org/10.1227/01.NEU.0000209941.18102.35, 2006. Russel, S. and Norvig, P.: Artificial Intelligence: A Modern Approach Thrid Edition, Person Education, Boston Munich, 515 https://doi.org/10.1016/j.artint.2011.01.005, 2010. Scott, T., Masselink, G., Austin, M. J., and Russell, P.: Controls on macrotidal rip current circulation and hazard, Geomorphology, 214, 198–215, https://doi.org/10.1016/j.geomorph.2014.02.005, 2014. Stokes, C., Masselink, G., Revie, M., Scott, T., Purves, D., and Walters, T.: Application of multiple linear regression and Bayesian belief network approaches to model life risk to beach users in the UK, Ocean & Coast. Manage., 139, 12 – 23, 520 https://doi.org/10.1016/j.ocecoaman.2017.01.025, 2017. Tellier, E., Simonnet, B., Gil-Jardine, C., Castelle, B., Bailhache, M., and Salmi, L. R.: Characteristics of drowning victims in a surf environ- ment: a 6-year retrospective study in southwestern France, Inj. Epidemiol., 6, https://doi.org/10.1186/s40621-019-0195-x, 2019. Tellier, E., Simonnet, B., Gil-Jardine, C., Castelle, B., and Salmi, L. R.: Predicting drowning from sea and weather forecasts: development and validation of a model on surf beaches of southwestern France, Inj. Prev., in revision. 525 Tolman, H. L.: User manual and system documentation of WAVEWATCH III version 4.18, in: NOAA/NWS/NCEP/MMAB Technical Note 316, p. 194pp., 2014.

26