Inferring causal associations in hydrological systems: A comparison of methods

doi:10.21203/rs.3.rs-4643196/v1

Research Article

https://doi.org/10.21203/rs.3.rs-4643196/v1

This work is licensed under a CC BY 4.0 License

Many research questions in hydrological systems are intrinsically causal, aiming to determine whether and how one factor affects another. Although causal inference methods have been applied in hydrology to some extent, a systematic comparison of the different methods is still lacking. Here, four popular methods in the causal inference community, namely the cross-correlation function (CCF), convergent cross mapping (CCM), transfer entropy (TE), and a causal network learning algorithm (PCMCI+), were selected, and their basic principles and underlying assumptions are explained in detail. Next, the performance of these methods was evaluated in large sample tests and sensitivity analyses using synthetic time series generated by a conceptual hydrological model with two predesigned causal structures. Then, the four methods were applied to two real-world cases to further understand their characteristics. The findings show the superior performance of the PCMCI+ method in the synthetic cases and a commendable level of interpretability in the real cases, warranting its broader application in hydrological systems. The limitations of the other three methods, especially in addressing confounding and mediating factors, led to several unreasonable causal links. Furthermore, the conflicting results among different methods in real-world applications underscore the need for a multifaceted understanding based on their particular assumptions and constraints. We encourage applying several methods, chosen according to the specific issue and with their assumptions clearly stated in advance, to make conclusions more robust. Overall, our research reveals the potential and limitations of different causal inference methods for understanding complex interactions within hydrological systems, and serves as a useful guide for their wider use in hydrology.

causal inference

cross-correlation function (CCF)

convergent cross mapping (CCM)

transfer entropy (TE)

PCMCI+

hydrological systems

Hydrological systems, which encompass the interactions of multiple variables (precipitation, runoff, evaporation, etc.) across spatiotemporal scales, are large, complex systems with both deterministic and stochastic dynamics (Sang et al., 2015). Many hydrological questions are inherently causal: they ask whether and how cause variables affect effect variables. Examples include how to identify and quantify interactions among variables in the hydrologic cycle (Good et al., 2015; Kleidon and Renner, 2013); how to select appropriate influencing factors for robust hydrologic predictions (Apaydin and Sibtain, 2021; Wang et al., 2023; Chen et al., 2011); and how to determine the direct and indirect factors in the analysis of streamflow or hydrologic extremes (i.e., floods and droughts) under a changing environment (Liang et al., 2023; Zhang et al., 2021a; Peng et al., 2024). It is widely acknowledged that correlation does not imply causation (Altman and Krzywinski, 2015). Associations between variables can arise both in the presence of a causal relationship (i.e., X causes Y) and in its absence (i.e., X and Y have a common cause), as stated in Reichenbach’s common cause principle (Reichenbach, 1956). Relying solely on correlation-based methods may therefore fail to capture the intrinsic causal mechanisms in hydrological systems, especially in a complex spatiotemporal setting characterized by numerous variables. In addition, Blöschl et al. (2019) recently stated in the 23 unsolved problems for hydrologic sciences that “Questions remain focused on process-based understanding of hydrological variability and causality at all space and time scales”, which highlights the urgent need for a causal perspective and framework to promote a better understanding of complex hydrological systems.

Generally, replicated interventional experiments on variables provide an explicit way to understand causal processes. However, such experiments are often infeasible or unethical in the real world, for example when they require intervening in regional or global hydrological variables. With the development of monitoring technology, several experimental catchments have been established in recent years (Zhang et al., 2021b). However, their construction and maintenance are costly, and because of the spatial heterogeneity and scale problems of hydrological systems (Bergström and Graham, 1998; Blöschl et al., 2019), extending and upscaling the discovered mechanisms is also challenging. Data-informed computer simulations may serve as an alternative to randomized controlled experiments, but they are often time-consuming and inaccurate, and they demand extensive expert knowledge, which in turn may impose strong mechanistic assumptions on the system (Runge et al., 2019a). Fortunately, with the development of observational techniques and data science, the amount of available time series data in the hydrologic sciences has been increasing over the years. These data come from satellite remote sensing observations, station-based or field site measurements, earth system modeling, and reanalysis products (Li et al., 2023a). Such data repositories, along with growing computational efficiency, open up an alternative avenue: using data-driven methods without intervening in hydrological systems, namely observational causal inference.

Over the past decades, with the progress of statistics and computer science, methods for causal inference from observational time series have developed rapidly. The most classic work can be traced back to Granger causality (GC; Granger, 1969), which infers a causal influence of variable X on Y if the past of X contributes to the prediction of the future of Y. However, the validity of the GC framework remains controversial since it only reveals statistical associations via linear vector autoregression (Shojaie and Fox, 2022). Later, a nonlinear extension of the original GC framework was developed by integrating information theory, namely transfer entropy (TE; Schreiber, 2000). Beyond the GC framework, a complementary perspective emerged from nonlinear dynamical systems theory based on the reconstruction of attractors, known as convergent cross mapping (CCM; Sugihara et al., 2012). More recently, graph-based causal network learning algorithms, such as the Peter–Clark momentary conditional independence (PCMCI) algorithm proposed by Runge et al. (2019b), have become increasingly popular in the causal inference community. Unlike data-driven machine learning methods, which primarily concentrate on classification or prediction (Zounemat-Kermani et al., 2021), causal methods aim to discover and quantify the causal interactions of the underlying system based on the measured time series (Runge et al., 2023). Although explainable machine learning has recently attracted great interest (Angelov et al., 2021), it remains challenging to extract complex mechanisms from black-box machine learning models, including deep learning models (Rudin, 2019). Thus, causal inference methods are an important supplement to predictive machine learning for improving our theoretical understanding of underlying systems (Reichstein et al., 2019). Over the years, causal inference methods have shown great potential across Earth system science for tasks such as regularity discovery, process understanding, hypothesis validation, and physical model improvement (Su et al., 2023; Faybishenko, 2017).

Unfortunately, in hydrology, traditional correlation and regression methods are still the most commonly used tools (Massei et al., 2006; Xu et al., 2022; Yu et al., 2011), and the application of modern causal methods remains incipient. Existing examples include studies of the feedback between rainfall and soil moisture (Luo et al., 2023; Wang et al., 2018), the interactions between groundwater and streamflow (Bonotto et al., 2022; Zhao et al., 2023b), and the hydrological connectivity in karstic areas (Delforge et al., 2022; Li et al., 2023b). Modern causal inference methods have great potential in hydrological systems, provided that their underlying assumptions and methodological challenges are fully considered (Runge et al., 2019a). Because only a limited number of exploratory studies have applied causal inference methods in hydrology, there remains much confusion about how and when to apply them and how to interpret their results. A comprehensive comparison and evaluation of these methods is therefore required to guide their wider adoption in the hydrological sciences.

In this study, four popular methods in the causal inference community, namely cross-correlation function (CCF), convergent cross mapping (CCM), transfer entropy (TE), and a causal network learning algorithm (PCMCI+) were selected, with a detailed explanation of their basic principles and underlying assumptions. Then, the performances of the methods were systematically evaluated in both synthetic and real hydrological systems, thus revealing their potential and limitations for application in hydrology. The remainder of this paper is organized as follows. Section 2 provides a comprehensive description of the four causal inference methods in our comparison study. Section 3 successively illustrates the setup of the hydrological model structure, evaluation metrics, and experimental results in the synthetic case, which includes the large sample test and sensitivity analysis. Section 4 presents the performances of the causal inference methods in two real case studies. Further discussion and conclusions are illustrated in Sections 5 and 6, respectively.

Four causal inference methods, including the CCF, CCM, TE, and PCMCI+, which are popular and representative in the causal inference community (Runge et al., 2019a; Runge et al., 2023), were chosen for our research. The basic characteristics of these methods are listed in Table 1.

Table 1

Summary of the characteristics of the four causal inference methods adopted in this research.
	Cross-correlation function (CCF)	Convergent cross mapping (CCM)	Transfer entropy (TE)	Causal network learning algorithm (PCMCI+)
Linear/nonlinear	linear	nonlinear	nonlinear	linear/nonlinear^a
Type of system	stochastic	deterministic	stochastic	stochastic
Detection of contemporaneous causal links	No	Yes/No^b	No	Yes
Other assumptions	/	nonlinear dynamical system	/	causal sufficiency, causal Markov, causal faithfulness
References	(Cryer and Chan, 2008)	(Sugihara et al., 2012); (Ye et al., 2015)	(Schreiber, 2000)	(Runge, 2020)
Note: ^aDepends on the algorithms for conditional independence detection. ^bDepends on whether the principle of priority of the cause is adopted.

2.1 Cross-correlation function

For a driving variable X and a response variable Y, causality can be identified through the cross-correlation function (CCF) together with the principle of priority of the cause. As shown in Fig. 1a, the variable X_t−d is considered to be the cause of variable Y_t if Pearson’s correlation coefficient ρ between X_t−d and Y_t is significant over their common time domain for at least one value of d up to d_max:

$$\rho (d)=\frac{{Cov({X_{t - d}},{Y_t})}}{{{\sigma _x}{\sigma _y}}}$$

where Cov(X_t−d, Y_t) represents the cross-covariance at lag time d; σ_x and σ_y denote the standard deviations of time series x and y, respectively. The significance of the hypothesis (i.e., ρ is different from zero) is usually evaluated using Student’s t-test (Student, 1908) with a p-value. Given a significance level α, a relationship is considered significant if the p-value is lower than α.

In this study, significant correlations identified by the CCF method are considered as a way to reveal potentially causal links between two time series at a causal delay. However, according to Reichenbach’s common cause principle (Reichenbach, 1956), this linear correlation may not imply causation, owing to the interference of confounders, autocorrelation of variables, and nonlinear dynamical associations (Dean and Dunsmuir, 2016). Despite these inherent limitations as a bivariate linear method, the CCF method has still been widely applied in many disciplines including hydrology due to its simplicity, efficiency, and linear interpretability (Massei et al., 2006; Xu et al., 2022; Yu et al., 2011).
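As a rough illustration of the CCF screening described in this section, the following minimal sketch computes the lagged Pearson correlation between X_t−d and Y_t for d = 1, …, d_max and keeps the significant lags. The function names (`pearson`, `significant_lags`) and the toy series are hypothetical, and the p-value uses a large-sample normal approximation to the t distribution to stay within the standard library, which is an assumption of this sketch rather than the procedure used in the study.

```python
import math
import random


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    cov = sum((u - ma) * (v - mb) for u, v in zip(a, b))
    var_a = sum((u - ma) ** 2 for u in a)
    var_b = sum((v - mb) ** 2 for v in b)
    return cov / math.sqrt(var_a * var_b)


def significant_lags(x, y, d_max, alpha=0.05):
    """Return {d: (rho, p)} for lags where corr(X_{t-d}, Y_t) is significant."""
    links = {}
    for d in range(1, d_max + 1):
        xs, ys = x[:-d], y[d:]  # pairs X_{t-d} with Y_t
        n = len(xs)
        rho = pearson(xs, ys)
        t_stat = rho * math.sqrt((n - 2) / max(1.0 - rho ** 2, 1e-12))
        # Assumption: two-sided p-value from a normal approximation (large n).
        p_value = math.erfc(abs(t_stat) / math.sqrt(2.0))
        if p_value < alpha:
            links[d] = (rho, p_value)
    return links


# Toy series (hypothetical): Y responds to X with a delay of two steps.
rng = random.Random(0)
x = [rng.gauss(0.0, 1.0) for _ in range(500)]
y = [0.0, 0.0] + [0.8 * x[t - 2] + 0.5 * rng.gauss(0.0, 1.0) for t in range(2, 500)]
print(significant_lags(x, y, d_max=5))
```

Because each lag is tested separately at level α, some lags can pass the test by chance, so the set of significant lags is best read as a list of candidate links.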

2.2 Convergent cross mapping

Convergent cross mapping (CCM), proposed by Sugihara et al. (2012), aims to infer causality between two time series in nonlinear dynamical systems using the theory of time delay embedding (Takens, 1981). It detects topological similarities between reconstructed attractor manifolds (i.e., consistent behaviors when revisiting similar states). The basic assumption of the CCM is that variables X and Y belong to the same nonlinear dynamical system, and if X causes Y, the cause variable X leaves informative features in the affected variable Y, and thus X can be estimated from its features in Y (Fig. 1b).

Specifically, to identify the causation between variables X and Y, the first stage is to reconstruct the shadow manifold of each variable from its lagged time series. For instance, the trajectories M_x = {X_t, X_t−τ, X_t−2τ} and M_y = {Y_t, Y_t−τ, Y_t−2τ} are the shadow manifolds of variables X and Y, respectively, where τ denotes the time delay. Secondly, for a single point Y_t in M_y, the k (= E + 1) nearest neighbors ${Y_{{t_m}}}$ (m = 1, 2, …, k) are identified based on their Euclidean distance to Y_t, where E is the embedding dimension. Next, the time indices of these nearest neighbors are used to locate the corresponding points ${X_{{t_m}}}$ in M_x, and their average, weighted according to the Euclidean distances, is calculated to estimate X_t, as follows:

$${\widehat {X}_t}|{M_y}=\sum\limits_{{m=1}}^{k} {{\omega _m}} {X_{{t_m}}}$$

$${\omega _m}=\frac{{{u_m}}}{{\sum\limits_{{n=1}}^{k} {{u_n}} }}$$

$${u_m}={e^{ - \frac{{d({Y_t},{Y_{{t_m}}})}}{{d({Y_t},{Y_{{t_1}}})}}}}$$

where ${\widehat {X}_t}$ represents the value of X_t estimated from M_y, and d(X, Y) denotes the Euclidean distance between vectors X and Y. Finally, the cross-mapping skill, represented by Pearson's correlation coefficient ρ between the observed X_t and the estimated ${\widehat {X}_t}$, quantifies the strength of causality from variable X to Y. In addition, ρ increases from 0 to a convergent value as the cross-mapping process successively includes all points of the shadow manifold. Following Ombadi et al. (2020), the significance threshold of ρ was set to 0.3.

Given that the CCM method is based on chaos theory, it should be applied under the strict assumptions of deterministic mathematical models, namely, low-dimensional systems with infinite length, no noise, and acyclic and non-intermittent relations (Khatibi et al., 2012; Sivakumar, 2004). Additionally, according to Sugihara et al. (2017), convergence can also be achieved when variables are subject to synchrony, a well-known phenomenon resulting from strong coupling or dynamic resonance, which might lead to spurious bidirectional causal relationships, especially for hydrological variables (Ombadi et al., 2020; Zhao et al., 2023a). Therefore, this study adopted the time-lagged CCM (Ye et al., 2015), which aims to overcome synchrony by exploiting the asymmetric patterns of time dependencies (i.e., the principle of priority of the cause). Unlike the traditional CCM method, which uses Y_t to cross map X_t, the time-lagged CCM method uses Y_t+d instead. Note that d is the lag time of cross-mapping, ranging from 0 to d_max, and is different from the embedding delay τ used in the reconstruction of the shadow manifold.
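The cross-mapping steps described in this section can be sketched in a few lines of plain Python. This is an illustrative sketch only: the function name `cross_map_skill`, the default parameters, and the coupled logistic maps used as a test system are hypothetical and are not taken from the study. It uses the whole series as the library, so it does not show the convergence with library length, and setting `d > 0` uses the manifold point at time t + d to estimate X_t, as in the time-lagged variant.

```python
import math


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    num = sum((u - ma) * (v - mb) for u, v in zip(a, b))
    den = math.sqrt(sum((u - ma) ** 2 for u in a) * sum((v - mb) ** 2 for v in b))
    return num / den


def cross_map_skill(x, y, E=3, tau=1, d=0):
    """Correlation between X_t and its estimate from the shadow manifold of Y."""
    first = (E - 1) * tau  # earliest index with a complete lag vector
    times = list(range(first, len(y)))
    vec = {s: [y[s - i * tau] for i in range(E)] for s in times}
    k = E + 1  # number of nearest neighbours
    observed, estimated = [], []
    for s in times:
        if s - d < 0:
            continue
        # Distances from the manifold point at time s to every other point.
        nearest = sorted(
            (math.dist(vec[s], vec[r]), r) for r in times if r != s and r - d >= 0
        )[:k]
        d1 = max(nearest[0][0], 1e-12)  # guard against a zero nearest distance
        u = [math.exp(-dist / d1) for dist, _ in nearest]
        total = sum(u)
        # Weighted average of X at the (lag-shifted) neighbour times.
        estimated.append(sum(w / total * x[r - d] for w, (_, r) in zip(u, nearest)))
        observed.append(x[s - d])
    return pearson(observed, estimated)


# Hypothetical test system: X evolves on its own and drives Y.
x, y = [0.4], [0.2]
for _ in range(399):
    x_prev, y_prev = x[-1], y[-1]
    x.append(x_prev * (3.8 - 3.8 * x_prev))
    y.append(y_prev * (3.5 - 3.5 * y_prev - 0.1 * x_prev))

print("X estimated from M_y:", round(cross_map_skill(x, y, E=2), 3))
print("Y estimated from M_x:", round(cross_map_skill(y, x, E=2), 3))
```

In this sketch the first call corresponds to testing the link X → Y (X is recovered from the manifold of Y), and the second to testing Y → X.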

2.3 Transfer entropy

Transfer entropy (TE) is an information-theoretic approach that quantifies the temporally asymmetric transfer of information between two time series (Schreiber, 2000). It can be regarded as a nonparametric extension of Granger causality (Granger, 1969) and is used to infer a causal association from X to Y if knowing the past of variable X reduces the uncertainty about variable Y. Because it makes no assumptions about the underlying structure of the data and is applicable to nonlinear separable systems, the TE method has been widely applied in hydrology (Bennett et al., 2019; Konapala et al., 2020; Moges et al., 2022). To calculate TE, the concept of mutual information is introduced (Kinney and Atwal, 2014), which can be considered as the overlapping information between two random variables X and Y, shown as follows:

$$I(X;Y)=H(X)+H(Y) - H(X,Y)$$

where I(X; Y) denotes the mutual information between variables X and Y; H is the Shannon entropy (Shannon, 1948), a measure of a variable’s average uncertainty; H(X) and H(X, Y) represent the univariate and bivariate information entropy, respectively, calculated as:

$$H(X)= - \int\limits_{X} {p(x){{\log }_2}(p(x))dx}$$

$$H(X,Y)= - \int\limits_{Y} {\int\limits_{X} {p(x,y){{\log }_2}(p(x,y))dxdy} }$$

where p represents the probability distribution of the random variable. Then, TE can be understood as the shared mutual information between the past of the cause variable X and the present of the affected variable Y, conditioned on the past of Y (Fig. 1c), calculated as:

$$T{E_{X \to Y}}=I({Y_t};{X_{t - d}}|{Y_{t - 1}})$$

$$I(X;Y|Z)=H(X|Z)+H(Y|Z) - H(X,Y|Z)$$

$$H(X|Y)=H(X) - I(X;Y)$$

where d is the lag time in the transfer of information between X and Y; I(X;Y|Z) and H(X|Y) denote the conditional mutual information and conditional entropy, respectively. Y_t−1 was selected to represent the past of variable Y based on the Markov-process assumption that the immediate history provides the most information (Budakoti et al., 2021; Ruddell and Kumar, 2009). To estimate the probability distributions of the variables, following Ruddell and Kumar (2009), a histogram-based approach with 11 bins was adopted. Because hydrological time series contain zero values, which may bias the calculation of entropy, we used the approach proposed by Gong et al. (2014) to handle zero and nonzero values separately in the binning. Additionally, the significance of TE was tested by Monte Carlo methods and Student’s t-test. The TE values were compared with a series of 500 randomly shuffled surrogates in which any temporal correlations between the two time series were broken (Moges et al., 2022). A TE value is considered significant if it exceeds the threshold, defined as the 95th percentile of the surrogate values.
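A histogram-based TE estimate with a shuffle test can be sketched as follows. The sketch relies on the standard identity I(A;B|C) = H(A,C) + H(B,C) − H(C) − H(A,B,C) with A = Y_t, B = X_t−d, and C = Y_t−1, and uses equal-width bins. The function names and the toy data are hypothetical, and the separate treatment of zero values (Gong et al., 2014) is not included, so this is a simplified illustration rather than the exact procedure of the study.

```python
import math
import random
from collections import Counter


def discretize(series, bins=11):
    """Map values to equal-width bin indices 0..bins-1."""
    lo, hi = min(series), max(series)
    width = (hi - lo) / bins or 1.0  # avoid zero width for constant series
    return [min(int((v - lo) / width), bins - 1) for v in series]


def entropy(symbols):
    """Plug-in Shannon entropy (bits) of a sequence of hashable symbols."""
    n = len(symbols)
    return -sum(c / n * math.log2(c / n) for c in Counter(symbols).values())


def transfer_entropy(x, y, d=1, bins=11):
    """TE from X to Y: I(Y_t; X_{t-d} | Y_{t-1}) from binned data."""
    xb, yb = discretize(x, bins), discretize(y, bins)
    triples = [(yb[t], xb[t - d], yb[t - 1]) for t in range(max(d, 1), len(yb))]
    h_yt_yp = entropy([(a, c) for a, _, c in triples])
    h_xp_yp = entropy([(b, c) for _, b, c in triples])
    h_yp = entropy([c for _, _, c in triples])
    return h_yt_yp + h_xp_yp - h_yp - entropy(triples)


def te_significance(x, y, d=1, bins=11, n_surr=500, q=0.95, seed=0):
    """Compare TE with the q-quantile of TE values from shuffled copies of X."""
    rng = random.Random(seed)
    te = transfer_entropy(x, y, d, bins)
    null = []
    for _ in range(n_surr):
        shuffled = list(x)
        rng.shuffle(shuffled)  # breaks the temporal link between X and Y
        null.append(transfer_entropy(shuffled, y, d, bins))
    null.sort()
    threshold = null[min(int(q * n_surr), n_surr - 1)]
    return te, threshold, te > threshold


# Toy data (hypothetical): Y follows X with a one-step delay.
rng = random.Random(1)
x = [rng.gauss(0.0, 1.0) for _ in range(1000)]
y = [0.0] + [x[t - 1] + 0.1 * rng.gauss(0.0, 1.0) for t in range(1, 1000)]
print(te_significance(x, y, d=1, n_surr=100))
```

Plug-in entropy estimates are biased for short series with many bins, which is one reason a surrogate-based threshold is preferable to comparing TE with zero.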

2.4 Causal network learning algorithms

Causal network learning algorithms, based on a set of graphical rules that govern the identification of causal associations in a system, have been developed for reconstructing large-scale causal graphs with many variables (Runge et al., 2019b). To infer the structure of the causal graph, three underlying assumptions are usually required, namely causal sufficiency, the causal Markov condition, and causal faithfulness (Runge, 2018). Causal sufficiency assumes that no unobserved (latent) variable directly or indirectly influences any pair of variables in the set (i.e., all relevant variables are directly observed). The other two assumptions allow d-separation in the graph to be related to conditional independence in the distribution: X d-sep Y|Z ⟺ X⊥Y|Z, where the causal Markov and causal faithfulness assumptions correspond to the forward and backward implications, respectively. Two nodes are d-separated given Z if and only if all paths between them are blocked given Z (Pearl, 2009). The causal faithfulness assumption excludes the case where X affects Y through two pathways whose positive and negative effects cancel out.

In this study, we adopted the latest causal network learning algorithm, namely Peter–Clark momentary conditional independence plus (PCMCI+), which addresses the detection of both contemporaneous and time-lagged causal links in autocorrelated, high-dimensional time series (Runge, 2020). The algorithm can be divided into two stages, namely the skeleton discovery phase and the orientation phase (Fig. 1d). In the skeleton discovery phase, the Peter–Clark (PC) algorithm begins with a fully connected graph and then iteratively tests whether links between variables can be removed, using conditioning sets of increasing cardinality (Spirtes and Glymour, 1991). Specifically, it starts with a graph where all unconditionally (p = 0) dependent variable pairs are connected, under the assumption that causal links are stationary, and iteratively tests conditional independence with an increasing number of conditions p. Lagged links are oriented forward in time based on the principle of the priority of the cause, whereas contemporaneous links are left undirected (circle marks at the ends) in this skeleton discovery phase. For instance, X_t−1 and Z_t (black nodes) are already correctly identified as independent in the second iteration step (p = 1), where the dependence through Y_t−1 (blue box) is conditioned out, whereas two variables must be conditioned on to test whether Z_t−2 and W_t are independent (p = 2). To further eliminate spurious links for all ordered pairs and increase the detection power, the contemporaneous conditions are iterated over using momentary conditional independence (MCI) tests with partial correlation:

$MCI:X_{t - d}^{i} \bot X_{t}^{j}|\widehat{P}(X_{t}^{j})\backslash \{ X_{t - d}^{i}\} ,\widehat{P}(X_{t - d}^{i})$ (11)

where $\widehat{P}\left({X}_{t}^{j}\right)$ and $\widehat{P}\left({X}_{t-d}^{i}\right)$ represent the lagged parents of ${X}_{t}^{j}$ and ${X}_{t-d}^{i}$ identified in the previous PC step, respectively; ‘\’ means excluding ${X}_{t-d}^{i}$ from $\widehat{P}\left({X}_{t}^{j}\right)$. After iterating through the subsets S $\subset$ X_t of contemporaneous links, the spurious adjacencies can be fully removed. The MCI partial correlation values were used to represent the strength of the causal links, with the significance level α_pc set to 0.05 in the Student’s t-test.
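The role of conditioning in these tests can be illustrated with a single partial correlation. The sketch below is not the PCMCI+ algorithm; it only shows, for a hypothetical chain X → Y → Z with one-step delays, that X_t−2 and Z_t are clearly correlated while their partial correlation given the mediator Y_t−1 is close to zero. The function names, the chain coefficients, and the normal approximation used for the p-value are assumptions of this sketch.

```python
import math
import random


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    ma, mb = sum(a) / n, sum(b) / n
    num = sum((u - ma) * (v - mb) for u, v in zip(a, b))
    den = math.sqrt(sum((u - ma) ** 2 for u in a) * sum((v - mb) ** 2 for v in b))
    return num / den


def partial_corr(x, z, y):
    """Partial correlation of x and z given one conditioning variable y."""
    r_xz, r_xy, r_zy = pearson(x, z), pearson(x, y), pearson(z, y)
    return (r_xz - r_xy * r_zy) / math.sqrt((1 - r_xy ** 2) * (1 - r_zy ** 2))


def is_dependent(r, n, n_cond=0, alpha=0.05):
    """Test r != 0 with n samples and n_cond conditioning variables."""
    t_stat = r * math.sqrt((n - 2 - n_cond) / max(1.0 - r ** 2, 1e-12))
    # Assumption: normal approximation to the t distribution (large n).
    return math.erfc(abs(t_stat) / math.sqrt(2.0)) < alpha


# Hypothetical chain with one-step delays: X -> Y -> Z.
rng = random.Random(0)
n = 2000
x = [rng.gauss(0.0, 1.0) for _ in range(n)]
y = [0.0] + [0.8 * x[t - 1] + rng.gauss(0.0, 1.0) for t in range(1, n)]
z = [0.0] + [0.8 * y[t - 1] + rng.gauss(0.0, 1.0) for t in range(1, n)]

x_lag2, y_lag1, z_now = x[:-2], y[1:-1], z[2:]  # X_{t-2}, Y_{t-1}, Z_t
r_marginal = pearson(x_lag2, z_now)
r_partial = partial_corr(x_lag2, z_now, y_lag1)
print("marginal:", round(r_marginal, 3), is_dependent(r_marginal, len(z_now)))
print("given Y_{t-1}:", round(r_partial, 3), is_dependent(r_partial, len(z_now), 1))
```

A bivariate method sees only the marginal correlation and would report the indirect link, whereas a conditional test of this kind can remove it once the mediator is included in the conditioning set.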

In the second phase, the orientation phase, the remaining contemporaneous links are directed by a series of rules. For example, when W_t−1 and Z_t are found to be independent conditional on Z_t−1 but not independent conditional on W_t, the causal link from Z_t to W_t can be identified, since the other causal directions are not in accordance with the observed conditional independencies. However, such rules may not exist to distinguish the direction between X_t and Y_t because of the Markov equivalence class (Pearl, 2009). More details about the PCMCI+ algorithm, along with its pseudocode, can be found in Runge (2020). To the best of our knowledge, the PCMCI+ algorithm has not previously been applied to hydrological systems, and this study provides the first evaluation of its potential for further applications.

3.1 Construction of conceptual hydrological model

To evaluate the ability of the above four methods to identify causal associations in hydrological systems, a conceptual hydrological model was first employed to generate synthetic time series, for which the underlying causal associations are known by construction and can be treated as the true relationships. The EXP-HYDRO model (Patil and Stieglitz, 2014), which characterizes the complexity of hydrological processes while keeping the number of variables and parameters as small as possible to ensure the interpretability of the constructed causal network, was chosen as our basic model framework. We simplified and split the original EXP-HYDRO model into two structures, namely the rainfall-runoff structure (Model S1, Fig. 2a) and the snowmelt-runoff structure (Model S2, Fig. 2b).

Model S1 includes four variables: effective precipitation P, soil water storage S, interflow I, and runoff Q. Besides, the model contains three parameters: the maximum soil water storage S_max, and nonlinear storage-discharge parameters K_s, and γ. The model structure and corresponding causal network are shown at the top of Fig. 2a. The only forcing variable P was generated by a stochastic weather generator (WeaGETS; Chen et al., 2010), including a Markov chain for occurrence and a gamma distribution for quantity. Specifically, the occurrence of precipitation was generated with a binary variable $\widehat {P}$ (i.e., wet or dry days) using the first-order Markov chain (Richardson, 1981):

${\widetilde {p}_{i,j}}(t)=probability({\widehat {P}_t}=j|{\widehat {P}_{t - 1}}=i){\text{ ; }}i,j=0,{\text{ }}1{\text{ ; }}t>1$ (12)

where ${\widetilde {p}_{i,j}}$ is the transition probability from state i to state j; the states 1 and 0 represent wet and dry days, respectively. Since precipitation either occurs or does not occur on a given day, ${\widetilde {p}_{0,1}}+{\widetilde {p}_{0,0}}={\widetilde {p}_{1,0}}+{\widetilde {p}_{1,1}}=1$. In this study, we set ${\widetilde {p}_{0,0}}={\widetilde {p}_{1,1}}=0.8$ and ${\widetilde {p}_{0,1}}={\widetilde {p}_{1,0}}=0.2$. The values of ${\widetilde {p}_{0,0}}$ and ${\widetilde {p}_{1,1}}$ were set relatively high to increase the persistence in the model, which makes the detection of causal associations more difficult. Next, the quantity of precipitation was generated by a Gamma distribution: $\widehat {P}\sim Gamma(\alpha ,\beta)$. Generally, for precipitation data, the parameter α is smaller than 1 while β has a wide range of possible values (Geng et al., 1986). In this study, α and β were set to 0.6 and 6, respectively. In addition, adding noise to the forcing variable brings the synthetic data closer to real conditions and tests the robustness of the causal inference methods to interference. Here, ${P_t}={\widehat {P}_t}+{\eta _p}$, where ${\eta _p}$ is white Gaussian noise (zero mean and variance $\sigma _{P}^{2}$). The variance $\sigma _{P}^{2}$ is calculated as the expected value of P² divided by the signal-to-noise ratio (SNR), $\sigma _{P}^{2}=E[{P^2}]/SNR$, and the SNR can be expressed in decibels as $SNR(dB)=10{\log _{10}}SNR$.
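The forcing described in this paragraph can be sketched as a two-state Markov chain for occurrence, a Gamma draw for wet-day amounts, and additive Gaussian noise scaled by the SNR. This is a minimal sketch and not the WeaGETS implementation: the function name `generate_precip` is hypothetical, β is interpreted as the scale parameter of the Gamma distribution, the chain starts in the dry state, and E[P²] is estimated from the noise-free series; all of these are assumptions.

```python
import math
import random


def generate_precip(n, p00=0.8, p11=0.8, alpha=0.6, beta=6.0, snr_db=40.0, seed=0):
    """Return (noise-free series, noisy series) of synthetic daily precipitation."""
    rng = random.Random(seed)
    wet = 0  # assumption: the first day is preceded by a dry state
    clean = []
    for _ in range(n):
        stay = p11 if wet else p00  # probability of remaining in the current state
        if rng.random() >= stay:
            wet = 1 - wet
        # random.gammavariate(alpha, beta) uses beta as the scale parameter.
        clean.append(rng.gammavariate(alpha, beta) if wet else 0.0)
    snr = 10.0 ** (snr_db / 10.0)  # convert dB to a power ratio
    sigma = math.sqrt(sum(v * v for v in clean) / n / snr)
    noisy = [v + rng.gauss(0.0, sigma) for v in clean]
    return clean, noisy


clean, noisy = generate_precip(2000, snr_db=40.0)
print("wet-day fraction:", sum(v > 0 for v in clean) / len(clean))
```

For example, SNR(dB) = 40 corresponds to SNR = 10⁴, so the noise variance is one ten-thousandth of the mean squared precipitation. Note that the added noise can make individual values slightly negative.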

The other variables S, I, and Q were determined by the equations of water balance and nonlinear storage-discharge relationship, as:

$$\frac{{d{S_t}}}{{dt}}={P_{t - 1}} - {I_{t - 1}}$$

$${I_t}={K_s} \times S_{{t - 1}}^{\gamma }$$

$${Q_t}=\begin{cases} {S_{t - 1}}+{P_{t - 1}} - {S_{\max }}, & {S_t} \geqslant {S_{\max }} \\ 0, & {S_t}<{S_{\max }} \end{cases}$$

where K_s represents the speed of reduction in water storage, and γ is an exponent for S.
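The three equations of Model S1 can be stepped forward in time as in the following minimal sketch. It reads the storage equation as a unit-time-step update, S_t = S_t−1 + P_t−1 − I_t−1, which is an assumption about the discretization. The zero initial conditions and the clamping of negative storage inside the power law (needed because a noisy forcing can be slightly negative) are also assumptions, and the function name `simulate_s1` is hypothetical.

```python
def simulate_s1(p, s_max=20.0, k_s=0.01, gamma=1.5, s0=0.0):
    """Simulate storage S, interflow I, and runoff Q of Model S1 from forcing p."""
    n = len(p)
    s = [s0] + [0.0] * (n - 1)
    i = [0.0] * n  # assumption: no interflow at the first step
    q = [0.0] * n
    for t in range(1, n):
        # Nonlinear storage-discharge relation, driven by the previous storage.
        i[t] = k_s * max(s[t - 1], 0.0) ** gamma
        # Water balance with a unit time step.
        s[t] = s[t - 1] + p[t - 1] - i[t - 1]
        # Runoff is produced only when storage reaches the maximum.
        if s[t] >= s_max:
            q[t] = s[t - 1] + p[t - 1] - s_max
    return s, i, q


# Usage with a constant forcing (hypothetical input).
s, i, q = simulate_s1([2.0] * 200)
print(round(s[-1], 2), round(i[-1], 2), round(q[-1], 2))
```

Written this way, each variable depends only on lagged values of its parents (P and I for S, S for I, S and P for Q), which is what makes the causal structure of the synthetic data known in advance.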

Model S2 includes four variables: temperature T, snowmelt M, soil water storage S, and interflow I, and three parameters: the nonlinear storage-discharge parameters K_s, and γ, and snow-melting parameter K_melt. The model structure and corresponding causal network are shown at the top of Fig. 2b. The only forcing variable T was generated by a Normal distribution with a mean of zero. We also added some noise to this forcing variable: ${T_t}={\widehat {T}_t}+{\eta _T}$, where ${\eta _T}$is the white Gaussian noise (zero mean and $\sigma _{T}^{2}$variance). The calculation of $\sigma _{T}^{2}$ is similar to $\sigma _{P}^{2}$.

The other variables S, I, and M were determined by the equations of water balance and nonlinear storage-discharge relationship, as:

$${I_t}={K_s} \times S_{{t - 1}}^{\gamma }$$

$$\frac{{d{S_t}}}{{dt}}={M_{t - 1}} - {I_{t - 1}}$$

$${M_t}=\begin{cases} {K_{melt}} \times {T_{t - 1}}, & {T_{t - 1}} \geqslant 0 \\ 0, & {T_{t - 1}}<0 \end{cases}$$

The parameters of the conceptual hydrological model are listed in Table 2, and the corresponding frequency distributions of the generated hydrological variables are shown at the bottom of Fig. 2. The frequency distributions are estimated with a sample size of 2000 and an SNR(dB) of 40. Since most hydrological series follow a skewed distribution and contain a considerable number of zero values (Gong et al., 2014), especially the forcing variables rainfall and snow, the synthetic series match these properties and can represent real hydrological systems to some extent.

In addition, for each model structure, 100 datasets were generated to ensure the robustness of the assessment. In the sensitivity evaluation of the causal methods, the sample size varies from 100 to 2000, the SNR(dB) from 2 to 40, and the data missing rate from 10% to 60% under two scenarios, namely synchronous and asynchronous sparsity. In the former scenario, all variables are missing at the same randomly generated time indices, while in the latter, the missing time indices differ among variables. The missing values are filled with the arithmetic mean of the observed values of the same variable (Gao et al., 2018).
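The two sparsity scenarios and the mean filling can be sketched as below. The function name `mask_and_impute` and the dictionary-based data layout are hypothetical choices for illustration, and the missing rate is assumed to be below 1 so that each variable keeps some observed values.

```python
import random


def mask_and_impute(data, rate, synchronous, seed=0):
    """Remove a fraction of values per variable and fill gaps with the observed mean.

    data maps variable names to equal-length lists. Returns the filled copy and
    the set of missing time indices of each variable.
    """
    rng = random.Random(seed)
    n = len(next(iter(data.values())))
    n_miss = int(round(rate * n))
    shared = set(rng.sample(range(n), n_miss)) if synchronous else None
    filled, masks = {}, {}
    for name, series in data.items():
        # Synchronous: same indices for all variables; asynchronous: drawn per variable.
        miss = shared if synchronous else set(rng.sample(range(n), n_miss))
        observed = [v for t, v in enumerate(series) if t not in miss]
        mean = sum(observed) / len(observed)
        filled[name] = [mean if t in miss else v for t, v in enumerate(series)]
        masks[name] = miss
    return filled, masks


# Usage with two short hypothetical series.
data = {"P": [float(v) for v in range(20)], "Q": [float(v % 5) for v in range(20)]}
filled, masks = mask_and_impute(data, rate=0.3, synchronous=False)
print(sorted(masks["P"]), sorted(masks["Q"]))
```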

Table 2

The parameters of the conceptual hydrological model.
	Parameter	Value
Model S1	maximum soil water storage (S_max)	20
	storage-discharge parameter 1 (K_s)	0.01
	storage-discharge parameter 2 (γ)	1.5
Model S2	snow-melting parameter (K_melt)	2
	storage-discharge parameter 1 (K_s)	0.5
	storage-discharge parameter 2 (γ)	0.6

In this study, three metrics, namely the True Positive Rate (TPR), False Positive Rate (FPR), and Accuracy Rate (AR), were adopted to evaluate the performance of the causal inference methods:

$$TPR=\frac{{TP}}{{TP+FN}}$$

$$FPR=\frac{{FP}}{{FP+TN}}$$

$$AR=\frac{{TP+TN}}{{TP+FP+TN+FN}}$$

where TP and FN denote true positives and false negatives, i.e., causal links present in the model that are correctly detected and missed, respectively; TN and FP denote true negatives and false positives, i.e., links absent from the model that are correctly rejected and incorrectly detected, respectively. Lower FPR values and higher TPR and AR values indicate better performance of the tested causal method.
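These three metrics can be computed directly from sets of directed links, as in the minimal sketch below. The function name `evaluate_links` is hypothetical, and counting every ordered pair of distinct variables as one candidate link (ignoring lags) is an assumption of this sketch. In the usage example, the true links are read off the Model S1 equations, while the detected set is a made-up example rather than a result of the study.

```python
from itertools import permutations


def evaluate_links(true_links, found_links, variables):
    """Return TPR, FPR, and AR for directed links given as (cause, effect) pairs."""
    candidates = set(permutations(variables, 2))  # all ordered pairs of variables
    true_links, found_links = set(true_links), set(found_links)
    tp = len(true_links & found_links)   # true links that were detected
    fn = len(true_links - found_links)   # true links that were missed
    fp = len(found_links - true_links)   # detected links that do not exist
    tn = len(candidates - true_links - found_links)  # absent links left undetected
    return {
        "TPR": tp / (tp + fn),
        "FPR": fp / (fp + tn),
        "AR": (tp + tn) / (tp + fp + tn + fn),
    }


true_s1 = {("P", "S"), ("P", "Q"), ("S", "Q"), ("S", "I"), ("I", "S")}
detected = {("P", "Q"), ("P", "S"), ("S", "I"), ("P", "I"), ("Q", "S")}  # hypothetical
print(evaluate_links(true_s1, detected, ["P", "S", "I", "Q"]))
```

In this example three of the five true links are detected and two of the seven absent links are reported, giving TPR = 0.6, FPR = 2/7, and AR = 8/12.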

3.2 Performance in large sample tests

In this section, 100 datasets were generated with a sample length of 2000 to evaluate the performance of different causal methods. Figure 3 presents the causal structures of the hydrological model identified by the four causal methods. The results represent the average behavior of the experiment, i.e., a causal link is reported only if it is identified in more than half of the 100 simulations. The detection of the true causal structures in Models S1 and S2 is mainly disturbed by confounding and mediating variables, respectively, which are typical challenges for causal inference. In Model S1, all methods correctly identified the causal links P → Q, P → S, and S → I. The CCF method incorrectly identified the links Q → S and Q → I due to the influence of the confounding variables P (simultaneously affecting S and Q) and S (simultaneously affecting Q and I), respectively. Besides, the indirect link P → I results from the mediating variable S. The incorrect links produced by the other causal methods (CCM, TE, and PCMCI+) arise for similar reasons. As bivariate unidirectional causal methods, the CCF and TE methods miss the link I → S because the opposite direction S → I has a more significant causal effect. It should be noted that although the CCM method successfully identified all of the predefined causal links, it also generated many new false connections, i.e., Q → P, Q → S, S → P, and I → P, under the influence of confounding and mediating variables. One reasonable explanation is that the synthetic system is strongly coupled, such that the CCM method easily mistakes unidirectional causal relationships for bidirectional ones. In Model S2, PCMCI+ perfectly recovers all predefined causal links, while the other three methods (CCF, CCM, and TE) incorrectly identified many indirect causal links due to the transitivity of causal relationships. Similar to the results for Model S1, the CCF and TE methods miss the causal link I → S, and CCM again mistakes all unidirectional links for bidirectional ones.
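The "more than half of the simulations" rule used to summarize the 100 runs can be sketched as a simple majority vote over per-run adjacency matrices. The function name `majority_vote` and the toy stack of four runs below are hypothetical and only illustrate the aggregation step.

```python
import numpy as np

def majority_vote(adj_stack, threshold=0.5):
    """Keep a link only if it is detected in more than `threshold` of the runs.

    adj_stack has shape (n_runs, n_vars, n_vars) with 1 where a link was found.
    """
    adj_stack = np.asarray(adj_stack, dtype=bool)
    detection_rate = adj_stack.mean(axis=0)  # fraction of runs detecting each link
    return detection_rate > threshold, detection_rate

# Toy stack of 4 runs for a 2-variable system (hypothetical values).
runs = np.array([
    [[0, 1], [0, 0]],
    [[0, 1], [1, 0]],
    [[0, 1], [0, 0]],
    [[0, 0], [1, 0]],
])
consensus, rate = majority_vote(runs)
print(consensus)
print(rate)
```

A link detected in exactly half of the runs is dropped here because the comparison is strict, which matches the "more than half" wording.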

Table 3 further lists the evaluation results of the causal tests in the synthetic case. The PCMCI+ method shows the best overall performance in both model structures, with TPR, AR, and FPR values of 0.97 (1.00), 0.85 (0.96), and 0.23 (0.05) in Model S1 (S2). The CCF and TE methods show relatively lower TPR and AR values and relatively higher FPR values. In contrast, the CCM method shows the lowest AR (lower than 0.5) and the highest TPR and FPR (higher than 0.9). Besides, when direct and indirect causality are not distinguished, that is, when the indirect links identified by the methods are not counted as wrong links (the values in parentheses in Table 3), the performance of the CCF, CCM, and TE methods improves significantly, especially in Model S2. This indicates that the three methods are strongly affected by mediating variables and that the causal links they identify are likely to contain indirect effects, which might hinder our understanding of the real mechanisms in hydrological systems. It should be noted that the TPR values do not change in such situations because the true positives (TP, i.e., the causal links generated by the hydrological model that are correctly identified) remain the same. In addition, owing to the differing complexity of the two model structures, the overall performance of all causal methods is better in Model S2 than in Model S1, except for the CCM method, which shows extremely high TPR and FPR simultaneously because it mistakes unidirectional links for bidirectional ones. Therefore, in the following real case studies, the identified causal links were revised to unidirectional links, that is, the direction with the significant maximum causal effect was regarded as the only causal direction, so as to avoid misidentification.
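The revision to unidirectional links described above can be sketched as follows, assuming each method provides a matrix of causal strengths and a matrix of significance flags. The function name `keep_stronger_direction`, the tie-breaking rule, and the example values are illustrative assumptions rather than the study's implementation.

```python
import numpy as np

def keep_stronger_direction(strength, significant):
    """For each variable pair linked in both directions, keep the stronger link.

    strength[i, j] is the causal effect of i on j; significant[i, j] flags
    whether that link passed the significance test. Ties keep i -> j with
    i < j (an arbitrary choice made for this sketch).
    """
    strength = np.asarray(strength, dtype=float)
    out = np.asarray(significant, dtype=bool).copy()
    n = out.shape[0]
    for i in range(n):
        for j in range(i + 1, n):
            if out[i, j] and out[j, i]:
                if abs(strength[i, j]) >= abs(strength[j, i]):
                    out[j, i] = False
                else:
                    out[i, j] = False
    return out

# Hypothetical strengths for three variables.
strength = [[0.0, 0.8, 0.3],
            [0.5, 0.0, 0.0],
            [0.0, 0.2, 0.0]]
significant = [[0, 1, 1],
               [1, 0, 0],
               [0, 1, 0]]
revised = keep_stronger_direction(strength, significant)
print(revised.astype(int))
```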

Table 3

Evaluation results of the causal tests in the synthetic case.
Model	Metric	CCF	CCM	TE	PCMCI+
Model S1	TPR	0.60	1.00	0.60	0.97
Model S1	AR	0.58 (0.60)	0.42 (0.50)	0.66 (0.69)	0.85 (0.87)
Model S1	FPR	0.43 (0.40)	1.00 (1.00)	0.30 (0.22)	0.23 (0.22)
Model S2	TPR	0.75	1.00	0.75	1.00
Model S2	AR	0.67 (0.89)	0.36 (0.49)	0.67 (0.89)	0.96 (1.00)
Model S2	FPR	0.38 (0.00)	0.95 (0.93)	0.38 (0.00)	0.05 (0.00)
Note: The values in parentheses represent situations where direct and indirect causality are not distinguished, i.e., the indirect links identified by the methods are not considered as the wrong links. The TPR values are not changed in such situations.

3.3 Performance in sensitivity tests

In this section, the sensitivity of each method to sample length, noise, and missing data was analyzed. Figure 4 presents the impact of sample length, with sizes of 100, 300, 500, 1000, and 2000. One hundred datasets were generated to ensure the robustness of the results. As shown in Figs. 4a and b, in both Models S1 and S2, the TPR of the CCF and CCM methods is not sensitive to variations in sample length, whereas the TPR of the TE and PCMCI+ methods increases with increasing sample length. For the CCF method, which is based on linear lag correlation, 100 samples are enough to detect all the causal links. This number is also sufficient for the CCM method, which is based on deterministic dynamical system theory. In contrast, TE is based on a probability density framework, which needs to estimate the probability density function from the histogram of the frequency distribution and to implement statistical hypothesis testing for conditional independence, thus requiring a sufficient sample length. Similarly, the PCMCI+ method requires samples of sufficient length in its conditional independence tests to keep iterating and removing the initialized redundant connections. To achieve relative stability, the TE and PCMCI+ methods need at least 1000 and 500 samples for Models S1 and S2, respectively. This difference results from the complexity of the model structure.

As for the Accuracy Rate (AR), the values for each causal method fluctuate or even decrease with increasing sample length, especially for the TE and CCM methods (Figs. 4c and d). This is attributed to the limitations of the causal methods in dealing with the impacts of confounding and mediating variables, which become more striking as the sample length increases and lead to a significant increase in the false positive rate. Additionally, the dashed lines in Fig. 4 represent the situations where direct and indirect causality are not distinguished. The difference in trends between the solid and dashed lines can be further analyzed to determine whether variations in sample length affect the identification of indirect causal links. The CCF, CCM, and PCMCI+ methods show a similar trend between the solid and dashed lines in both model structures, while for the TE method, the variation rate is slightly inconsistent or even reversed, due to the increasing detection rate of the indirect links P → I in Model S1, and T → S, T → Q, and M → Q in Model S2, as the sample length increases. Besides, when direct and indirect causality are not distinguished, the improvement in the performance of the causal methods is more pronounced in Model S2 than in Model S1, which is consistent with the setup of the model structures, i.e., the former is mainly controlled by indirect causality.

Figure 5 presents the impact of noise with SNR (dB) values of 2, 3, 5, 10, 20, 30, and 40. The sample length was fixed at 2000, and 100 datasets were generated to ensure the robustness of the results. As shown in Fig. 5a, in Model S1, increasing the noise level (i.e., decreasing the SNR) has little impact on the TPR values for the CCF and CCM methods. In contrast, for the TE method, the TPR increases with the increasing noise level, especially when the SNR (dB) is below 20. This is attributed to the TE method's assumption of a nonlinear stochastic system, in which an appropriate amount of noise helps to identify causal relationships. For PCMCI+, the TPR decreases with decreasing SNR (dB) and shows a slight increase when the SNR (dB) is lower than 10. However, in Model S2, the noise has little impact on the TPR values for all causal methods (Fig. 5b), owing to the insensitivity of this model structure to input noise. As for the Accuracy Rate (AR), the values for CCF and TE are relatively stable under variations of the noise level in both model structures (Figs. 5c and d), while for the PCMCI+ method, the AR decreases with decreasing SNR in Model S1, and for the CCM method, the AR increases as the SNR (dB) drops below 10 in both model structures. Additionally, all causal methods show a similar trend between the solid and dashed lines in both model structures, indicating that the noise does not have a significant impact on the identification of indirect causal links.
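For readers who wish to reproduce a noise test of this kind, the sketch below shows one common way to contaminate a series at a prescribed SNR in decibels. It assumes additive white Gaussian noise and takes the signal variance as the signal power; the study's exact noise model may differ, and the function name `add_noise` and the sine-wave test signal are hypothetical.

```python
import numpy as np

def add_noise(signal, snr_db, rng):
    """Add white Gaussian noise so that 10*log10(P_signal / P_noise) = snr_db."""
    signal = np.asarray(signal, dtype=float)
    signal_power = np.mean((signal - signal.mean()) ** 2)  # variance as power (assumption)
    noise_power = signal_power / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(noise_power), size=signal.shape)
    return signal + noise

rng = np.random.default_rng(0)
clean = np.sin(np.linspace(0.0, 40.0 * np.pi, 2000))  # stand-in for a model output
noisy = add_noise(clean, snr_db=10, rng=rng)

# Empirical check of the achieved SNR; it should be close to the requested 10 dB.
achieved = 10.0 * np.log10(np.var(clean) / np.var(noisy - clean))
print(round(achieved, 2))
```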

Figure 6 presents the impact of missing data on the performance of each causal method, with the sparsity rate increasing from 10% to 60%. Two scenarios of missing data, namely synchronous and asynchronous sparsity, which may occur in real hydrologic data due to equipment errors and defective storage technologies, were constructed with 100 tests. To ensure the computability of all causal methods, the missing values were filled with the arithmetic mean of the observed values of the same variable. As shown in Figs. 6a and b, with increasing synchronous missing rates, the TPR values remain relatively stable for the CCF method and begin to decline at a critical missing rate for the other three methods, especially for CCM and PCMCI+. In comparison with the synchronous sparsity, the TPR values remain relatively stable for the PCMCI+ method in the scenario of asynchronous sparsity, while for the CCM method, the TPR values decline significantly beyond a missing rate of 30% (Figs. 6e and f). One reasonable explanation is that the CCM method requires state space reconstruction from continuous time series data, and missing data can destroy much of the effective information in the state space, especially in the asynchronous scenario, thus disturbing the detection of causal relationships.
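The two sparsity scenarios and the mean-filling step can be sketched as below. The snippet assumes that synchronous sparsity removes the same randomly chosen time steps from every variable, while asynchronous sparsity removes independently chosen time steps per variable; this reading, the function name `mask_and_fill`, and the random test data are assumptions for illustration.

```python
import numpy as np

def mask_and_fill(data, rate, synchronous, rng):
    """Drop a fraction `rate` of time steps and fill gaps with the observed mean.

    data has shape (n_time, n_vars). Returns the filled copy and the missing mask.
    """
    data = np.asarray(data, dtype=float)
    n_time, n_vars = data.shape
    n_missing = int(round(rate * n_time))
    mask = np.zeros((n_time, n_vars), dtype=bool)
    if synchronous:
        # Same time steps are missing for all variables.
        idx = rng.choice(n_time, size=n_missing, replace=False)
        mask[idx, :] = True
    else:
        # Each variable loses its own independently drawn time steps.
        for v in range(n_vars):
            idx = rng.choice(n_time, size=n_missing, replace=False)
            mask[idx, v] = True
    filled = data.copy()
    for v in range(n_vars):
        observed_mean = data[~mask[:, v], v].mean()  # mean of remaining observations
        filled[mask[:, v], v] = observed_mean
    return filled, mask

rng = np.random.default_rng(1)
series = rng.normal(size=(1000, 3))  # hypothetical three-variable dataset
sync_filled, sync_mask = mask_and_fill(series, 0.3, True, rng)
async_filled, async_mask = mask_and_fill(series, 0.3, False, rng)
print(sync_mask.sum(axis=0), async_mask.sum(axis=0))
```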

As for the Accuracy Rate (AR), with the increasing missing rate, the values in the CCF method remain relatively stable, while in the other three methods (CCM, TE, and PCMCI+), the values show fluctuation (Figs. 6c, d, f and g). It should be noted that for the CCM method, the AR values increase substantially, especially over the critical missing value of 30%, which seems implausible. One reasonable explanation is that as the missing rate increases, effective information of variables in their state space is gradually lost, and causally linked variables in the system may no longer maintain the information signature of each other; therefore the efficiency of cross-mapping decreases. The detection rate of both correct and incorrect causal links declines simultaneously, and the latter is more pronounced. In addition, the CCF and PCMCI + methods show a similar trend between the solid and dashed lines, while for the TE method, the variation rate is slightly inconsistent or even the reverse over the 50% point of missing rate due to the decreasing detection rate of indirect links P → I in Model S1, and T → S and T → Q, M → Q in Model S2, with the increasing missing rate. For the CCM method, the variation rate is extremely inconsistent over the 30% point of the missing rate due to the declining detection rate of all indirect links (P → I, I → Q, T → S, T → Q, M → Q) simultaneously.

The results of the sensitivity tests can be summarized as follows. Regarding sample length, the CCF and CCM methods show relatively stable performance, while the TE and PCMCI+ methods require at least 500 samples to achieve relative stability. Regarding noise, the performance of the causal methods is affected by the model structure, and the TE and PCMCI+ methods are less stable for Model S1. Regarding missing data, all causal methods show relatively stable performance for missing rates up to 30%, and the performance of the CCM method deteriorates rapidly beyond 30%.

In this section, two real study cases are presented to further evaluate the applicability of different causal inference methods in complex hydrological systems.

4.1 Application to real case 1

4.1.1 Study area and data

The Shale Hills Catchment (SHC; Fig. 7(a)), located in central Pennsylvania, USA, is a small (0.08 km²), V-shaped, forested headwater catchment with comparatively steep slopes and narrow ridges (Guo et al., 2014). The surface elevation ranges from 256 to 310 m, with relatively homogeneous land cover and lithology, and relatively heterogeneous soil thickness and organic matter content (Lin, 2006). The catchment belongs to a temperate continental climate region, and the average air temperature varies from −3°C (January) to 22°C (July). The annual average precipitation is around 980 mm, and the monthly distribution is relatively uniform, with a small maximum in summer, when rainfall is usually characterized by high intensity and short duration (Jiang et al., 2023). The first-order stream of the catchment converges to the Juniata River and is usually dry during the summer months (Liu et al., 2020). Besides, given that snowfall mainly occurs from November to April, this period was set as the snow-cover period in this study, while the rest of the water year (May to October) was designated as the snow-free period, so as to explore different causal mechanisms during wet and dry conditions.

The hydrological series, including discharge (Q), precipitation (P), groundwater level (GW), soil moisture (SM), and snow depth (SD), were obtained from the Critical Zone Observatory Data Site (http://www.czo.psu.edu/data_time_series.html). The dataset is a reanalysis result from the Flux-PIHM model (Shi et al., 2013), and has been subject to strict quality control. The temporal resolution is hourly with the period of 2009/11/1 ~ 2010/10/31. The min-max normalized values of the time series in snow-free (May to October) and snow-cover (November to April) periods are shown in Figs. 7(b) and (c), respectively.

4.1.2 Results

Figure 8 presents the causal structures identified by the different causal inference methods for the Shale Hills Catchment (SHC) during the snow-free period. Generally, all methods can identify the main causal relationships P → SM, P → GW, and P → Q, which represent the basic hydrological processes from precipitation to soil water, groundwater, and streamflow, respectively, but they differ in other relationships. For the link between GW and Q, the CCF method presents a forward direction, while the CCM and TE methods present a backward direction. Only the PCMCI+ method shows a bidirectional relationship, which contributes to the understanding of potential interaction processes between groundwater and streamflow. For the link between SM and GW, both the CCF and CCM methods present a lagged forward link, while the PCMCI+ method shows a contemporaneous backward link. Besides, the CCF and CCM methods present a forward link between SM and Q, which is omitted by the other two methods. In terms of causal strength, all methods support a relatively weak association for P → Q and P → GW, while the CCF and CCM methods tend to favor a stronger causal interaction between GW and Q, and the TE and PCMCI+ methods tend to favor a stronger link P → SM.

Figure 9 presents the causal structures identified by the different causal inference methods for the SHC during the snow-cover period. The snow depth (SD) variable is added to this hydrological system. Similar to the results for the snow-free period, all methods can detect the main causal relationships P → Q and P → GW, and differ in the other relationships between GW and Q, SM and Q, and SM and GW. However, the limitations of the causal methods are exposed in this more complex hydrological system. The unreasonable causal links GW → SD and Q → SD identified by the CCF and CCM methods might be attributed to their shortcomings in dealing with confounding variables. The causal link SD → SM identified by the CCF, CCM, and PCMCI+ methods is consistent with the snowmelt process. However, in contrast to the negative causal strength in the PCMCI+ method, the CCF method shows a counterintuitive positive causal strength, which might be attributed to the influence of the confounding variable P (simultaneously affecting SM and SD). In comparison with the snow-free period, all causal methods support a weaker association between P and SM and a stronger association between P and GW during the snow-cover period, except for the TE method, which does not detect this causal link.

4.2 Application to real case 2

4.2.1 Study area and data

The Chuosijia River Basin (CRB; Fig. 10(a)) is situated on the southeastern edge of the Tibetan Plateau, China, with a drainage area of 14813 km² and an elevation ranging from 2440 to 5403 m (Yang et al., 2023). Impacted by the westerly circulation and the southwest monsoon, the basin has a pronounced monsoon climate with distinct dry and wet seasons, and with rain and heat occurring in the same period. The multi-annual average temperature and precipitation are 8.6°C and 740 mm, respectively. The intra-annual distribution of precipitation is uneven: more than 80% of the precipitation is concentrated from June to October, usually in the form of heavy rainfall due to the warm moisture from the southwestern Indian Ocean (Chen and Alexander, 2022). The multi-year average runoff depth is 392 mm, and runoff is primarily formed by precipitation, followed by snowmelt and groundwater. The first-order stream of the basin finally converges to the Minjiang River, a main tributary of the upper Yangtze River (Liang et al., 2023). Besides, the CRB has little human interference and can be regarded as a natural basin, which makes it suitable for investigating causal interactions in natural hydrological systems. Considering that snowfall mainly occurs from November to May, this period was set as the snow-cover period in this study, and the rest of the water year (June to October) was designated as the snow-free period, so as to explore different causal mechanisms during dry and wet conditions.

The hydrological series include precipitation (P), soil moisture (SM), evaporation (E), snow water equivalent (SWE), and discharge (Q). The precipitation data were obtained from CN05.1 (Wu and Gao, 2013), a gridded dataset (0.25°×0.25°; https://ccrc.iap.ac.cn/) based on more than 2400 meteorological stations. The soil moisture data (0–100 cm) were obtained from SMCI1.0 (Li et al., 2022), a long-term high-resolution soil moisture dataset (1 km×1 km; https://data.tpdc.ac.cn/home/) based on measurements from 1789 stations across China. The evaporation and snow water equivalent data were obtained from GLEAM (0.25°×0.25°; https://www.gleam.eu/) and ERA5-Land (0.1°×0.1°; https://cds.climate.copernicus.eu/#!/home/), respectively, which have been widely applied in the Tibetan Plateau recently (Chen et al., 2022; Li et al., 2019; Yang et al., 2020). The discharge data were obtained from the Hydrological Yearbook of the Yangtze River Basin. All gridded data were aggregated to the whole-basin scale at a daily temporal resolution for the period from 2008 to 2018. The min-max normalized values of the time series in the snow-free (June to October) and snow-cover (November to May) periods during water year 2015–2016 are shown in Figs. 10(b) and (c), respectively.

4.2.2 Results

Figure 11 presents the causal structures identified by the different causal inference methods for the Chuosijia River Basin (CRB) during the snow-free period. In general, all methods can identify the main causal relationships P → SM and P → Q, which denote the basic hydrological processes from precipitation to soil water and streamflow, respectively, but they differ in other relationships. For the link between SM and Q, the CCF and CCM methods present a lagged forward link, while the PCMCI+ method shows a contemporaneous backward link. For the link between E and P, all methods except CCM show a forward direction. For the links between E and SM, and Q and E, the CCF and PCMCI+ methods present forward links, while the CCM method shows backward links, providing a complementary understanding of the interactions among daily evaporation, soil moisture, and streamflow. With respect to causal strength, all methods support a relatively strong association for P → Q.

Figure 12 shows the causal structures identified by the different causal inference methods for the CRB during the snow-cover period. The snow water equivalent (SWE) variable is added to this hydrological system. Generally, all methods can identify the causal relationships E → P and P → Q, which reflect the basic processes of the hydrologic cycle, i.e., from evaporation to precipitation and then to streamflow, but they differ considerably in other relationships. For the link between P and SM, the CCF and PCMCI+ methods present a forward link, while the CCM and TE methods show a backward link. Besides, the CCF and CCM methods identify the causal links E → Q and E → SM, which are omitted by the other two methods. In addition, except for the TE method, all methods identify the causal links SM → Q and E → SWE, which reflect subsurface flow and snowmelt processes, respectively. Only the CCM method detects the snow cover process (P → SWE). Nevertheless, many unreasonable causal relationships emerge in this five-variable system, for instance, the SWE → P link detected by the CCF and TE methods, and the Q → SWE link detected by the CCM method. Compared with the snow-free period, all causal methods support a stronger association between E and P and a weaker association between P and Q in the snow-cover period, and the CCF and CCM methods support a stronger association between SM and Q. The results indicate an increasing contribution of baseflow to streamflow during the snow-cover period, as well as a diminishing effect of precipitation on streamflow.

5.1 Comparison of four causal inference methods

Identifying causal associations in hydrological systems is challenging. Applying different causal methods to artificially generated and real-world hydrological data has been confirmed to yield some incomprehensible or even contradictory results, which leads to some reasonable doubts about the reliability of the currently popular methods, and highlights the importance of maintaining critical attention to the limitations of individual methods.

The CCF method, which is rooted in traditional linear lag correlation theory, yields numerous significant connections in both synthetic and real cases. Although reducing the significance level may control the number of false links (Rinderera et al., 2018), significant associations can still be detected when the p-value threshold is lowered to 0.001. This might be attributed to the relatively strong coupling and synchronization of hydrological systems. Besides, the CCF method essentially measures statistical dependence and does not exclude indirect and confounding factors (Dean and Dunsmuir, 2016), which leads to many spurious causal associations, such as Q → S (Fig. 3a) and M → I (Fig. 3e) in the synthetic case, and GW → SD (Fig. 9a) and SWE → P (Fig. 12a) in real cases 1 and 2, respectively. Nevertheless, the CCF method is simple and efficient, and shows stable performance in the sensitivity tests, with little variation induced by sample length, noise, and missing values. Therefore, the CCF method can be used as a preliminary means of inferring causal associations in hydrological systems and as a benchmark for comparison with other causal inference methods.
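How a common driver can produce a spurious lagged correlation of this kind can be illustrated with a small synthetic example. In the sketch below, two series `a` and `b` respond to the same driver at lags of one and two steps and have no causal link between them; the variable names, lags, and noise level are hypothetical and are not taken from the conceptual hydrological model.

```python
import numpy as np

def lagged_corr(x, y, lag):
    """Pearson correlation between x(t) and y(t + lag) for lag >= 0."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if lag > 0:
        x, y = x[:-lag], y[lag:]
    return float(np.corrcoef(x, y)[0, 1])

rng = np.random.default_rng(42)
n = 2000
driver = rng.normal(size=n + 2)                  # common driver (confounder)
a = driver[1:n + 1] + 0.5 * rng.normal(size=n)   # a(t) = driver(t-1) + noise
b = driver[0:n] + 0.5 * rng.normal(size=n)       # b(t) = driver(t-2) + noise

# a does not influence b, yet a(t) correlates strongly with b(t+1)
# because both carry the same driver value.
spurious = lagged_corr(a, b, 1)
reverse = lagged_corr(b, a, 1)
print(round(spurious, 2), round(reverse, 2))
```

A purely bivariate lag-correlation test would report a → b here, which is the same mechanism behind links such as Q → S in the synthetic case.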

For the CCM method, results in the synthetic case show extremely high false positive rates (almost 1) in both model structures, even if all preset causal relationships were identified simultaneously (Figs. 3b, f). The irrational bi-directional causality was also reported by Ombadi et al. (2020), which highlights the difficulty for the CCM method to address strong coupling in hydrological systems. In real cases, the causal directions were revised to that with the significant maximum causal effect, while some unreasonable causal links were still identified, such as GW → SD, Q → SD (Fig. 9b), and Q → SWE (Fig. 12b) in the real case 1 and 2, respectively, which further confirms the limitations of CCM in dealing with confounding factors (Delforge et al., 2022). Nevertheless, the CCM method still helps to understand complex hydrological processes, which requires us to return to the basic principles and assumptions of this method, that is, the nonlinear causality identified by CCM would only make sense under the framework of nonlinear dynamic system (NDS) with its concept fully understood (Zhao et al., 2023a). The CCM method might contribute to exploring hidden hydrological causal mechanisms from the perspective of NDS, such as the interactions between groundwater and streamflow (Bonotto et al., 2022), and within cryospheric hydrological cycle (Zhao et al., 2023b), which may not be directly observed from the geophysical level, and thus in turn helps to develop models based on physical foundations.

For the TE method, the results show better performance in the synthetic large-sample tests than the CCF and CCM methods, especially in cases where direct and indirect causality are not distinguished. In real cases, the identified significant causal relationships are relatively few, but most of them can be explained by basic hydrological laws, such as the links P → SM (Figs. 8c, 9c) and P → Q (Figs. 11c, 12c). Based on information theory, the TE method is not constrained by linear assumptions and partly overcomes the autocorrelation of hydrological time series by using conditional mutual information; it can therefore be applied in complex hydrological systems, even those with threshold effects (Moges et al., 2022; Tennant et al., 2020). However, similar to the CCF and CCM methods, it cannot fundamentally avoid the interference of confounders and indirect causality, and some unreasonable connections still need to be interpreted with caution. Besides, the sensitivity analysis indicates the necessity of a sufficient sample length (i.e., at least 500–1000) for TE analysis, due to the challenge of estimating a three-dimensional probability density, which has also been emphasized in other research (Kathpalia et al., 2022; Ruddell and Kumar, 2009).
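The three-dimensional density estimation mentioned above can be made concrete with a minimal histogram-based TE estimator. The sketch assumes a history length of one step, four equal-width bins per dimension, and a plug-in estimate in nats; these choices, the function name `transfer_entropy`, and the synthetic coupled pair are illustrative and are not the settings used in the study.

```python
import numpy as np

def transfer_entropy(source, target, bins=4):
    """Plug-in binned transfer entropy (nats) from source to target, lag 1."""
    s = np.asarray(source, dtype=float)
    t = np.asarray(target, dtype=float)
    # Joint samples of (target_{t+1}, target_t, source_t).
    triples = np.column_stack([t[1:], t[:-1], s[:-1]])
    counts, _ = np.histogramdd(triples, bins=bins)
    p_joint = counts / counts.sum()                      # p(t1, t0, s0)
    p_t0_s0 = p_joint.sum(axis=0, keepdims=True)         # p(t0, s0)
    p_t1_t0 = p_joint.sum(axis=2, keepdims=True)         # p(t1, t0)
    p_t0 = p_joint.sum(axis=(0, 2), keepdims=True)       # p(t0)
    num = p_joint * p_t0
    den = p_t0_s0 * p_t1_t0
    nz = p_joint > 0                                     # skip empty cells
    return float(np.sum(p_joint[nz] * np.log(num[nz] / den[nz])))

rng = np.random.default_rng(0)
n = 2000
x = rng.normal(size=n)
y = np.zeros(n)
y[1:] = 0.9 * x[:-1] + 0.3 * rng.normal(size=n - 1)  # x drives y with a one-step lag

te_xy = transfer_entropy(x, y)
te_yx = transfer_entropy(y, x)
print(te_xy > te_yx)
```

With 4 bins per dimension the joint histogram already has 64 cells, so the number of samples per cell shrinks quickly as bins or history length grow, which is one way to see why short records are problematic for TE.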

The PCMCI+ method, which is based on causal network learning, shows the best performance in the synthetic large-sample tests and satisfactory interpretability in real cases. This method infers causality from the network graph of multivariate time series, using iterative conditional independence tests to largely overcome the interference of autocorrelation and confounding factors and to distinguish direct from indirect causality in hydrological systems (Fig. 3d, h). Nevertheless, contemporaneous causal associations in real cases, which arise partly from coarse time resolutions (Runge et al., 2019a), need to be interpreted with caution. For the conditional independence test, partial correlation (ParCorr) was selected in this study, which has also been widely adopted in other research (Karmouche et al., 2023; Nowack et al., 2020). Other tests, such as conditional mutual information (CMI), seem suitable for discovering nonlinear associations in hydrological systems, yet they are fairly time-consuming (Runge et al., 2019b), and recent research showed a low recall rate on synthetic datasets and unstable performance in real cases (Delforge et al., 2022), which highlights the importance of choosing an appropriate test. Moreover, the graph-based method requires the assumption of causal sufficiency, i.e., the absence of hidden variables. In general, this assumption cannot be directly tested, and researchers can only include as many known factors as possible in the causal network analysis based on their prior expert knowledge, which may add uncertainty to the results. Yet, some recent work indicates that the assumption could be appropriately relaxed, especially in the real world (Gerhardus, 2020; Runge, 2018).
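The idea behind a partial-correlation conditional independence test can be sketched in a few lines: both variables are linearly regressed on the conditioning set, and the residuals are correlated. This is only an illustration of the principle, not the PCMCI+ algorithm or its software implementation; the function name `partial_corr` and the synthetic confounded pair are hypothetical.

```python
import numpy as np

def partial_corr(x, y, z):
    """Correlation of x and y after linearly regressing out z (plus intercept)."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    design = np.column_stack([np.ones(len(x)), z])
    res_x = x - design @ np.linalg.lstsq(design, x, rcond=None)[0]
    res_y = y - design @ np.linalg.lstsq(design, y, rcond=None)[0]
    return float(np.corrcoef(res_x, res_y)[0, 1])

rng = np.random.default_rng(7)
n = 2000
p = rng.normal(size=n)               # confounder driving both x and y
x = p + 0.5 * rng.normal(size=n)
y = p + 0.5 * rng.normal(size=n)

raw = float(np.corrcoef(x, y)[0, 1])  # strong dependence induced by p alone
conditioned = partial_corr(x, y, p)   # dependence after conditioning on p
print(round(raw, 2), round(conditioned, 2))
```

The raw correlation is large while the partial correlation is close to zero, so a test conditioned on the confounder would remove the x-y link, whereas a bivariate test would keep it. If the confounder were unobserved, the link could not be removed, which is why the causal sufficiency assumption matters.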

In summary, we recommend a flexible strategy for selecting causal inference methods based on the purpose of research. For exploratory work, such as investigating possible causal connections, the CCF and CCM methods are suggested, even if some irrational links are inevitably produced. The former detects all possible linear associations in hydrological systems, while the latter can reveal the hidden causal interactions in nonlinear dynamic systems. For confirmatory work, such as selecting significant predictors, the TE method can be selected with relatively low false detections. The PCMCI + method, with optimal integrative performance and remarkable interpretability, can be applied to both works mentioned above, and is suggested to deeply understand the interaction mechanisms among multivariates in complex hydrological systems.

Nevertheless, each method improves, to some degree, our understanding of hydrological processes, which calls for a multi-perspective interpretation of the findings obtained by these methods in the context of their particular assumptions and constraints. Specifically, the causality identified by the CCF method can be understood as a linear connection, while that identified by the CCM, TE, and PCMCI+ methods needs to be understood from the perspectives of nonlinear dynamics, information theory, and causal networks, respectively. Therefore, it is important to combine a priori expert knowledge with the assumptions and limitations of each method to comprehensively explain the reasonable or implausible causal associations. Moreover, the application of different causal methods may yield conflicting results, which highlights the importance of stating the assumptions when drawing conclusions on causal issues. To this end, hydrological researchers are encouraged to be more explicit in elaborating assumptions that enable more robust conclusions, and in interpreting and evaluating conclusions under alternative sets of assumptions (Runge et al., 2023).

5.2 Direct and indirect causality

As the central focus of hydrology, the water cycle contains numerous components, including rainfall, evaporation, soil moisture, runoff, groundwater, and snow, which are connected in a direct (e.g., rainfall-soil moisture) or indirect (e.g., groundwater-evaporation) way. Causal inference contributes to revealing the functional connectivity of these components in hydrological systems (Delforge et al., 2022; Rinderera et al., 2018). However, causal methods may erroneously identify indirect causal influences as direct ones due to the transitivity of causal relationships (Park et al., 2023). On the other hand, if two components are not connected directly, a causal influence may still exist, but it must be indirect. It is of great importance to remove the effects of indirect causality and thus determine the direct causal relationships, as the latter serve as the foundation for modeling, prediction, and control of the system (Leng et al., 2020). In comparison with the other algorithms, the PCMCI+ method can effectively distinguish between direct and indirect causal links (Fig. 3; Table 3), with explicit indirect impact mechanisms shown in the graph of multivariate time series. Nevertheless, discriminating between direct and indirect causality in real hydrological systems is still challenging, because both relationships are often intertwined. Taking the P-SM-Q process as an example, precipitation falling on land can first replenish soil moisture through infiltration and then move to the outlet of the basin through subsurface processes (i.e., the indirect process), or be transferred directly through surface processes such as saturation overland flow or infiltration-excess runoff (i.e., the direct process) (Kidron, 2021). How to quantify such complex causal interactions among three or more variables in hydrological systems remains to be further considered (Goodwell and Bassiouni, 2022; Weijs et al., 2018).

5.3 Progress, limitations, and future perspectives

In comparison with previous research (Delforge et al., 2022; Ombadi et al., 2020), our study focuses on hydrological systems, introduces the latest causal network learning algorithm PCMCI+, and uses the CCF method as the link between causality and correlation and as the benchmark for the comparison of other methods. Moreover, in the synthetic cases, we expanded the structures of the conceptual hydrological models, enriched the sensitivity tests, and discussed direct and indirect causality in hydrological systems for the first time. The two real cases deepen our understanding of the different causal methods at different spatiotemporal scales. Our research in both cases systematically investigated the question of when and how to apply these methods and how to interpret their results, thus contributing to their wider use in the hydrology community.

Nonetheless, this study concentrates more on methodological issues; the inner mechanisms of hydrological processes across spatiotemporal scales in the study areas are beyond its scope and need further research (Liu et al., 2020; Wen et al., 2021; Hao et al., 2022). Besides, under different causal frameworks, testing causal mechanisms either unduly relies on assumptions or lacks theoretical examination (Su et al., 2023). Thus, the discovery and testing of causal mechanisms in complicated hydrological systems with unknown causal structures is still challenging. Moreover, hydrological systems may involve latent variables, such as unknown, hidden, or unobservable variables, which can act as confounders and make the detected links spurious. Fortunately, the LPCMCI algorithm, recently proposed by Runge (2020), may overcome this limitation to some extent and can be further applied in hydrology. Additionally, some new methods, such as partial cross mapping (PCM; Leng et al., 2020) and the method proposed by Park et al. (2023), which address the issue of indirect causality based on nonlinear dynamics theory and a monotonic ODE model, respectively, can be tested and evaluated in the future.

The questions–assumptions–data (QAD) template recently proposed by Runge et al. (2023) helps researchers phrase and tackle their questions within the framework of causal inference. Yet some typical challenges, such as hidden confounding, non-stationarity, contemporaneous causality, and the preprocessing of variables, need extra attention. Fortunately, the era of big data provides new opportunities for research in this field, as big data could comprehensively characterize the structural information between variables, check for spurious causal links in the causal network structure, and infer latent variable structures that are difficult to observe (Li et al., 2023a). To this end, causal inference helps to bridge data-driven machine learning and prior expert knowledge, thus promoting the understanding of complex hydrological processes and robust causal prediction.

Conclusion

In this research, the performance of four popular causal inference methods (CCF, CCM, TE, and PCMCI+) was systematically evaluated in hydrological systems using both synthetic and real-world time series. For the synthetic cases, the PCMCI+ method shows the best performance in the large-sample tests, while the other three methods perform relatively poorly owing to the interference of confounding and mediating factors. In the sensitivity tests, the CCF method is less affected by sample length, noise, and missing values, while the CCM method is significantly affected by the missing rate, especially once it exceeds the critical value of 30%. Besides, the TE and PCMCI+ methods need at least 500 samples to achieve relative stability and are vulnerable to noise. For the real cases, the PCMCI+ method shows the best interpretability, while the other three methods produce many inexplicable causal links. Additionally, some contradictory results emerge among the methods, owing to their different assumptions and limitations.
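The kind of missing-value sensitivity test summarized above can be sketched for the CCF method as follows. This is only an illustrative setup, not the conceptual hydrological model or test protocol of this study: the white-noise driver, the response lag of 3 steps, the coefficients, the missing rates, and the helper `ccf_with_gaps` are all assumptions, and gaps are handled here by simply using the time steps at which both series are present.

```python
import numpy as np


def ccf_with_gaps(x, y, max_lag):
    """Correlation of x(t - lag) with y(t) for lag = 0..max_lag,
    using only time steps where both values are present (not NaN)."""
    values = []
    for lag in range(max_lag + 1):
        a = x[: len(x) - lag]
        b = y[lag:]
        ok = ~np.isnan(a) & ~np.isnan(b)
        values.append(np.corrcoef(a[ok], b[ok])[0, 1])
    return np.array(values)


rng = np.random.default_rng(1)
n, true_lag = 2000, 3

# Hypothetical driver x and a response y that follows it after true_lag steps.
x = rng.standard_normal(n)
y = 0.5 * rng.standard_normal(n)
y[true_lag:] += 0.7 * x[:-true_lag]

peak_lags = {}
for rate in (0.0, 0.3, 0.5):
    xm, ym = x.copy(), y.copy()
    # Remove values independently at random in each series.
    xm[rng.random(n) < rate] = np.nan
    ym[rng.random(n) < rate] = np.nan
    ccf = ccf_with_gaps(xm, ym, max_lag=10)
    peak_lags[rate] = int(np.argmax(np.abs(ccf)))

print(peak_lags)
```

Each entry of `peak_lags` gives the lag at which the absolute cross-correlation is largest for that missing rate, so comparing the entries with `true_lag` shows whether the detected lag survives the gaps in this simple linear setting.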

In summary, the PCMCI+ method is a favorable choice for causal inference in hydrological systems, although some of its strong assumptions, such as causal sufficiency, need to be considered with caution. Nonetheless, each method improves our understanding of hydrological processes to some degree, so the findings should be interpreted from multiple perspectives in the context of each method's particular assumptions and constraints. A comprehensive application of diverse methods according to the specific issue is encouraged for the robustness of conclusions, with the assumptions stated explicitly in advance. Causal inference methods provide a complementary data-driven avenue for unlocking the inner mechanisms of complex hydrological systems and have broad application prospects in the hydrology community.

Acknowledgements

This research was supported by the National Natural Science Foundation of China (No. 52379017).

CRediT authorship contribution statement

Hanxu Liang: Methodology, Investigation, Formal analysis, Visualization, Writing - Original Draft. Wensheng Wang: Supervision, Funding acquisition, Writing - Review & Editing. Bin Chen: Data curation. Li Guo: Writing - Review & Editing. Hu Liu: Writing - Review & Editing. Siyi Yu: Validation. Dan Zhang: Conceptualization, Validation.

Declaration of Competing Interest

All authors agreed to the published version of the manuscript and declare no conflicts of interest.

References

Altman, N., and M. Krzywinski (2015), Association, correlation and causation, Nat. Methods 12(10), 899-900. https://doi.org/10.1038/nmeth.3587.
Angelov, P. P., E. A. Soares, R. C. Jiang, N. I. Arnold, and P. M. Atkinson (2021), Explainable artificial intelligence: an analytical review, Wires. Data. Min. Knowl. 11(5). https://doi.org/10.1002/widm.1424.
Apaydin, H., and M. Sibtain (2021), A multivariate streamflow forecasting model by integrating improved complete ensemble empirical mode decomposition with additive noise, sample entropy, Gini index and sequence-to-sequence approaches, J. Hydrol. 603. https://doi.org/10.1016/j.jhydrol.2021.126831.
Bennett, A., B. Nijssen, G. X. Ou, M. Clark, and G. Nearing (2019), Quantifying Process Connectivity With Transfer Entropy in Hydrologic Models, Water Resour. Res. 55(6), 4613-4629. https://doi.org/10.1029/2018wr024555.
Bergström, S., and L. P. Graham (1998), On the scale problem in hydrological modelling, J. Hydrol. 211(1-4), 253-265. https://doi.org/10.1016/s0022-1694(98)00248-0.
Blöschl, G., M. F. P. Bierkens, A. Chambel, C. Cudennec, and G. Destouni (2019), Twenty-three unsolved problems in hydrology (UPH) - a community perspective, Hydrol. Sci. J. 64(10), 1141-1158. https://doi.org/10.1080/02626667.2019.1620507.
Bonotto, G., T. J. Peterson, K. Fowler, and A. W. Western (2022), Identifying Causal Interactions Between Groundwater and Streamflow Using Convergent Cross-Mapping, Water Resour. Res. 58(8). https://doi.org/10.1029/2021wr030231.
Budakoti, S., T. Chauhan, R. Murtugudde, S. Karmakar, and S. Ghosh (2021), Feedback From Vegetation to Interannual Variations of Indian Summer Monsoon Rainfall, Water Resour. Res. 57(5). https://doi.org/10.1029/2020wr028750.
Chen, J., F. P. Brissette, and R. Leconte (2010), A daily stochastic weather generator for preserving low-frequency of climate variability, J. Hydrol. 388(3-4), 480-490. https://doi.org/10.1016/j.jhydrol.2010.05.032.
Chen, F., W. T. Crow, P. J. Starks, and D. N. Moriasi (2011), Improving hydrologic predictions of a catchment model via assimilation of surface soil moisture, Adv. Water Resour. 34(4), 526-536. https://doi.org/10.1016/j.advwatres.2011.01.011.
Chen, R., M. X. Yang, X. J. Wang, G. N. Wan, and H. Y. Li (2022), Thermal regime variations of the uppermost soil layer in the central Tibetan Plateau, Catena 213. https://doi.org/10.1016/j.catena.2022.106224.
Chen, Y., and D. Alexander (2022), Integrated flood risk assessment of river basins: Application in the Dadu river basin, China, J. Hydrol. 613. https://doi.org/10.1016/j.jhydrol.2022.128456.
Cryer, J. D., and K. Chan (2008), Time series analysis with applications in R. New York, NY: Springer.
Dean, R. T., and W. T. M. Dunsmuir (2016), Dangers and uses of cross-correlation in analyzing time series in perception, performance, movement, and neuroscience: The importance of constructing transfer function autoregressive models, Behav. Res. Methods 48(2), 783-802. https://doi.org/10.3758/s13428-015-0611-2.
Delforge, D., O. de Viron, M. Vanclooster, M. Van Camp, and A. Watlet (2022), Detecting hydrological connectivity using causal inference from time series: synthetic and real karstic case studies, Hydrol. Earth Syst. Sci. 26(8), 2181-2199. https://doi.org/10.5194/hess-26-2181-2022.
Faybishenko, B. (2017), Detecting dynamic causal inference in nonlinear two-phase fracture flow, Adv. Water Resour. 106, 111-120. https://doi.org/10.1016/j.advwatres.2017.02.011.
Gao, Y. B., C. Merz, G. Lischeid, and M. Schneider (2018), A review on missing hydrological data processing, Environ. Earth. Sci. 77(2). https://doi.org/10.1007/s12665-018-7228-6.
Geng, S., F. Devries, and I. Supit (1986), A simple method for generating daily rainfall data, Agric. Meteorol. 36(4), 363-376. https://doi.org/10.1016/0168-1923(86)90014-6.
Gerhardus, A., and J. Runge (2020), High-recall causal discovery for autocorrelated time series with latent confounders, Advances in Neural Information Processing Systems 33, 12615–12625, Curran Associates, Inc.
Gong, W., D. W. Yang, H. V. Gupta, and G. Nearing (2014), Estimating information entropy for hydrological data: One-dimensional case, Water Resour. Res. 50(6), 5003-5018. https://doi.org/10.1002/2014wr015874.
Good, S. P., D. Noone, and G. Bowen (2015), Hydrologic connectivity constrains partitioning of global terrestrial water fluxes, Science 349(6244), 175-177. https://doi.org/10.1126/science.aaa5931.
Goodwell, A. E., and M. Bassiouni (2022), Source Relationships and Model Structures Determine Information Flow Paths in Ecohydrologic Models, Water Resour. Res. 58(9). https://doi.org/10.1029/2021wr031164.
Granger, C. W. (1969), Investigating causal relations by econometric models and cross‐spectral methods, Econometrica: Journal of the Econometric Society, 37(3), 424–438. https://doi.org/10.2307/1912791.
Guo, L., J. Chen, and H. Lin (2014), Subsurface lateral preferential flow network revealed by time-lapse ground-penetrating radar in a hillslope, Water Resour. Res. 50(12), 9127-9147. https://doi.org/10.1002/2013wr014603.
Hao, Y., F. B. Sun, H. Wang, W. B. Liu, Y. J. Shen, Z. Li, and S. J. Hu (2022), Understanding climate-induced changes of snow hydrological processes in the Kaidu River Basin through the CemaNeige-GR6J model, Catena 212. https://doi.org/10.1016/j.catena.2022.106082.
Jiang, Y. J., Y. L. Zhang, B. H. Fan, J. H. Wen, H. Liu, C. R. Mello, J. F. Cui, C. Yuan, and L. Guo (2023), Preferential flow influences the temporal stability of soil moisture in a headwater catchment, Geoderma 437. https://doi.org/10.1016/j.geoderma.2023.116590.
Karmouche, S., E. Galytska, J. Runge, G. A. Meehl, A. S. Phillips, K. Weigel, and V. Eyring (2023), Regime-oriented causal model evaluation of Atlantic-Pacific teleconnections in CMIP6, Earth System Dynamics 14(2), 309-344. https://doi.org/10.5194/esd-14-309-2023.
Kathpalia, A., P. Manshour, and M. Palus (2022), Compression complexity with ordinal patterns for robust causal inference in irregularly sampled time series, Sci. Rep. 12(1). https://doi.org/10.1038/s41598-022-18288-4.
Khatibi, R., B. Sivakumar, M. A. Ghorbani, O. Kisi, K. Kocak, and D. F. Zadeh (2012), Investigating chaos in river stage and discharge time series, J. Hydrol. 414, 108-117. https://doi.org/10.1016/j.jhydrol.2011.10.026.
Kidron, G. J. (2021), Comparing overland flow processes between semiarid and humid regions: Does saturation overland flow take place in semiarid regions?, J. Hydrol. 593. https://doi.org/10.1016/j.jhydrol.2020.125624.
Kinney, J. B., and G. S. Atwal (2014), Equitability, mutual information, and the maximal information coefficient, Proc. Natl. Acad. Sci. U. S. A. 111(9), 3354-3359. https://doi.org/10.1073/pnas.1309933111.
Kleidon, A., and M. Renner (2013), Thermodynamic limits of hydrologic cycling within the Earth system: concepts, estimates and implications, Hydrol. Earth Syst. Sci. 17(7), 2873-2892. https://doi.org/10.5194/hess-17-2873-2013.
Konapala, G., S. C. Kao, and N. Addor (2020), Exploring Hydrologic Model Process Connectivity at the Continental Scale Through an Information Theory Approach, Water Resour. Res. 56(10). https://doi.org/10.1029/2020wr027340.
Leng, S. Y., H. F. Ma, J. Kurths, Y. C. Lai, W. Lin, K. Aihara, and L. N. Chen (2020), Partial cross mapping eliminates indirect causal influences, Nat. Commun. 11(1). https://doi.org/10.1038/s41467-020-16238-0.
Li, Q. L., G. S. Shi, W. Shangguan, V. Nourani, J. D. Li, L. Li, F. N. Huang, Y. Zhang, C. Y. Wang, D. G. Wang, J. X. Qiu, X. J. Lu, and Y. J. Dai (2022), A 1 km daily soil moisture dataset over China using in situ measurement and machine learning, Earth. Syst. Sci. Data 14(12), 5267-5286. https://doi.org/10.5194/essd-14-5267-2022.
Li, X., M. Feng, Y. H. Ran, Y. Su, F. Liu, C. L. Huang, H. F. Shen, Q. Xiao, J. B. Su, S. W. Yuan, and H. D. Guo (2023a), Big Data in Earth system science and progress towards a digital twin, Nat. Rev. Earth Env. https://doi.org/10.1038/s43017-023-00409-w.
Li, Z. W., X. L. Xu, and K. L. Wang (2023b), Effects of distribution patterns of karst landscapes on runoff and sediment yield in karst watersheds, Catena 223. https://doi.org/10.1016/j.catena.2023.106947.
Li, X. Y., D. Long, Z. Y. Han, B. R. Scanlon, Z. L. Sun, P. F. Han, and A. Z. Hou (2019), Evapotranspiration Estimation for Tibetan Plateau Headwaters Using Conjoint Terrestrial and Atmospheric Water Balances and Multisource Remote Sensing, Water Resour. Res. 55(11), 8608-8630. https://doi.org/10.1029/2019wr025196.
Liang, H. X., D. Zhang, W. S. Wang, S. Y. Yu, and S. Nimai (2023), Evaluating future water security in the upper Yangtze River Basin under a changing environment, Sci. Total Environ. 889. https://doi.org/10.1016/j.scitotenv.2023.164101.
Lin, H. (2006), Temporal stability of soil moisture spatial pattern and subsurface preferential flow pathways in the shale hills catchment, Vadose Zone J. 5(1), 317-340. https://doi.org/10.2136/vzj2005.0058.
Liu, H., Y. Yu, W. Z. Zhao, L. Guo, J. T. Liu, and Q. Y. Yang (2020), Inferring Subsurface Preferential Flow Features From a Wavelet Analysis of Hydrological Signals in the Shale Hills Catchment, Water Resour. Res. 56(11). https://doi.org/10.1029/2019wr026668.
Luo, M., F. Meng, Y. Wang, C. Sa, Y. Duan, Y. Bao, and T. Liu (2023), Quantitative detection and attribution of soil moisture heterogeneity and variability in the Mongolian Plateau, J. Hydrol. 621. https://doi.org/10.1016/j.jhydrol.2023.129673.
Massei, N., J. P. Dupont, B. J. Mahler, B. Laignel, M. Fournier, D. Valdes, and S. Ogier (2006), Investigating transport properties and turbidity dynamics of a karst aquifer using correlation, spectral, and wavelet analyses, J. Hydrol. 329(1), 244-257. https://doi.org/10.1016/j.jhydrol.2006.02.021.
Moges, E., B. L. Ruddell, L. Zhang, J. M. Driscoll, and L. G. Larsen (2022), Strength and Memory of Precipitation's Control Over Streamflow Across the Conterminous United States, Water Resour. Res. 58(3). https://doi.org/10.1029/2021wr030186.
Nowack, P., J. Runge, V. Eyring, and J. D. Haigh (2020), Causal networks for climate model evaluation and constrained projections, Nat. Commun. 11(1). https://doi.org/10.1038/s41467-020-15195-y.
Ombadi, M., P. Nguyen, S. Sorooshian, and K. L. Hsu (2020), Evaluation of Methods for Causal Discovery in Hydrometeorological Systems, Water Resour. Res. 56(7). https://doi.org/10.1029/2020wr027251.
Park, S. H., S. Ha, and J. K. Kim (2023), A general model-based causal inference method overcomes the curse of synchrony and indirect effect, Nat. Commun. 14(1). https://doi.org/10.1038/s41467-023-39983-4.
Patil, S., and M. Stieglitz (2014), Modelling daily streamflow at ungauged catchments: what information is necessary?, Hydrol. Process. 28(3), 1159-1169. https://doi.org/10.1002/hyp.9660.
Pearl, J. (2009), Causality: Models, Reasoning, and Inference, Cambridge University Press, Cambridge, 2 edn. https://doi.org/10.1017/CBO9780511803161.
Peng, S. L., K. Mihara, X. L. Xu, K. Kuramochi, Y. Toma, and R. Hatano (2024), Modeling hydrological processes under Multi-Model projections of climate change in a cold region of Hokkaido, Japan, Catena 234. https://doi.org/10.1016/j.catena.2023.107605.
Reichenbach, H. (1956), The direction of time, University of California Press, Berkeley and Los Angeles.
Reichstein, M., G. Camps-Valls, B. Stevens, M. Jung, J. Denzler, N. Carvalhais, and Prabhat (2019), Deep learning and process understanding for data-driven Earth system science, Nature 566(7743), 195-204. https://doi.org/10.1038/s41586-019-0912-1.
Richardson, C. W. (1981), Stochastic simulation of daily precipitation, temperature, and solar-radiation, Water Resour. Res. 17(1), 182-190. https://doi.org/10.1029/WR017i001p00182.
Rinderer, M., G. Ali, and L. G. Larsen (2018), Assessing structural, functional and effective hydrologic connectivity with brain neuroscience methods: State-of-the-art and research directions, Earth-Sci. Rev. 178, 29-47. https://doi.org/10.1016/j.earscirev.2018.01.009.
Ruddell, B. L., and P. Kumar (2009), Ecohydrologic process networks: 1. Identification, Water Resour. Res. 45(3). https://doi.org/10.1029/2008wr007279.
Rudin, C. (2019), Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead, Nat. Mach. Intell. 1(5), 206-215. https://doi.org/10.1038/s42256-019-0048-x.
Runge, J. (2018), Causal network reconstruction from time series: From theoretical assumptions to practical estimation, Chaos 28(7). https://doi.org/10.1063/1.5025050.
Runge, J. (2020), Discovering contemporaneous and lagged causal relations in autocorrelated nonlinear time series datasets. In Proc. 36th Conf. Uncertainty in Artificial Intelligence (UAI) Vol. 124 of Proc. Machine Learning Research (eds Peters, J. & Sontag, D.) 1388–1397.
Runge, J., S. Bathiany, E. Bollt, G. Camps-Valls, D. Coumou, E. Deyle, C. Glymour, M. Kretschmer, M. D. Mahecha, J. Munoz-Mari, E. H. van Nes, J. Peters, R. Quax, M. Reichstein, M. Scheffer, B. Scholkopf, P. Spirtes, G. Sugihara, J. Sun, K. Zhang, and J. Zscheischler (2019a), Inferring causation from time series in Earth system sciences, Nat. Commun. 10. https://doi.org/10.1038/s41467-019-10105-3.
Runge, J., A. Gerhardus, G. Varando, V. Eyring, and G. Camps-Valls (2023), Causal inference for time series, Nat. Rev. Earth Env. 4(7), 487-505. https://doi.org/10.1038/s43017-023-00431-y.
Runge, J., P. Nowack, M. Kretschmer, S. Flaxman, and D. Sejdinovic (2019b), Detecting and quantifying causal associations in large nonlinear time series datasets, Sci. Adv. 5(11). https://doi.org/10.1126/sciadv.aau4996.
Sang, Y. F., V. P. Singh, J. Wen, and C. M. Liu (2015), Gradation of complexity and predictability of hydrological processes, J. Geophys. Res. Atmos. 120(11), 5334-5343. https://doi.org/10.1002/2014jd022844.
Schreiber, T. (2000), Measuring information transfer, Phys. Rev. Lett. 85(2), 461-464. https://doi.org/10.1103/PhysRevLett.85.461.
Shannon, C. E. (1948), A mathematical theory of communication, Bell System Technical Journal, 27(3), 379–423. https://doi.org/10.1002/j.1538-7305.1948.tb01338.x.
Shi, Y., K. J. Davis, C. J. Duffy, and X. Yu (2013), Development of a Coupled Land Surface Hydrologic Model and Evaluation at a Critical Zone Observatory, J. Hydrometeorol. 14(5), 1401-1420. https://doi.org/10.1175/jhm-d-12-0145.1.
Shojaie, A., and E. B. Fox (2022), Granger Causality: A Review and Recent Advances, Annu. Rev. Stat. Appl. 9, 289-319. https://doi.org/10.1146/annurev-statistics-040120-010930.
Sivakumar, B. (2004), Chaos theory in geophysics: past, present and future, Chaos Solitons & Fractals 19(2), 441-462. https://doi.org/10.1016/s0960-0779(03)00055-9.
Student (1908), The Probable Error of a Mean, Biometrika 6(1), 1-25. https://doi.org/10.2307/2331554.
Su, J. B., D. X. Chen, D. H. Zheng, Y. Su, and X. Li (2023), The insight of why: Causal inference in Earth system science, Science China-Earth Sciences. https://doi.org/10.1007/s11430-023-1148-7.
Sugihara, G., E. R. Deyle, and H. Ye (2017), Misconceptions about causation with synchrony and seasonal drivers reply, Proc. Natl. Acad. Sci. U. S. A. 114(12), E2272-E2274. https://doi.org/10.1073/pnas.1700998114.
Sugihara, G., R. May, H. Ye, C. H. Hsieh, E. Deyle, M. Fogarty, and S. Munch (2012), Detecting Causality in Complex Ecosystems, Science 338(6106), 496-500. https://doi.org/10.1126/science.1227079.
Takens, F. (1981), Detecting Strange Attractors in Turbulence, In Dynamical systems and turbulence, Warwick 1980, (pp. 366–381). Berlin, Heidelberg: Springer. https://doi.org/10.1007/BFb0091924.
Tennant, C., L. Larsen, D. Bellugi, E. Moges, L. Zhang, and H. X. Ma (2020), The Utility of Information Flow in Formulating Discharge Forecast Models: A Case Study From an Arid Snow-Dominated Catchment, Water Resour. Res. 56(8). https://doi.org/10.1029/2019wr024908.
Wang, Q. J., C. F. Yue, X. Q. Li, P. Liao, and X. Y. Li (2023), Enhancing robustness of monthly streamflow forecasting model using embedded-feature selection algorithm based on improved gray wolf optimizer, J. Hydrol. 617. https://doi.org/10.1016/j.jhydrol.2022.128995.
Wang, Y., J. Yang, Y. Chen, P. De Maeyer, Z. Li, and W. Duan (2018), Detecting the Causal Effect of Soil Moisture on Precipitation Using Convergent Cross Mapping, Sci. Rep. 8. https://doi.org/10.1038/s41598-018-30669-2.
Weijs, S. V., H. Foroozand, and A. Kumar (2018), Dependency and Redundancy: How Information Theory Untangles Three Variable Interactions in Environmental Data, Water Resour. Res. 54(10), 7143-7148. https://doi.org/10.1029/2018wr022649.
Wen, H., S. L. Brantley, K. J. Davis, J. M. Duncan, and L. Li (2021), The Limits of Homogenization: What Hydrological Dynamics can a Simple Model Represent at the Catchment Scale?, Water Resour. Res. 57(6). https://doi.org/10.1029/2020wr029528.
Wu, J., and X. J. Gao (2013), A gridded daily observation dataset over China region and comparison with the other datasets, Chinese Journal of Geophysics-Chinese Edition 56(4), 1102-1111. https://doi.org/10.6038/cjg20130406.
Xu, Z. P., X. L. Man, L. L. Duan, and T. J. Cai (2022), Improved subsurface soil moisture prediction from surface soil moisture through the integration of the (de)coupling effect, J. Hydrol. 608, 12. https://doi.org/10.1016/j.jhydrol.2022.127634.
Yang, W. J., Y. B. Wang, X. Liu, H. P. Zhao, R. Shao, and G. X. Wang (2020), Evaluation of the rescaled complementary principle in the estimation of evaporation on the Tibetan Plateau, Sci. Total Environ. 699. https://doi.org/10.1016/j.scitotenv.2019.134367.
Yang, Y., S. J. Chen, Y. R. Zhou, G. W. Ma, W. B. Huang, and Y. M. Zhu (2023), Method for quantitatively assessing the impact of an inter-basin water transfer project on ecological environment-power generation in a water supply region, J. Hydrol. 618. https://doi.org/10.1016/j.jhydrol.2023.129250.
Ye, H., E. R. Deyle, L. J. Gilarranz, and G. Sugihara (2015), Distinguishing time-delayed causal interactions using convergent cross mapping, Sci. Rep. 5. https://doi.org/10.1038/srep14750.
Yu, C., B. Gao, R. Muñoz-Carpena, Y. Tian, L. Wu, and O. Perez-Ovilla (2011), A laboratory study of colloid and solute transport in surface runoff on saturated soil, J. Hydrol. 402(1), 159-164. https://doi.org/10.1016/j.jhydrol.2011.03.011.
Zhang, D., W. S. Wang, S. Y. Yu, S. Q. Liang, and Q. F. Hu (2021a), Assessment of the Contributions of Climate Change and Human Activities to Runoff Variation: Case Study in Four Subregions of the Jinsha River Basin, China, J. Hydrol. Eng. 26(9). https://doi.org/10.1061/(asce)he.1943-5584.0002119.
Zhang, L., E. Moges, J. W. Kirchner, E. Coda, T. C. Liu, A. S. Wymore, Z. X. Xu, and L. G. Larsen (2021b), CHOSEN: A synthesis of hydrometeorological data from intensively monitored catchments and comparative analysis of hydrologic extremes, Hydrol. Process. 35(11). https://doi.org/10.1002/hyp.14429.
Zhao, Y. Y., T. J. Zhu, Z. Q. Zhou, H. J. Cai, and Z. D. Cao (2023a), Detecting nonlinear information about drought propagation time and rate with nonlinear dynamic system and chaos theory, J. Hydrol. 623. https://doi.org/10.1016/j.jhydrol.2023.129810.
Zhao, Y. Y., Y. G. Zou, E. Z. Ma, Z. Q. Zhou, Y. Q. Feng, Z. D. Cao, H. J. Cai, C. Li, and Y. H. Yan (2023b), Can groundwater storage in turn affect the cryospheric variables? A new perspective from nonlinear dynamic causality detection, J. Hydrol. 624, 14. https://doi.org/10.1016/j.jhydrol.2023.129910.
Zounemat-Kermani, M., O. Batelaan, M. Fadaee, and R. Hinkelmann (2021), Ensemble machine learning paradigms in hydrology: A review, J. Hydrol. 598, 13. https://doi.org/10.1016/j.jhydrol.2021.126266.

No competing interests reported.

Editorial decision: Revision requested
23 Sep, 2024
Reviews received at journal
26 Aug, 2024
Reviews received at journal
14 Aug, 2024
Reviewers agreed at journal
30 Jul, 2024
Reviewers agreed at journal
08 Jul, 2024
Reviewers invited by journal
07 Jul, 2024
Editor assigned by journal
28 Jun, 2024
Submission checks completed at journal
26 Jun, 2024
First submitted to journal
26 Jun, 2024
