Next Article in Journal
An Effective Framework for Intellectual Property Protection of NLG Models
Previous Article in Journal
Geometric Numerical Methods for Lie Systems and Their Application in Optimal Control
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Wavelet Multiscale Granger Causality Analysis Based on State Space Models

1
Department of Mathematics, College of Information Science and Technology, Jinan University, Guangzhou 510632, China
2
Department of Computer Science, College of Information Science and Technology, Jinan University, Guangzhou 510632, China
*
Author to whom correspondence should be addressed.
Symmetry 2023, 15(6), 1286; https://doi.org/10.3390/sym15061286
Submission received: 9 May 2023 / Revised: 12 June 2023 / Accepted: 15 June 2023 / Published: 20 June 2023

Abstract

:
Granger causality (GC) is a popular method in causal linkage recovery and has been applied to various fields, such as economics and neuroscience. While the conventional Granger causality model is capable of identifying symmetrical causal relationships among variables, it is the asymmetric Granger causality that provides a more comprehensive perspective of the short- and long-term interactions between variables, which is of greater value for empirical study. Traditional vector autoregressive models lack the ability to explore multiscale information flow and are affected by the moving average component. Therefore, by combining the wavelet-based approach and state space model, we propose a new Granger causality analysis method to overcome the inherent limitation of vector autoregressive models and extend to multiscale causality exploration. Two simulations were conducted to compare the proposed approach to an existing wavelet-based method, and five evaluation indicators were utilized. The results indicate that the proposed method efficiently identifies the accurate asymmetric causalities at varying scales, while improving accuracy and reducing bias as compared to the current wavelet-based method. In conclusion, the combination of the wavelet approach and state space method enhances the multiscale causality detecting capability and can potentially contribute to multiscale Granger causality research.

1. Introduction

Granger causality, which was proposed and refined by Norbert Wiener and Clive Granger, is primarily used to establish the presence of causal linkages across variables in econometric research [1].
In 1956, Norbert Wiener [2] first suggested that Y could be called the cause of X if adding the information of Y would improve the prediction of X. However, it was not until 1969 that Clive Granger provided a concrete implementation of a linear autoregressive model based on the stochastic process [3].
In the simplest bivariate case, Granger causality is calculated by first dividing the variables X and Y into two sets: one set contains all the variables and the other only contains the target variable X, which are used to constructed two separate vector autoregressive (VAR) models—the full VAR models and the restricted VAR models. These give the residual covariance matrices of the two models, and the Granger causality from Y to X can be defined as the likelihood ratio of residual covariance between the restricted and full models.
Since Granger causality is related to the flow of information across variables, it has been used in a growing number of studies in various fields, including the inference of information flow in the human brain as well as the analysis of economic and engineering data [4,5,6,7,8]. With the growing number of potential applications, Granger causality analysis is increasingly being employed to investigate multivariate and multiscale data scenarios. Moreover, recent research has explored the potential to extend Granger causality analysis to the compression domain, highlighting the versatility and adaptability of this technique to various applications and fields [9].
However, the traditional VAR models only consider the bivariate case, which may be affected by confounding factors or joint distributions in the multivariate case. Furthermore, traditional VAR models are unable to take into account the dynamic complexity of individual processes at various time scales and therefore need to be modified to apply to multiscale data, or we can consider investigating other models that are more suitable for multivariate or multiscale analysis.
Geweke [10] originally introduced the concept of symmetric Granger causality, which refers to a dual causality relationship between two variables, whereby X Granger causes Y, and Y Granger causes X simultaneously, and generalized Granger causality to a multivariate approach that has been developed and well proven throughout time [11].
Although there are several restrictions, using VAR models for Granger causality analysis typically necessitates that the data adhere to the smoothness and linearity criteria. If there is a co-dependent third variable for both variables, it may lead to a spurious causal relationship, which can be eliminated by considering co-dependencies. When dealing with exogenous or latent variables, it may be challenging to entirely eliminate their confounding effects. Nonetheless, in such situations, the application of conditional Granger causality can help reduce their impact on the analysis to a certain extent [12].
A Matlab toolbox named Granger causal connectivity analysis (GCCA) developed by Seth A. K. et al. [13] in 2010 can be used to conduct Granger causal linkage analysis on various neuroscience data, including functional magnetic resonance imaging (fMRI). The updated multivariate Granger causality (MVGC) toolbox applied to multivariate variables was proposed in 2014. It contains computations for conditional and unconditional, time and frequency domains, and pairwise Granger causality based on the VAR model [14], helping to discover symmetric and asymmetric Granger causality.
Even though the VAR model is straightforward and simple to estimate, it has a significant limitation, in that moving average components can lead to an infinite number of model orders or require a larger number of model orders to model a finite number of small samples. This not only reduces the statistical effectiveness but also enhances biases, weakening the ability to obtain accurate estimates of Granger causality. Moving average components are common in time series modeling, whether they are inherent to the time series itself, or generated during data sampling and processing.
Numerous research studies have sought to minimize estimation bias when applying Granger causality analysis. One of the proposed solutions involves utilizing a combination of lasso regression and genetic algorithm, facilitated by initialization, selection, feature cross over, and mutation, to overcome coefficient overfitting [15]. Similarly, a new unified Granger causality method has been proposed, addressing the inconsistency between model order selection criteria and hypothesis testing procedures [16]. This method utilizes the minimum description length principle to reconcile the two stages, offering a promising approach for future multiscale research.
The state space model has been applied to overcome moving average components and demonstrated that it can achieve better statistical power and less bias than the traditional AR model as well as a strong Granger causality maintained with noise and downsampling [17,18]. These are due to the fact that the state space model has the equivalence to the auto regression moving average (ARMA) model and is independent of noise and downsampling [19]. Additionally, the state space model was integrated with recurrent neural networks by the researchers in order to increase prediction accuracy by accounting for background information that might have an impact on the target time series [20].
Complex system time series with multiscale features are very common in practical applications, such as heartbeat signals, rainfall time series, and electroencephalogram data [21,22,23]. Time series with multiscale characteristics usually fluctuate at different time scales, implying that system transformations have a multi-level time scale structure and time domain localization characteristics. This is the aspect that traditional time domain Granger causality analysis cannot investigate. It is crucial to consider approaches to overcome the limitations of the VAR model, better accommodate the multiscale features of the data, and improve the precision of Granger causality analysis.
Costa et al. [24] presented a multiscale entropy approach and used it to analyze the multiscale properties of the cardiovascular signals data. Faes et al. [25] proposed a multivariate stochastic process analysis framework based on the state space model. This framework explored a weighted average rescaling operation, which equals to the effects of averaging and downsampling. The authors successfully applied this method to explore multiscale Granger relationships using carbon dioxide (CO 2 ) and global temperature data.
Reducing CO 2 emissions is a crucial aspect of sustainable development. Recent studies have analyzed the factors that influence CO 2 emissions from various perspectives such as technology and innovation, financial development, and human development. Policy advice is provided based on these analyses.
A panel causality analysis using the cross-sectional autoregressive distributed lag (CS-ARDL) method was conducted to investigate the factors affecting CO 2 emissions in a panel of BRICS (i.e., Brazil, Russia, India, China, and South Africa) countries [26]. Similarly, the impact of greenhouse gas emissions on the Group of Seven economy was studied from the viewpoint of environment-related information and communication technologies innovations [27]. Adebayo et al. adopt the Fourier quantile causality test to examine the causal impact of environmental quality in China [28]. The study explores the causal relationships between CO 2 emissions, ecological footprint, and load capacity factor.
However, as society develops, CO 2 emissions display diverse patterns over time. Therefore, a multiscale analysis can better assist researchers in identifying causal relationships at various time scales.
The multiscale analysis is generally used to evaluate the intrinsic structure of a single time series and the multiscale representation of the interaction of information from two time series. It assesses the multiscale dynamic nature of a sequence by resampling the original time series of a complex process at different time scales, thus revealing the structure at multiple time scales and better reflecting information about the dynamics of the complex system [29].
In order to better analyze multiscale data, Lungarella M et al. proposed multiscale transfer entropy based on wavelet transform [30]. This method projects the time series into wavelet space to obtain a new set of variables from which causal linkages can be extracted at many scales.
The wavelet transform is a signal processing technique that represents instantaneous or non-stationary signals in terms of time and scale distributions, and it has good localization capabilities for localizing time series [31].
Stramaglia et al. introduced a wavelet-based multiscale Granger analysis technique that overcomes the computational complexity associated with filtering and downsampling at multiple scales [32]. The method employs the a ´ trous wavelets that maintain translation invariance in the processing of time series, with decomposition coefficients at every level equating to the original time series length. The wavelet-transformed data are then utilized as a substitute for the driving variable data for Granger causality analysis. It is worth noting, however, that this approach depends on the VAR model’s inherent weaknesses previously mentioned, which may potentially increase the likelihood of producing inaccurate results.
Although multiscale Granger causality has been subjected to extensive research, the majority of these studies have employed the VAR model without addressing its limitations.
To improve the limitations mentioned in this section, the state space model with the advantage of downsamplng independence and the equality to ARMA model is considered in this paper, and a multiscale wavelet analysis framework based on the state space model is proposed, which can improve the accuracy and robustness of Granger causality compared to the VAR model.
The paper is structured into the following sections. Section 2 provides definitions of multiscale Granger causality and state space models and introduces the proposed approach in detail. Section 3 outlines the evaluation indicators adopted in this research and presents a comparative experiment between the methods of this paper and the wavelet-based method. The experimental results are then illustrated and discussed. Lastly, Section 4 concludes the study and offers suggestions for future work.

2. Materials and Methods

2.1. Granger Causality

Start with n smooth complex processes Y n = [ y 1 , n , , y M , n ] T , which are composed of M zero-mean scalar processes, respectively. Suppose y j is the target variable and y i is the driver variable, then the remaining M 2 processes of Y n make up Y k , where k denotes the difference set of { 1 , 2 , , M } and { i , j } . Let Y n denotes the past information of Y n , y n denotes the past information of y n . Then, the Granger causality from y i to y j means the extent of predicted improvement that based on the use of y j , n and Y k , n to predict y j , n , the addition of y i , n makes.
Use all past processes to predict the present of target variable, which is named full regression. The prediction error for this method can be determined using the following equation:
ε n = y j , n E [ y j , n | Y n ]
where E is the expectation operator.
The prediction error of the restricted regression is obtained by using the past of all variables, except the driving variable to predict the present of the target variable as follows:
ε n R = y j , n E [ y j , n | y j , n , Y k ]
where the superscript R represents reduced regression.
By calculating the log-likelihood values of the two regression prediction errors which are λ = E [ ε n ε n T ] and λ R = E [ ε n R ( ε n R ) T ] , where the superscript T represents the transpose. The Granger causality values from y i to y j can be obtained as follows:
F i j = l n λ R λ

2.2. State Space Model

Consider a general constant state space (SS) model,
X t + 1 = A X t + W t Y t = C X t + V t
where W t and V t are noisy variables with semi-positive definite covariance, which means v a r W t V t = Q S S T R . The parameters for model ( 3 ) is ( A , C , Q , R , S ) .
After projecting the state variable onto the space tensed by the past of the observed variable, that is Z t = E X t | Y t 1 , where Y t 1 = Y t 1 T , Y t 2 T T is an infinite column vector. Z t can also be considered the new state variable as well as the residuals of a linear regression on the variable y t , using infinite past.
In this way, the innovations of state space model (ISS) for Z t which is equal to SS model can be described as follows:
Z t + 1 = A Z t + K E t , Y t = C Z t + E t
where the innovation is defined as E t Y t E Y t | Y t 1 , with covariance matrix Σ E E t E t T . The parameters for model (4) are ( A , C , K , Σ ) .
According to the ISS model, the error of the full regression can be obtained as λ f u l l = Σ ( j , j ) . In order to obtain the error of the restricted regression, we consider the state space model (4) with the submodel, removing the driving variable. In this regard, the first equation retains the same form as in (4), while the second equation is formulated as follows:
Z t + 1 = A Z t + K ε t , Y t ( j k ) = C ( j k ) Z t + ε t ( j k )
where the superscript ( j k ) denotes the selection of the j , k rows in the matrix. At this point, the parameters of the state space model (5) are ( A , C ( j k ) , K Σ K T , Σ ( j k , j k ) , K Σ ( : , j k ) ) . It is notable that the ISS model and the SS model are mutually interchangeable as reported by Barnett and Takahashi in 2015 [17]. Specifically, the parameters of the SS model can be transformed into corresponding parameters of the ISS model by resolving the DARE equation. By implementing this transformation, the error λ r e s t r i c t e d = Σ R ( j , j ) of the constrained model can be deduced.
Finally, the Granger causality can be calculated as follows:
F i j = l n Σ R ( j , j ) Σ ( j , j )

2.3. Wavelet Transform

The approach of the wavelet transform, which is grounded on the short-time Fourier transform, provides a possibility for local analysis of time–frequency through examination of the fluctuation of each moment or spatial position across multiple time scales. Distinct from the Fourier transform, the wavelet transform exhibits tight support and, therefore, proves to be advantageous for investigating well-defined signals confined within a specific temporal window.
Let ψ ( t ) be the mother wavelet, and a stretching translation yields the subwavelet ψ a , b ( t ) = 1 a ψ ( t b a ) , where a 0 is called the scale factor for stretching, and 1 a ensures that ψ a , b ( t ) has a norm value of 1; b is called the translation factor and is used to translate. Then the discrete wavelet transform of the signal f ( t ) is defined as the function about a and b, which obtains the projection of the original signal f ( t ) onto the mother wavelet. The * in the following equation means complex conjugation:
W ψ f ( a , b ) = t f ( t ) ψ a , b ( t )
Several mother wavelets are frequently utilized in signal analysis, such as Haar wavelets, Mexican cap wavelets, and Morlet wavelets. However, not all wavelets are appropriate for this purpose. For instance, while Mallat wavelets utilize orthogonal bases, they lack translation invariance. As such, to acquire a translation-invariant wavelet decomposition, the discrete wavelet transform of the a ´ trous algorithm is implemented for signal decomposition.
The a ´ trous algorithm separates the high-frequency and low-frequency information in the signal f ( t ) by repeatedly using low-pass filtering h ( n ) . In this paper, h = 1 16 [ 1 , 4 , 6 , 4 , 1 ] is chosen as the low-pass filter. The high-frequency information corresponds to the coefficient terms of the wavelet transform and the low-frequency information corresponds to the approximation terms of the wavelet coefficients. The approximation of the signal at each scale is first obtained:
a s ( t ) = n h ( n ) a s 1 ( t + 2 s 1 n ) ( s = 1 , , N )
where a 0 ( t ) is the original discrete signal f ( t ) , n = 1 , 2 , , 5 , s is the scale index, and N is the maximum number of scales. Then the wavelet coefficients are defined as c s ( t ) = a s 1 ( t ) a s ( t ) . The reconstruction of the signal is completed by adding the last smoothed signal a N ( t ) and the wavelet coefficients at all scales as follows:
f ( t ) = a N ( t ) + s = 1 N c s ( t )
It is worth noting that the a ´ trous algorithm has the same length at each scale as the original signal, which allows the signal to be decomposed without bias, as well as better detection of detailed signals.

2.4. The Analysis Framework Proposed in This Paper

Building on the theories explained earlier, this paper introduces a novel method called Granger causality with wavelet transform and state space model (GTSS), which considers replacing the driving variable y i with the wavelet coefficients at the corresponding scale and using state space model for Granger causality analysis. Since the restricted regression does not use the data of the driving variable y i , the error of the restricted regression is not affected by the wavelet transform and remains as shown in Equation (2). Meanwhile, the prediction error of the full regression is ε s = y j E [ y j | y j , Y k , c s ] , then we can gain the variance by Σ s R ( j , j ) = E [ ε s ε s T ] .
Therefore, under the scale s, the Granger causality values from y i to y j can be obtained as follows:
F i j s = l n Σ s R ( j , j ) Σ ( j , j )
The proposed analytical framework of GTSS involves the following steps:
Step 1: Employ the a ´ trous wavelets as the mother wavelet to perform wavelet decomposition on the original time series, and obtain the wavelet coefficients C n = [ c 1 , n , , c N , n ] at each scale.
Step 2: Replace the data of driving variable y i with the wavelet coefficients c s for scale s, resulting in Y s = [ y 1 , n , , y i 1 , n , c s , i , y i + 1 , n , , y M , n ] T .
Step 3: Estimate the model parameters using the modified Y s and calculate the Granger causality values at scale s.
Step 4: Repeat Step 2 to 3 by varying the scale value s.
Step 5: Save the Granger causality values across multiple scales.
The flow of empirical analysis is presented in Figure 1 which adopts a multi-step methodology.
The process begins with conducting the augmented Dickey–Fuller (ADF) stationary test. If the data are observed to be stationary, Granger causality analysis follows. Alternatively, if the data are non-stationary, differential transform is performed until stationarity is obtained before proceeding with the GTSS method. The GTSS method involves wavelet transform, including wavelet decomposition and wavelet coefficient replacement, followed by Granger causality estimation at each scale s. The process iterates by changing the scale value s and repeating wavelet transform and causality estimation. Upon obtaining the hypothesis test results through the GTSS method, evaluation indicators, such as the reject rate and mean square error, are calculated to confirm the presence of a causal relationship.

3. Simulation Study and Results

This section presents experiments on Granger causality with the wavelet transform method (GT) proposed by Stramaglia S. et al. [32] and the GTSS method proposed in this paper, analyzing the results for the bivariate and multivariate cases, respectively, and comparing the advantages and disadvantages of the two methods.

3.1. Evaluation Indicator

The following experiments incorporate several evaluation indices that assist in the analysis. The calculation of each is explained below.
First, the coefficient bias [8] between the estimated and defined values of the VAR models is calculated to investigate the validity of the model estimation method in this paper. The coefficient bias (CB) is defined as follows:
CB = A A T 1 A 1
where 1 is the L1 norm, A is the estimated value of the model coefficient matrix, and A T is a pre-defined model coefficient matrix.
However, due to the absence of prior knowledge of the actual causal flow in reality, it is difficult to determine the accuracy of the utilized model, which is typically evaluated by comparing to the real causality. To address this challenge, we developed the reject rate and overall reject rate as evaluation metrics for assessing causal outcomes.
The reject rate (RR), which is calculated according to the significance tests’ results, is defined as follows:
RR = N R S T N S
where N R S represents the number of rejections for significance tests, and T N S represents the total number for significance tests.
This evaluation index measures the innate detection capability of the algorithm. A low rejection rate in instances of actual causality suggests that the algorithm is proficient in generating accurate estimates without resorting to hypothesis testing, thereby reducing reliance on such methods. Conversely, a high rejection rate in situations where no actual causality exists demonstrates that the accuracy of the estimate can be improved through the application of hypothesis testing techniques.
To better see the overall effect, in this paper, the reject rates in both cases are combined into one indicator named overall reject rate (ORR) as follows:
ORR = R R E + 1 R R N
where R R E is the reject rate where Granger causality exists, and R R N represents the reject rate for when it does not exist.
The mean square error (MSE) from i to j and accuracy are used in the simulation study and calculated as follows:
MSE i j = 1 n t r i a l Σ n = 1 n t r i a l ( G C n G C t ) 2
where n t r i a l is the number of trials, G C n is the estimated Granger causality value from i to j in the n th trial, and G C t is the theoretical value.
The final indicator used in this paper is accuracy, a commonly utilized measure to evaluate the degree of concordance between predicted outcomes and actual outcomes. It is defined as follows:
Accuracy = T P T P + F P
where T P means the number of consistent results between the estimated and predefined models, while F P denotes the number of instances where the predicted results differ from the predefined causality.

3.2. Example 1: Multivariate Series System

In this section, a VAR(3) model is used to conduct simulations and preform theoretical analysis. The VAR model equation is presented below,
x 1 x 2 x 3 = 2 r 1 c o s ( θ 1 ) 0 0 0.356 2 r 2 c o s ( θ 2 ) 0 0 0.3098 2 r 3 c o s ( θ 3 ) x 1 , t 1 x 2 , t 1 x 3 , t 1 + r 1 2 0 0 0.7136 r 2 2 0 0 0.5 r 3 2 x 1 , t 2 x 2 , t 2 x 3 , t 2 + 0 0 0 0.356 0 0 0 0.3098 0 x 1 , t 3 x 2 , t 3 x 3 , t 3 + ω 1 , t ω 2 , t ω 3 , t
where r i = { 0.9 , 0.7 , 0.8 } , θ i = f i Δ t 2 π , i = { 1 , 2 , 3 } , and suppose the sample rate is 1 Δ = 120 Hz and let f i = { 40 , 10 , 50 } Hz. Thus, a model which has the causality from x 1 to x 2 and from x 2 to x 3 can be obtained.
Multiple sets of observations were generated based on the model coefficients, using three different numbers of trials denoted as n t r i a l = { 1 , 10 , 100 } . Each time series observation within the sets is of length N = 500 .
Initially, the impact of varying model order on model estimation is explored. The value of the trial and model order are set to 10 , 100 and 3 , 4 , 5 , 10 , 20 , respectively. The following experiments are all executed with the maximum scale 4 and the significant value α = 0.05 .
Table 1 displays the outcomes obtained by GT and GTSS. To accurately estimate the coefficients of the VAR model for the time series data, a suitable estimation technique is necessary since the coefficients serve as the basis for defining and computing Granger causality. The accuracy of the coefficient estimation method plays a pivotal role in the effectiveness of the Granger causality model estimation. In this paper, the VAR coefficients utilized are estimated using the locally weighted regression (LWR) algorithm, which recursively computes the coefficient matrix of the inferred regression model. This algorithm possesses the merits of stability and swift computation, in comparison to the ordinary least square (OLS) algorithm. Moreover, it proves to be more appropriate for selecting models based on likelihood procedures, such as the Akaike information criterion (AIC) and Bayesian information criterion (BIC) [14].
Upon examining the experimental outcomes, we observe that for various trial and order numbers, the variation of the VAR coefficients does not surpass 0.1, implying that the LWR algorithm comparatively optimizes the VAR model. However, as the model order heightens, bias occurs in the coefficient estimation procedure that grows expeditiously. This phenomenon could be attributed to heightened computation noise with respect to larger orders [8]. To minimize this bias, a solution might entail repeated experimentation.
By comparing the GTSS and GT methods, it can be found that under the same number of trials and model orders, the rejection rate of the GTSS method is smaller than that of GT, and the correct rate is much higher than that of GT. Under the same number of trials of different orders, there is no obvious pattern in the change of the correct rate.
When the number of trials remains steady while model orders vary, the correct rate tends to decrease and then rise again as the model order increases. Additionally, in the GTSS approach, the rejection rate displays a similar pattern of initially increasing and subsequently decreasing, while the GT rejection rate increases as the end of the model rises. Regarding ORR, GTSS is generally inferior to GT in numerous instances.
Based on the above discussion, it is easy to see that the model order has a greater impact on the Granger causality analysis, and in order to better fit the model corresponding to the data, we use an order adaptive approach rather than a fixed order. The model order was estimated according to the AIC criterion, and Granger causality of the model was estimated using the GT and GTSS methods, respectively.
Table 2 displays the comparison of accuracy and ORR between GTSS and GT. In terms of accuracy, GTSS generally outperforms GT, and GT increases with the number of trials. On the other hand, GTSS achieves the maximum accuracy at trial number 1. This inders that the GTSS method achieves better results for single trial, which is in line with the practical application. Since single sets of experimental data are typical in practical scenarios, additional methods such as the Amplitude Adjusted Fourier–Transformed (AAFT) or improved AAFT (IAAFT) algorithm become necessary to generate test data with matching distribution and autocorrelation for multiple trial analyses [33]. In comparison, both techniques perform consistently in terms of ORR, as their accurate rates indicate.
The results of 100 trials are shown in Figure 2 as an example. The GTSS method only generated GC values on x 1 x 2 and x 2 x 3 where Granger causality are truly present, whereas the GT method generated Granger causality under all combinations of variables, some of which are spurious.
The dispersion in this picture reflects the 90 percent confidence interval, while the median represents the median Granger causality value of 100 trials.

3.3. Example 2: Bivariate Series System

Next, a VAR(1) model is analysis which has the following structure,
x t y t = A x t 1 y t 1 + ω 1 , t ω 2 , t
where A = 0.4 0.8 0 0.5 , hence the model has the Granger causality from y to x.
According to Equation (6) [34], the theoretical Granger causality value in s c a l e = 1 is 0.5578.
F y x = l n 1 + A 22 2 + A 12 2 + ( 1 + A 22 2 + A 12 2 ) 4 A 22 2 2
Then 100 realization with 10 4 nobs are generated according to VAR(1) model. The Granger causality of the model is estimated using the GT and GTSS methods respectively and the results are shown in Table 3, where MSE is the mean square error at s c a l e = 1 . Based on the results in Table 3, it is clear that the GTSS method outperforms the GT method for all three indicators.
The scale—GC plots were plotted before and after hypothesis testing in Figure 3. From Figure 3a, the estimates of both methods were small in the direction of no causality, which is the 1 to 2 direction in the picture. After the hypothesis testing shown in Figure 3b, GTSS could obtain the correct causality relationship, but GT still had individual spurious causality. In addition, GTSS yields GC estimates that are close to the theoretical values, while GT estimates are considerably smaller than GTSS and the theoretical values on several scales.
We plotted scale—GC plots in Figure 4 in order to explore the relationship between causal strength and the GC value by varying the A ( 2 , 1 ) term in the coefficient matrix A, which is the causal strength of y x . The GC value displayed in the figure corresponds to the mean value obtained after 100 trials. The results demonstrated that the GC values obtained using both methods increased as the value of causality strength increased, eventually reaching a maximum value at the lag of 1.
The MSE of the GC values for both methods at s c a l e = 1 is compared in Table 4. At minimal levels of causal strength, the MSE of both techniques is negligible in part due to comparatively low GC estimates and the absence of significant outliers in their estimations. Nevertheless, as causal strength heightens, the MSE concurrently escalates. In comparison to GT, the MSE remains consistently lower in GTSS, indicating its superior ability to restore Granger causality values.

4. Conclusions

System transformations exhibit a multi-timescale structure and possess localization properties in the time domain, resulting in fluctuations that transpire across multiple time scales. These attributes cannot be sufficiently scrutinized with conventional time domain Granger causality analyses. Such analyses rely on VAR models, which are vulnerable to noise and downsampling challenges, and lack the ability for multiscale analyses.
In this study, the GTSS approach, which analyzes Granger causality between multivariate time series on multiple time scales by combining the state space model with wavelet transform, is suggested. To be more precise, we first substitute the driving variable with the result of the wavelet transform before incorporating the transform data into the state space model. The hypothesis test is carried out by calculating the residuals of the models. Finally, the Granger causality value and the relationship between the various factors may be determined.
The experimental results show that GTSS has better capability for multiscale Granger causality analysis than the GT method based on the VAR model. The comparative analysis between the two methods highlights the superior performance of GTSS in accurately identifying causal relationships, especially in situations where they are asymmetric in nature.
The relationship between model order, number of trials and strength of causality and Granger causality is investigated. Our results reveal that the causality value decreases at higher time scales, while the accuracy improves with an increase in the number of trials, providing valuable insights for future studies to select appropriate parameter settings for these parameters, thereby enhancing the accuracy of Granger causality analyses.
Furthermore, the multiscale analysis using empirical model methods provides a better view of the underlying driving variables according to He K. et al. [35]. Deep learning has been effectively applied to the study of Granger causality to determine causal linkages in complicated systems [36]. Therefore, future study could examine data processing in conjunction with empirical modal decomposition or employ different deep learning models for modeling time series.

Author Contributions

Conceptualization and methodology, Q.Z. and C.Z.; investigation, Q.Z.; software, Q.Z. and S.C.; writing—original draft preparation, Q.Z.; writing—review and editing, Q.Z., S.C. and C.Z.; funding acquisition, C.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Guangdong Guangxi Joint Fund General Project with No. 2021A1515410005.

Data Availability Statement

No new data were created or analyzed in this study.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
VARVector auto regression
GCGranger causality
GTGranger causality with wavelet transform method
GTSSGranger causality with wavelet transform and state space method

References

  1. Granger, C.W.J. Economic processes involving feedback. Inf. Control 1963, 6, 28–48. [Google Scholar] [CrossRef] [Green Version]
  2. Wiener, N. The theory of prediction. Mod. Math. Eng. 1956, 165–190. [Google Scholar]
  3. Granger, C.W. Investigating causal relations by econometric models and cross-spectral methods. Econom. J. Econom. Soc. 1969, 37, 424–438. [Google Scholar] [CrossRef]
  4. Granger, C.W. Some properties of time series data and their use in econometric model specification. J. Econom. 1981, 16, 121–130. [Google Scholar] [CrossRef]
  5. Peng, Y.; Chen, W.; Wei, P.; Yu, G. Spillover effect and Granger causality investigation between China’s stock market and international oil market: A dynamic multiscale approach. J. Comput. Appl. Math. 2020, 367, 112460. [Google Scholar] [CrossRef]
  6. Wang, S.; Zhao, Q.; Han, Y.; Wang, J. Root cause diagnosis for complex industrial process faults via spatiotemporal coalescent based time series prediction and optimized Granger causality. Chemom. Intell. Lab. Syst. 2023, 233, 104728. [Google Scholar] [CrossRef]
  7. Cîrstian, R.; Pilmeyer, J.; Bernas, A.; Jansen, J.F.; Breeuwer, M.; Aldenkamp, A.P.; Zinger, S. Objective biomarkers of depression: A study of Granger causality and wavelet coherence in resting-state fMRI. J. Neuroimaging 2023, 33, 404–414. [Google Scholar] [CrossRef]
  8. Gao, X.; Huang, W.; Liu, Y.; Zhang, Y.; Zhang, J.; Li, C.; Bore, J.C.; Wang, Z.; Si, Y.; Tian, Y.; et al. A novel robust Student’s t-based Granger causality for EEG based brain network analysis. Biomed. Signal Process. Control 2023, 80, 104321. [Google Scholar] [CrossRef]
  9. Kathpalia, A.; Nagaraj, N. Granger causality for compressively sensed sparse signals. Phys. Rev. E 2023, 107, 034308. [Google Scholar] [CrossRef]
  10. Geweke, J. Measurement of linear dependence and feedback between multiple time series. J. Am. Stat. Assoc. 1982, 77, 304–313. [Google Scholar] [CrossRef]
  11. Geweke, J.F. Measures of conditional linear dependence and feedback between time series. J. Am. Stat. Assoc. 1984, 79, 907–915. [Google Scholar] [CrossRef]
  12. Barnett, L.; Seth, A.K. Detectability of Granger causality for subsampled continuous-time neurophysiological processes. J. Neurosci. Methods 2017, 275, 93–121. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  13. Seth, A.K. A MATLAB toolbox for Granger causal connectivity analysis. J. Neurosci. Methods 2010, 186, 262–273. [Google Scholar] [CrossRef] [PubMed]
  14. Barnett, L.; Seth, A.K. The MVGC multivariate Granger causality toolbox: A new approach to Granger-causal inference. J. Neurosci. Methods 2014, 223, 50–68. [Google Scholar] [CrossRef] [Green Version]
  15. Marica, V.G.; Horobet, A. Conditional Granger Causality and Genetic Algorithms in VAR Model Selection. Symmetry 2019, 11, 1004. [Google Scholar] [CrossRef] [Green Version]
  16. Hu, Z.; Li, F.; Cheng, M.; Lin, Q. Investigating Large-Scale Network with Unified Granger Causality Analysis. Comput. Math. Methods Med. 2022, 2022, 6962359. [Google Scholar] [CrossRef]
  17. Barnett, L.; Seth, A.K. Granger causality for state-space models. Phys. Rev. E 2015, 91, 040101. [Google Scholar] [CrossRef] [Green Version]
  18. Solo, V. On causality I: Sampling and noise. In Proceedings of the 2007 46th IEEE Conference on Decision and Control, New Orleans, LA, USA, 12–14 December 2007; IEEE: Piscataway, NJ, USA, 2007; pp. 3634–3639. [Google Scholar]
  19. Aoki, M.; Havenner, A. State space modeling of multiple time series. Econom. Rev. 1991, 10, 1–59. [Google Scholar] [CrossRef]
  20. Cai, R.; Lin, Z.; Chen, W.; Hao, Z. Shared state space model for background information extraction and time series prediction. Neurocomputing 2022, 468, 85–96. [Google Scholar] [CrossRef]
  21. Ivanov, P.C.; Amaral, L.A.N.; Goldberger, A.L.; Havlin, S.; Rosenblum, M.G.; Struzik, Z.R.; Stanley, H.E. Multifractality in human heartbeat dynamics. Nature 1999, 399, 461–465. [Google Scholar] [CrossRef] [Green Version]
  22. Chou, C.M. Wavelet-based multi-scale entropy analysis of complex rainfall time series. Entropy 2011, 13, 241–253. [Google Scholar] [CrossRef] [Green Version]
  23. Kang, X.; Jia, X.; Geocadin, R.G.; Thakor, N.V.; Maybhate, A. Multiscale entropy analysis of EEG for assessment of post-cardiac arrest neurological recovery under hypothermia in rats. IEEE Trans. Biomed. Eng. 2009, 56, 1023–1031. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Costa, M.D.; Peng, C.K.; Goldberger, A.L. Multiscale analysis of heart rate dynamics: Entropy and time irreversibility measures. Cardiovasc. Eng. 2008, 8, 88–93. [Google Scholar] [CrossRef] [Green Version]
  25. Faes, L.; Nollo, G.; Stramaglia, S.; Marinazzo, D. Multiscale granger causality. Phys. Rev. E 2017, 96, 042150. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Adebayo, T.S.; Ullah, S.; Kartal, M.T.; Ali, K.; Pata, U.K.; Ağa, M. Endorsing sustainable development in BRICS: The role of technological innovation, renewable energy consumption, and natural resources in limiting carbon emission. Sci. Total Environ. 2023, 859, 160181. [Google Scholar] [CrossRef]
  27. Ullah, S.; Adebayo, T.S.; Irfan, M.; Abbas, S. Environmental quality and energy transition prospects for G-7 economies: The prominence of environment-related ICT innovations, financial and human development. J. Environ. Manag. 2023, 342, 118120. [Google Scholar] [CrossRef]
  28. Adebayo, T.S.; Ullah, S. Formulating sustainable development policies for China within the framework of socioeconomic conditions and government stability. Environ. Pollut. 2023, 328, 121673. [Google Scholar] [CrossRef]
  29. Zhao, X.; Sun, Y.; Li, X.; Shang, P. Multiscale transfer entropy: Measuring information transfer on multiple time scales. Commun. Nonlinear Sci. Numer. Simul. 2018, 62, 202–212. [Google Scholar] [CrossRef]
  30. Lungarella, M.; Pitti, A.; Kuniyoshi, Y. Information transfer at multiple scales. Phys. Rev. E 2007, 76, 056117. [Google Scholar] [CrossRef] [Green Version]
  31. Starck, J.L.; Murtagh, F.D.; Bijaoui, A. Image Processing and Data Analysis: The Multiscale Approach; Cambridge University Press: Cambridge, UK, 1998. [Google Scholar]
  32. Stramaglia, S.; Bassez, I.; Faes, L.; Marinazzo, D. Multiscale Granger causality analysis by à trous wavelet transform. In Proceedings of the 2017 7th IEEE International Workshop on Advances in Sensors and Interfaces (IWASI), Vieste, Italy, 15–16 June 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 25–28. [Google Scholar]
  33. Lin, C.S.; Chiu, S.H.; Lin, T.Y. Empirical mode decomposition–based least squares support vector regression for foreign exchange rate forecasting. Econ. Model. 2012, 29, 2583–2590. [Google Scholar] [CrossRef]
  34. Barnett, L.; Seth, A.K. Behaviour of Granger causality under filtering: Theoretical invariance and practical application. J. Neurosci. Methods 2011, 201, 404–419. [Google Scholar] [CrossRef] [PubMed]
  35. He, K.; Zha, R.; Wu, J.; Lai, K.K. Multivariate EMD-based modeling and forecasting of crude oil price. Sustainability 2016, 8, 387. [Google Scholar] [CrossRef] [Green Version]
  36. Tank, A.; Covert, I.; Foti, N.; Shojaie, A.; Fox, E.B. Neural granger causality. IEEE Trans. Pattern Anal. Mach. Intell. 2021, 44, 4267–4279. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Flow of analysis. The green dashed part in this figure is the steps of the GTSS method.
Figure 1. Flow of analysis. The green dashed part in this figure is the steps of the GTSS method.
Symmetry 15 01286 g001
Figure 2. This is the results of VAR(3) with 100 trials. The y-axis is the Granger causality value and the x-axis is the scale. The orange and gray dispersion area contain the 100 trials’ results of GTSS and GT respectively.
Figure 2. This is the results of VAR(3) with 100 trials. The y-axis is the Granger causality value and the x-axis is the scale. The orange and gray dispersion area contain the 100 trials’ results of GTSS and GT respectively.
Symmetry 15 01286 g002
Figure 3. The results of VAR(1) before and after hypothesis test. (a) The result before hypothesis test; (b) the result after hypothesis test. The orange and gray parts represent GTSS and GT as described in Figure 2.
Figure 3. The results of VAR(1) before and after hypothesis test. (a) The result before hypothesis test; (b) the result after hypothesis test. The orange and gray parts represent GTSS and GT as described in Figure 2.
Symmetry 15 01286 g003
Figure 4. The plot displays the scale—GC for various causal strengths, where the y-axis represents the Granger causality value, and the x-axis represents the scale. The corresponding line colors for different causality strengths are indicated in the legend located on the right-hand side of the plot. (a) the results of GTSS; (b) the results of GT.
Figure 4. The plot displays the scale—GC for various causal strengths, where the y-axis represents the Granger causality value, and the x-axis represents the scale. The corresponding line colors for different causality strengths are indicated in the legend located on the right-hand side of the plot. (a) the results of GTSS; (b) the results of GT.
Symmetry 15 01286 g004
Table 1. The results of GT and GTSS.
Table 1. The results of GT and GTSS.
GTSSGT
TrialOrderCBRRAccuracyORRRRAccuracyORR
1030.03220.79580.85420.14580.79580.67080.3097
40.06000.80830.83330.16670.84380.62080.3313
50.07470.84170.80830.19170.85230.64580.3223
100.07730.80000.84170.15830.89510.67920.3126
200.08600.77500.87500.12500.94160.72080.3234
10030.01010.80750.85250.14750.80760.67670.3079
40.02020.84040.82040.17960.85160.66000.3129
50.02500.85130.81130.18880.86350.68250.3143
100.02280.81710.84210.15790.89890.72920.3136
200.02750.79250.87000.13000.94210.72710.3229
Table 2. The accuracy and ORR value for different trials.
Table 2. The accuracy and ORR value for different trials.
TrialAccuracyORR
GTSSGTGTSSGT
10.8750.5420.1250.334
100.8420.6540.1580.309
1000.8480.7230.1520.298
Table 3. Comparison between GT and GTSS with the results of 100 trials.
Table 3. Comparison between GT and GTSS with the results of 100 trials.
MethodAccuracyORRMSE
GT0.8690.2840.252
GTSS0.9980.0030.002
Table 4. The MSE with different causal strength. MSE GTSS means the MSE using GTSS method, and MSE G T means the MSE using GT method.
Table 4. The MSE with different causal strength. MSE GTSS means the MSE using GTSS method, and MSE G T means the MSE using GT method.
StrengthMSE GTSS MSE GT
0.050.00000.0000
0.10.00000.0001
0.20.00010.0018
0.30.00030.0081
0.40.00060.0236
0.50.00110.0527
0.80.00250.2519
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, Q.; Zhang, C.; Cheng, S. Wavelet Multiscale Granger Causality Analysis Based on State Space Models. Symmetry 2023, 15, 1286. https://doi.org/10.3390/sym15061286

AMA Style

Zhang Q, Zhang C, Cheng S. Wavelet Multiscale Granger Causality Analysis Based on State Space Models. Symmetry. 2023; 15(6):1286. https://doi.org/10.3390/sym15061286

Chicago/Turabian Style

Zhang, Qiyi, Chuanlin Zhang, and Shuangqin Cheng. 2023. "Wavelet Multiscale Granger Causality Analysis Based on State Space Models" Symmetry 15, no. 6: 1286. https://doi.org/10.3390/sym15061286

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop