Next Article in Journal
Application of Multiscale Sample Entropy in Assessing Effects of Exercise Training on Skin Blood Flow Oscillations in People with Spinal Cord Injury
Next Article in Special Issue
Graph Regression Model for Spatial and Temporal Environmental Data—Case of Carbon Dioxide Emissions in the United States
Previous Article in Journal
Restricted Phase Space Thermodynamics of Einstein-Power-Yang–Mills AdS Black Hole
Previous Article in Special Issue
Robust Variable Selection with Exponential Squared Loss for the Spatial Single-Index Varying-Coefficient Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Spatio-Temporal Analysis of Marine Water Quality Data Based on Cross-Recurrence Plot (CRP) and Cross-Recurrence Quantitative Analysis (CRQA)

1
College of Artificial Intelligence, North China University of Science and Technology, Bohai Road, Tangshan 063210, China
2
Hebei Key Laboratory of Industrial Perception, Tangshan 063210, China
3
Marine Ecological Restoration and Smart Ocean Engineering Research Center of Hebei Province, Qinhuangdao 066000, China
*
Author to whom correspondence should be addressed.
Entropy 2023, 25(4), 689; https://doi.org/10.3390/e25040689
Submission received: 15 February 2023 / Revised: 12 April 2023 / Accepted: 14 April 2023 / Published: 19 April 2023
(This article belongs to the Special Issue Spatial–Temporal Data Analysis and Its Applications)

Abstract

:
In recent years, with the frequency of marine disasters, water quality has become an important environmental problem for researchers, and much effort has been put into the prediction of marine water quality. The temporal and spatial correlation of marine water quality parameters directly determines whether the marine time-series data prediction task can be completed efficiently. However, existing research has only focused on the correlation analysis of marine data in a certain area and has ignored the temporal and spatial characteristics of marine data in complex and changeable marine environments. Therefore, we constructed a spatio-temporal dynamic analysis model of marine water quality based on a cross-recurrence plot (CRP) and cross-recurrence quantitative analysis (CRQA). The time-series data of marine water quality were first mapped to high-dimensional space through phase space reconstruction, and then the dynamic relationship among various factors affecting water quality was visually displayed through CRP. Finally, their correlation was quantitatively explained by CRQA. The experimental results showed that our scheme demonstrated well the dynamic correlation of various factors affecting water quality in different locations, providing important data support for the spatio-temporal prediction of marine water quality.

1. Introduction

With the acceleration of industrialization, the discharge load of industrial, agricultural, and domestic sewage in coastal areas has increased, and the deterioration of marine water quality has worsened year by year. Red tide disasters and the eutrophication of water bodies occur frequently. The quality of marine water is not only important for the economy, but it also affects human life. Therefore, the analysis of marine water quality has become a hot topic for researchers in recent years. Industrialization, mining, pollution, and natural disasters affect water quality. They introduce or change various parameters of water, thus affecting whether water is suitable for human consumption or general use [1].
Because of the important role of the marine environment in social development, marine data analysis has always been a hot topic. Water quality is affected by many factors, such as meteorology, chemistry, and human activities, which leads to the nonlinearity, randomness, and delay of water quality parameters. Statistical learning and other methods are widely used in the analysis and modeling of marine data. Jiang et al. [2] used the parallel FP-growth algorithm to analyze the oxygen, temperature, phosphates, nitrates, and silicates in the ocean. Meanwhile, based on association rules, they analyzed the correlation between different data. Erni-Cassola et al. [3] used the meta-analysis method to detect increased marine plastic debris in the sea surface and explored the separation degree of plastic debris in the water according to the density of microplastic polymers, so as to further determine whether the plastic debris sank under the sea. Lemenkova [4] used a statistical database embedded in Python for data analysis to study the interaction between environmental factors affecting the seafloor geomorphology of the Mariana Trench, established a geospatial data model, and studied the inhomogeneity of the seafloor structure. Deng et al. [5] put forward a water quality analysis framework based on the time-series data-mining method, determined the relationship between water quality in the main stream and tributaries of the Yangtze River and the variation rule of dissolved oxygen, and effectively mined valuable knowledge from historical time-series data of water quality. However, the above research methods did not pay attention to the changes in the data across time and space, ignoring the effects brought about by cross-regional factors. Additionally, they lacked an analysis of the internal dynamic characteristics and spatial characteristics of the data. It is necessary to accurately analyze the non-stationarity and spatial heterogeneity of marine data.
A CRP can mine the correlation characteristics of time-series data in space-time, process nonlinear and non-stationary high-dimensional data, and realize the visualization analysis of marine data information. A CRP detects the correlation between signals by mapping two signals to the same phase space. The details of dynamic changes between two signals can be intuitively explained by the classical structure of points, diagonals, and rectangles. In addition, the traditional analysis method of a CRP is CRQA, which describes the CRP of signals in different states through nonlinear characteristic quantities such as the recurrence rate, certainty rate, recurrence entropy, and stratification rate. Both have excellent applications in disease diagnosis, industrial fault identification, financial market analysis, intelligent oceans, and many other fields. Kanakambaran et al. [6] used a CRP to analyze signals captured by fiber sensors in order to improve the accuracy of partial discharge detection and localization. González et al. [7] used a CRP and CRQA to analyze systolic blood pressure and inter-beat intervals in healthy persons and nephrotic patients. Wang et al. [8] applied a CRP and CRQA to the time recording of current and voltage fluctuations in electrochemical noise analysis to identify dynamic characteristics and achieve efficient detection results. Li et al. [9] demonstrated the dependent behavior of chlorophyll a with various marine factors using a CRP and CRQA and quantified their dynamic similarity using recurrence analysis. Currently, the use of CRPs and CRQA in the marine field mainly focuses on a single area and lacks the exploration of the spatio-temporal dynamic characteristics of marine data. In view of the advantages of CRPs and CRQA in many fields and in the analysis of marine environmental data, in this study, we used them to analyze the dynamic correlation of marine water quality in time and space.
In order to solve the problem that marine water quality data analysis mainly focuses on a single region and ignores the spatio-temporal impact across regions, this paper proposes a spatio-temporal dynamic analysis model of marine water quality data based on CRPs and CRQA. Due to the complex and changeable characteristics of the marine environment, various data not only interact with each other, but also interact with factors in different locations. Considering the influence of time and space, the accurate analysis of marine data is the key premise for water quality prediction. Therefore, we selected eight major factors from three stations to analyze the dynamic correlation among different stations. Specifically, the scheme consisted of phase space reconstruction (PSR), spatio-temporal dynamic CRP analysis, and CRQA quantification. Firstly, a marine factor of one station and influencing factors of the other two stations were mapped to the same phase space, and then their dynamic correlation was displayed through a CRP. Some key indicators of CRQA are used to evaluate the correlation of multiple marine water quality parameters. In particular, mutual information entropy can be used to measure the dependence between two discrete variables, so mutual information entropy is commonly used as an index to evaluate the correlation degree of marine factors. Finally, we could screen out the key factors affecting water quality. The results indicated that the scheme proved the interaction between marine data of different stations and effectively selected important factors, providing data support for the input of future marine prediction models. The key contributions of this work can be summarized as follows.
  • This was the first attempt to use CRPs and CRQA to study the dynamic association between water quality factors at different stations, as well as to analyze the factors in time and space.
  • The water quality time-series data were converted to high-dimensional phase space through phase space reconstruction. After the reconstruction parameters were determined, the dynamic correlation relationship between the water quality data of different stations was displayed through a CRP, and the correlation degree was quantitatively described by CRQA and mutual information entropy.
  • The spatial correlation of different influencing factors of water quality was confirmed, providing more reliable data support for the input of marine prediction tasks.
Specifically, the framework of the model is shown in Figure 1.
The rest of this paper is structured as follows. Section 2 summarizes some relevant concepts in the correlation analysis of water quality factors. In Section 3, the experimental results from the CRP and CRQA investigation of the dynamic correlation between water quality factors at different stations are presented, and the correlation between the main water quality factors is discussed and explained. Finally, Section 4 provides conclusions and prospective research directions.

2. Materials and Methods

Herein, a new spatio-temporal recurrence analysis model for the influencing factors of marine water quality is introduced in detail. Firstly, the model is summarized, and then the main components of the model are introduced, including PSR, CRP, and CRQA.

2.1. Overview

In order to find the factors affecting the dynamic correlation of marine water quality in time and space and determine the main influencing factors of the quality of the water, we set up an analysis model based on CRPs and CRQA. According to the investigation and analysis, we selected eight kinds of water-quality-influencing factors from three stations to analyze the spatio-temporal correlation between them. Specifically, our model consisted of the following three parts: PSR, CRP analysis, and CRQA quantification.
Taking pH as an example, firstly, the pH of the center station and the water-quality-influencing factors of the other three stations were mapped to the same phase space to calculate the reconstruction parameters. Secondly, the cross-recurrence matrix of the time series of the pH and another water quality factor was obtained through a CRP. Finally, some CRQA indicators were used to quantify the influence degree of each water quality factor on the pH. Based on the above experimental results, the temporal and spatial correlation between water quality factors was analyzed.

2.2. PSR Conversion

For chaotic time-series analysis, it is necessary to reconstruct the phase space formed by these sequence changes to explore the temporal dynamics of the system [10]. PSR can transform a one-dimensional time series to high-dimensional phase space. The reconstructed marine water quality time series could exhibit more nonlinear dynamic characteristics while preserving the continuity of the original series. PSR was considered as a prerequisite for CRPs in the task of the spatio-temporal correlation analysis of marine water quality factor data. Its purpose was to project the corresponding marine series into the high-dimensional phase space to obtain chaotic attractors. For the one-dimensional water quality time series w ( t ) = w 1 , w 2 , , w i with a length of i, the reconstructed n-dimensional phase space could be expressed as follows:
w = w 1 w 2 w N = w 1 + t w 1 + 2 t w 1 + ( n 1 ) t w 2 + t w 2 + 2 t w 2 + ( n 1 ) t w N + t w N + 2 t w N + ( n 1 ) t
where N = i + ( n 1 )t, t represents the delay time, and n represents the embedding dimension. In our paper, the mutual information method was considered to select the appropriate delay time, and the false nearest neighbors was used to select the suitable embedding dimension [11,12]. Note that unified reconstruction parameters should be selected so that two marine series can be reconstructed into the same phase space. A higher embedding dimension n and smaller delay time t were chosen in our scheme.

2.3. CRP Visualization

PSR is the first step in analyzing time series using a CRP [13]. After PSR, a CRP was used to detect the dynamic information of two marine water quality time series in the same phase space, and the data were visualized by two-dimensional graphics. The CRP was determined by a cross-recurrence matrix. For two reconstruction variables a and b , the matrix is defined as follows:
C R i , j a i · b j ( ε ) = Θ ε a i b j , i = 1 N , j = 1 M
Θ ( e ) = 1 , e 0 0 , e < 0
where N and M denote the length of a i and b j , respectively; ε is the threshold that is 0.6% of the maximum phase space diameter [11]; and Θ represents the Heaviside function.
C R i , j is a two-dimensional matrix containing 1 and 0. When the distance between two sequences in the same phase space is less than ε , C R i , j is 1 and represented as a black point in the CRP; otherwise, C R i , j is 0 and represented as a white point in the CRP. Matching black and white dots with 1 and 0 were used to visualize similar behavior between two time series.

2.4. CRQA Quantification

In the analysis of marine time-series data, we used CRQA to quantify the frequency of similar changes between the time series of two water quality factors and then confirmed the dynamic spatial correlation of the time series of different water quality factors. CRQA indexes include mean diagonal line length (MDL), determinism (DET), laminarity (LAM), and recurrence rate (RR) [14,15,16,17,18].
RR, a metric for the density of recurrence points, exposes the probability that the time series of water quality factors remain similar under a certain delay. A high RR value indicates a high probability of similar states between both time series. It is determined by
R R ( ε ) = 1 N 2 i , j = 1 N C R i , j ( ε )
DET represents the ratio of diagonal structures. Usually, a high DET implies a deterministic process, while a low DET suggests a random process. In our case, it could evaluate the regularity and predictability of the interaction between two marine time series, expressed by
D E T = l = l min N l P ( l ) l = 1 N l P ( l )
where l min is set to 2 [19], and P ( l ) denotes the histogram of diagonals, defined as
P ( l ) = i , j = 1 N 1 C R i 1 , j 1 1 C R i + l , j + l × k = 0 l 1 C R i + k , j + k
LAM is the percentage of recurrence points comprising vertical structures. It hints at the steady states between two marine water quality time series, calculated as follows:
L A M = v = v min N v P ( v ) v = 1 N v P ( v )
where v min is set to 2, and P ( v ) denotes the histogram of verticals, given by
P ( v ) = i , j = 1 N 1 C R i 1 , j 1 1 C R i , j + v × k = 0 v 1 C R i , j + k
MDL refers to the mean length of diagonal lines, and its value is the average time for which the time series of the marine water quality factors in the two groups are similar, given by
M D L = l = l min N l P ( l ) l = l min N P ( l )

2.5. Mutual Information Entropy

Information entropy is a measure of the uncertainty of random variable X in statistics. The higher the uncertainty of X, the greater the entropy. For a discrete random variable X, its probability distribution is consistent with p ( x ) = P ( X = x ) , where x X . Information entropy is defined as follows:
H ( X ) = x X p ( x ) log p ( x )
For two discrete random variables X and Y, the joint distribution probability is p ( X , Y ) , and the marginal distribution probability is p ( X ) , p ( Y ) , where x X and y Y . Then, according to the definition of information entropy, the joint distribution entropy is
H ( X , Y ) = x X y Y p ( x , y ) log p ( x , y )
Mutual information entropy is used to measure the degree to which one random variable X reduces the uncertainty of another random variable Y. The mutual information entropy between X and Y can be defined as
I ( X ; Y ) = H ( X ) + H ( Y ) H ( X , Y )

3. Results

In this section, we present a thorough experimental evaluation of marine water quality within the framework of CRPs and CRQA. Firstly, we introduce the sources of the marine water quality data used in this study, followed by the determination of the PSR parameters. Secondly, a CRP was used to visualize the spatio-temporal characteristics of the marine water quality parameters at the three stations. Finally, some key CRQA indicators and mutual information entropy were used to quantify the degree of the temporal and spatial correlation of these parameters. We also explain the importance of the above research work for future marine water quality prediction tasks.

3.1. Datasets

The data used in this experiment came from a region in the Bohai Sea of China, and the data were collected by land-based stations and ocean buoys. A total of 4320 samples were collected from 22 July to 20 October 2021, with an interval of 30 min. After consulting the data, eight factors affecting the marine water quality were selected, including dissolved oxygen (Do), chlorophyll a (Chl), turbidity (Turb), blue-green algae (Bga), total dissolved solids (Tds), dissolved oxygen saturation (DoP), water temperature (Temp), and pH [20,21,22,23,24,25]. In order to explore the spatio-temporal characteristics of the marine water quality factors, the water quality factors were gathered from three adjacent stations. The locations and distances are shown in Figure 2. Specially, due to the large number of parameters calculated, the following example uses pH as the key indicator of marine water quality in order to illustrate the experimental process.

3.2. Parameter Determination

Before CRP was used for the dynamic correlation analysis of the marine water water quality data, one-dimensional time-series data of water quality had to be mapped to high-dimensional space, which was achieved through PSR. The delay time and embedding dimension had to be calculated. Taking pH as an example, the dynamic correlation between other water quality factors and the pH value was analyzed. Table 1 and Table 2, respectively, show the embedding dimension and delay time of each water quality factor in the three stations. Table 3 shows the optimal matching of PSR parameters when pH and other water quality time series were reconstructed into the same high-dimensional space. A larger embedding dimension and a smaller delay time were generally considered the best matches.

3.3. Spatio-Temporal Visualization and Results

A CRP contains many classical structures to analyze the correlation of series, including recurrence points, diagonal lines, and vertical/horizontal diagonal distribution. Figure 3, Figure 4 and Figure 5 show the correlation between the pH at ST1 and the other water quality factors at ST1, ST2, and ST3. Taking pH as an example, we qualitatively determined the correlation between the water quality time-series data of the three stations and the pH of the central station, as shown in the figures below.
Figure 3 shows the correlation analysis between the pH and the other factors of the central station (ST1). It can be seen from the black recurrence points, rectangular structures, vertical horizontal lines, and other CRP structures in Figure 3 that all factors had a certain degree of correlation with the pH. Specifically, there are significantly more CRP structures in Figure 3d and Figure 3e,g than in Figure 3b,c. This indicates that the similarity of the recurrence behavior between Bga, Tds, and Temp and pH was stronger than that for Chl and Turb in ST1. The latter had a weak correlation with pH.
Figure 4 and Figure 5 show the analysis results of the temporal and spatial correlation between the water quality factors at ST2 and ST3 and the pH at the central station (ST1), respectively. It can be clearly seen that the factors in Figure 4 and Figure 5 also had similar characteristics to those in Figure 3. Specifically, there are a large number of rectangular structures in Figure 4e and Figure 5e, and there are many horizontal line segments in Figure 4g and Figure 5g. Nevertheless, in Figure 4b and Figure 5b, there are few black recurrence points and large areas of blank space. This reflects a more synchronized state between the Tds and Temp at ST2 and ST3 and the pH at the central station. The pH and Chl at ST1 did not present obvious behavioral similarities. The above visualization results were consistent with those shown in Figure 3. By analyzing the recurrence behavior of factors among multiple stations, the temporal and spatial dynamic correlation characteristics of the marine water quality factors were verified.

3.4. CRQA Analysis

As a quantitative analysis method of CRPs, CRQA includes several indicators to measure the influence degree of factors including RR, DET, MDL, LAM. In this study, we analyzed the influence degree of water quality factors through these four indicators and quantitatively evaluated the correlation of water quality factors by integrating the four indicators. Table 4 shows the quantification results of the four CRQA indexes for the correlation between seven water quality factors at three stations and pH at the central station.
RR indicates the probability that two water quality sequences had similar states. In the central station ST1, the RR value of Tds was 0.0727, which was significantly higher than that of the other water quality factors, indicating more similar recurrence states between the Tds and pH series. The RR value of Chl was only 0.0114, indicating a very weak correlation between the CHL and pH sequences. We also reached the same conclusion for the ST2 and ST3 stations, which indicated that the Tds at the ST2, ST3, and ST1 stations had a highly similar impact on the pH of the ST1 station. Similarly, Chl had the lowest RR values among the three stations, indicating that there was no obvious correlation between Chl and pH when analyzing the impact of a single station or multiple stations.
As an important measure of the similarity of two time series, MDL reveals the average duration of similar states in the phase space of two time series. The values of Tds were 6.5991, 8.8260, and 7.0533, and the values of Temp were 9.0706, 7.3402, and 8.5231. These were still higher than the values for other water quality factors. The Chl MDL values were 3.4141, 3.6056, and 3.5846, representing the worst performance.
DET characterizes the determinacy of two sequences by calculating the ratio of the diagonal lines in the CRP. When two deterministic processes have similar phase space states, the corresponding DET value is larger. At the three stations, the values of Tds were 0.9355, 0.9513, and 0.9387, and the values of Temp were 0.9562, 0.9582, and 0.9531, respectively. The DET values of the two were significantly higher than those of the other factors, indicating that their states were more similar to the phase space of the pH sequence, while the findings for Chl were the opposite.
LAM represents the synchrony between sequences by quantifying the vertical/horizontal structures. From Table 4, it can be seen that the Tds and Temp values were the largest, while the Chl value was the smallest. This implied that Tds and Temp were more in sync with the pH sequences, and Chl was almost out of sync with pH.

3.5. Quantitative Analysis

In Section 3.4, we found that under the four indexes, the water quality factors did not have exactly the same degree of influence on pH. For example, at ST2, the value of Turb was higher than that of Bga considering the influence of MDL, but the opposite was true at ST3. For LAM, the Do and DoP values also presented the same performance across stations. A single index cannot measure the effect of several water quality factors on another factor. Therefore, we added the value of mutual information entropy, normalized it with the four CRQA indexes, calculated the geometric mean value, and comprehensively measured the behavioral similarity among the water quality factors. By introducing mutual information entropy, the correlation between the data could be mined from the perspective of recurrence analysis and information theory, which made the scheme more convincing.
Figure 6 shows the comprehensive evaluation results of CRQA and mutual information entropy for the spatio-temporal correlation characteristics of the marine factors. The figure in the heat map is the combined value of the correlation between the two factors, ranging from 0 to 1. The larger the value, the stronger the temporal and spatial correlation between the two factors. At the same time, the colors reflect the degree of correlation along with the values. The darker the color, the stronger the relationship between the two factors. Taking Figure 6h as an example, considering their impact on the pH of the central station, the top four factors at ST1 were Temp, Tds, DoP, and Turb. At ST2, they were Temp, Tds, Do, DoP. At ST3, they were Temp, Tds, DoP, and Turb. In the case of interference by various factors, we could assume that the high-correlation factors of the three stations were basically the same. In the pH prediction task, we could select the water quality factors of the target station and the high-association factor data of the adjacent stations as the input. High-correlation factors for other water quality factors can also be found in Figure 6. In marine water quality multi-task prediction, one can conduct a comprehensive evaluation of water quality by predicting various factors affecting water quality at the same time, so that the prediction results are more convincing. In addition, the CRQA comprehensive measurement values corresponding to the marine water quality factors in Figure 6 are presented in Table A1.
Furthermore, in order to verify the scheme for multiple data lengths, we chose 2000 and 3000 pieces of data to draw a comparison with the analysis results of the total 4320 pieces of data. Figure 7 shows the correlation between the pH sequences for the three data lengths and other factors. As can be seen from the values and color shades in this figure, in the experiments with data lengths of 2000 and 3000, the correlation between the pH sequence and Tds and Temp was still large, while that for Chl was small. This was almost consistent with the results for 4320 pieces of data.

4. Conclusions

In this paper, we developed a spatio-temporal analysis model of marine water quality data based on CRPs and CRQA. Furthermore, mutual information entropy was introduced as one of the evaluation indexes. In contrast to single-station analysis, this model could analyze the spatio-temporal dynamic characteristics of water quality factors at multiple stations. Accordingly, we could obtain better prior data as the input for subsequent marine water quality forecasting tasks. Through spatio-temporal analysis, it could be concluded that certain variation factors of the target station were largely affected by other factors at adjacent stations. In future marine water quality multi-tasking prediction work, we will give full consideration to the effects of other stations and consider comprehensive data changes caused by many factors, so as to improve the accuracy of prediction and provide a more novel research scheme for marine prediction work.

Author Contributions

Conceptualization, Z.L. and X.S.; methodology, Y.L.; software, T.S.; validation, T.S. and Y.W.; formal analysis, Z.L.; investigation, Z.L. and Y.L.; data curation, Z.L.; writing—original draft preparation, T.S. and X.S.; writing—review and editing, T.S. and X.S.; visualization, Y.W.; funding acquisition, Z.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Science and Technology Project of Hebei Education Department, grant ZD2021088; the open fund project of the Marine Ecological Restoration and Smart Ocean Engineering Research Center of Hebei Province, grant HBMESO2315; and the project Intelligent Early Warning Prediction and Demonstration Application of Pollutants from Land to Sea, grant 21373302D.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The datasets for this study are available upon request from the corresponding author.

Acknowledgments

We thank the editorial board and all reviewers for their professional advice to improve this work.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. CRQA indicators and mutual information entropy comprehensive measurement results of the associations among 8 factors from 3 stations, in contrast to the central station factors.
Table A1. CRQA indicators and mutual information entropy comprehensive measurement results of the associations among 8 factors from 3 stations, in contrast to the central station factors.
StationContrast
(from ST1)
DoChlTurbBgaTdsDoPTemppH
ST1pH0.49260.15560.43480.43960.65240.63760.95460.1688
Temp0.32730.40050.66040.44580.73310.47770.95100.3334
DoP0.33800.50840.70460.56430.34530.47310.74250.1678
Tds0.35550.26800.49420.38340.47030.44080.99150.4128
Bga0.31330.57310.63290.40960.34610.54790.59890.1470
Turb0.35810.50410.65880.50000.41460.65230.79750.1536
Chl0.32560.35360.63430.68550.34350.59210.63740.1269
Do0.29960.35340.56570.55800.54290.69270.96840.4373
ST2Ph0.70230.14660.30880.30710.85220.68800.96610.3852
Temp0.63690.28650.46420.31070.80510.46560.91200.6564
DoP0.61940.40810.36520.32050.48240.32960.77440.3503
Tds0.55470.21500.30640.26590.48490.53180.97450.5902
Bga0.46670.33780.42730.19760.40320.36960.58580.3931
Turb0.61890.47730.30260.42560.58380.51060.81200.4762
Chl0.39960.22430.50990.46960.44050.37680.62960.3444
Do0.47990.14860.39010.28450.78520.65600.99620.7455
ST3Ph0.43730.22810.39790.41120.69050.56280.92250.2228
Temp0.64000.50670.25180.25040.70990.44390.79830.3234
DoP0.74240.59930.51310.26270.30430.26900.35720.2721
Tds0.50740.43520.20220.10110.38370.41410.89270.3107
Bga0.61310.49740.48450.27320.16230.34690.38400.2728
Turb0.72330.71860.51310.42020.25690.39590.65050.2956
Chl0.61790.37110.56480.39520.28000.35090.31550.2369
Do0.45110.53020.19360.16980.45030.58440.81390.3917

References

  1. Ajayi, O.O.; Bagula, A.B.; Maluleke, H.C.; Gaffoor, Z.; Jovanovic, N.; Pietersen, K.C. Waternet: A network for monitoring and assessing water quality for drinking and irrigation purposes. IEEE Access 2022, 10, 48318–48337. [Google Scholar] [CrossRef]
  2. Jiang, Y.; Zhao, M.; Hu, C. A parallel FP-growth algorithm on World Ocean Atlas data with multi-core CPU. J. Supercomput. 2019, 75, 732–745. [Google Scholar] [CrossRef]
  3. Erni-Cassola, G.; Zadjelovic, V.; Gibson, M.I.; Christie-Oleza, J.A. Distribution of plastic polymer types in the marine environment; A meta-analysis. J. Hazard. Mater. 2019, 369, 691–698. [Google Scholar] [CrossRef] [PubMed]
  4. Lemenkova, P. Computing and plotting correlograms by Python and R libraries for correlation analysis of the environmental data in marine geomorphology. Jeomorfol. Araşt. Derg. 2019, 3, 1–16. [Google Scholar]
  5. Deng, W.; Wang, G. A novel water quality data analysis framework based on time-series data mining. J. Environ. Manag. 2017, 196, 365–375. [Google Scholar] [CrossRef]
  6. Kanakambaran, S.; Sarathi, R.; Srinivasan, B. Identification and localization of partial discharge in transformer insulation adopting cross recurrence plot analysis of acoustic signals detected using fiber bragg gratings. IEEE Trans. Dielectr. Electr. Insul. 2017, 24, 1773–1780. [Google Scholar] [CrossRef]
  7. González, G.H.; Infante, O.; Martínez-García, P.; Pérez-Grovas, H.; Saavedra, N.; Caviedes, A.; Becerra, B.; Lerma, C. Dynamical interaction between heart rate and blood pressure of end-stage renal disease patients evaluated by cross recurrence plot diagonal analysis. J. Appl. Physiol. 2020, 128, 189–196. [Google Scholar] [CrossRef]
  8. Liu, W.; Wang, D.; Chen, X.; Wang, C.; Liu, H. Recurrence plot-based dynamic analysis on electrochemical noise of the evolutive corrosion process. Corros. Sci. 2017, 124, 93–102. [Google Scholar] [CrossRef]
  9. Li, Z.; Cai, D.; Wang, J. Machine learning based dynamic correlation on marine environmental data using cross-recurrence strategy. IEEE Access 2019, 7, 185121–185130. [Google Scholar] [CrossRef]
  10. Shuai, H.; Wang, S.; Liu, Q. Phase space reconstruction driven spatio-temporal feature learning for dynamic facial expression recognition. IEEE Trans. Affect. Comput. 2020, 13, 1466–1476. [Google Scholar]
  11. Li, Y.; Cai, D.; Wang, J.; Sun, X.; Li, Z.; Zhang, H.; Wang, N. Recurrence behavior statistics of blast furnace gas sensor data in industrial internet of things. IEEE Internet Things J. 2020, 7, 5666–5676. [Google Scholar] [CrossRef]
  12. Xu, Z.; Zhong, L.; Zhang, A. Phase space reconstruction-based conceptor network for time series prediction. IEEE Access 2019, 7, 163172–163179. [Google Scholar] [CrossRef]
  13. Fan, G.F.; Peng, L.L.; Hong, W.C. Short term load forecasting based on phase space reconstruction algorithm and bi-square kernel regression model. Appl. Energy 2018, 224, 13–33. [Google Scholar] [CrossRef]
  14. Coco, M.I.; Mønster, D.; Leonardi, G. Unidimensional and multidimensional methods for recurrence quantification analysis with CRQA. arXiv, 2020; arXiv:2006.01954. [Google Scholar]
  15. Yan, B.; Chan, P.W.; Li, Q. Dynamic analysis of meteorological time series in Hong Kong: A nonlinear perspective. Int. J. Climatol. 2021, 41, 4920–4932. [Google Scholar] [CrossRef]
  16. Mukherjee, S.; Ray, R.; Samanta, R. Nonlinearity and chaos in wireless network traffic. Chaos Solitons Fractals 2017, 96, 23–29. [Google Scholar] [CrossRef]
  17. Amiri, A.; Samet, H.; Ghanbari, T. Recurrence plots based method for detecting series arc faults in photovoltaic systems. IEEE Trans. Ind. Electron. 2021, 69, 6308–6315. [Google Scholar] [CrossRef]
  18. Yang, D.; Ren, W.X.; Hu, Y.D. Selection of optimal threshold to construct recurrence plot for structural operational vibration measurements. J. Sound Vib. 2015, 349, 361–374. [Google Scholar] [CrossRef]
  19. Martín-González, S.; Navarro-Mesa, J.L.; Juliá-Serdá, G.; Ramírez-Ávila, G.M.; Ravelo-García, A.G. Improving the understanding of sleep apnea characterization using recurrence quantification analysis by defining overall acceptable values for the dimensionality of the system, the delay, and the distance threshold. PLoS ONE 2018, 13, e0194462. [Google Scholar] [CrossRef]
  20. Ashikur, M.; Rupom, R.; Sazzad, M. A remote sensing approach to ascertain spatial and temporal variations of seawater quality parameters in the coastal area of bay of Bengal, Bangladesh. Remote. Sens. Appl. Soc. Environ. 2021, 23, 100593. [Google Scholar] [CrossRef]
  21. Deswati, D.; Safni, S.; Khairiyah, K.; Yani, E.; Yusuf, Y.; Pardi, H. Biofloc technology: Water quality (ph, temperature, do, cod, bod) in a flood & drain aquaponic system. Int. J. Environ. Anal. Chem. 2022, 102, 6835–6844. [Google Scholar]
  22. Elkiran, G.; Nourani, V.; Abba, S. Multi-step ahead modelling of river water quality parameters using ensemble artificial intelligence-based approach. J. Hydrol. 2019, 577, 123962. [Google Scholar] [CrossRef]
  23. Najafzadeh, M.; Ghaemi, A.; Emamgholizadeh, S. Prediction of water quality parameters using evolutionary computing-based formulations. Int. J. Environ. Sci. Technol. 2019, 16, 6377–6396. [Google Scholar] [CrossRef]
  24. Xu, Z.; Shen, J.; Qu, Y.; Chen, H.; Zhou, X.; Hong, H.; Sun, H.; Lin, H.; Deng, W.; Wu, F. Using simple and easy water quality parameters to predict trihalomethane occurrence in tap water. Chemosphere 2022, 286, 131586. [Google Scholar] [CrossRef] [PubMed]
  25. Susanti, N.D.; Sagita, D.; Apriyanto, I.F.; Anggara, C.E.W.; Darmajana, D.A.; Rahayuningtyas, A. Design and implementation of water quality monitoring system (temperature, ph, tds) in aquaculture using iot at low cost. In Proceedings of the 6th International Conference of Food, Agriculture, and Natural Resource (IC-FANRES 2021), Tangerang, Indonesia, 4–5 August 2021; pp. 7–11. [Google Scholar]
Figure 1. Marine water quality spatio−temporal analysis framework based on CRPs and CRQA.
Figure 1. Marine water quality spatio−temporal analysis framework based on CRPs and CRQA.
Entropy 25 00689 g001
Figure 2. Geographical location of the three marine monitoring stations.
Figure 2. Geographical location of the three marine monitoring stations.
Entropy 25 00689 g002
Figure 3. CRP visualization of correlation between marine factors and pH at ST1 station.
Figure 3. CRP visualization of correlation between marine factors and pH at ST1 station.
Entropy 25 00689 g003
Figure 4. CRP visualization of correlation between marine factors at ST2 station and pH at ST1 station.
Figure 4. CRP visualization of correlation between marine factors at ST2 station and pH at ST1 station.
Entropy 25 00689 g004
Figure 5. CRP visualization of correlation between marine factors at ST3 station and pH at ST1 station.
Figure 5. CRP visualization of correlation between marine factors at ST3 station and pH at ST1 station.
Entropy 25 00689 g005
Figure 6. CRQA indicators and mutual information entropy comprehensive measurement results of the associations among marine factors.
Figure 6. CRQA indicators and mutual information entropy comprehensive measurement results of the associations among marine factors.
Entropy 25 00689 g006
Figure 7. Comprehensive measurement results of correlation between pH sequences and other factors for different data lengths.
Figure 7. Comprehensive measurement results of correlation between pH sequences and other factors for different data lengths.
Entropy 25 00689 g007
Table 1. Determination of embedding dimension of water quality sequences at the three stations.
Table 1. Determination of embedding dimension of water quality sequences at the three stations.
StationDoChlTurbBgaTdsDoPTemppH
ST158656556
ST249887656
ST34715177565
Table 2. Determination of delay time of water quality sequences at the three stations.
Table 2. Determination of delay time of water quality sequences at the three stations.
StationDoChlTurbBgaTdsDoPTemppH
ST1182842221914149
ST22218222235151518
ST31315242420212618
Table 3. Taking pH as an example, the best-match results of embedding dimension/delay time.
Table 3. Taking pH as an example, the best-match results of embedding dimension/delay time.
StationDoChlTurbBgaTdsDoPTemppH
ST16/98/96/96/96/96/96/96/9
ST26/99/98/98/97/96/96/96/9
ST36/97/915/917/97/96/96/96/9
Table 4. CRQA quantification results of associations between seven factors from three stations and pH at the central station.
Table 4. CRQA quantification results of associations between seven factors from three stations and pH at the central station.
StationCRQADoChlTurbBgaTdsDoPTemppH
ST1MDL5.26373.41415.27534.70756.59915.67459.07065.5753
RR0.04110.01140.05700.03860.07270.04520.04520.0602
DET0.89940.77580.88590.86840.93550.90630.95620.8940
LAM0.92810.84020.90870.90970.94460.92340.95740.9384
ST2MDL7.65843.60564.15263.97688.82600.73767.34029.1455
RR0.08480.00930.03390.01940.10290.05900.10820.1635
DET0.93980.76000.81660.83120.95130.93330.95820.9745
LAM0.94430.85800.88960.88740.95380.94170.95890.9866
ST3MDL4.76793.58465.81096.14397.05336.65908.52315.2118
RR0.03300.02180.05660.03170.06980.04010.11200.0456
DET0.88260.81800.86170.90690.93870.92220.95310.9236
LAM0.89920.87010.92250.93100.95010.93310.95530.9591
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, Z.; Sun, T.; Wang, Y.; Liu, Y.; Sun, X. Spatio-Temporal Analysis of Marine Water Quality Data Based on Cross-Recurrence Plot (CRP) and Cross-Recurrence Quantitative Analysis (CRQA). Entropy 2023, 25, 689. https://doi.org/10.3390/e25040689

AMA Style

Li Z, Sun T, Wang Y, Liu Y, Sun X. Spatio-Temporal Analysis of Marine Water Quality Data Based on Cross-Recurrence Plot (CRP) and Cross-Recurrence Quantitative Analysis (CRQA). Entropy. 2023; 25(4):689. https://doi.org/10.3390/e25040689

Chicago/Turabian Style

Li, Zhigang, Ting Sun, Yu Wang, Yujie Liu, and Xiaochuan Sun. 2023. "Spatio-Temporal Analysis of Marine Water Quality Data Based on Cross-Recurrence Plot (CRP) and Cross-Recurrence Quantitative Analysis (CRQA)" Entropy 25, no. 4: 689. https://doi.org/10.3390/e25040689

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop