Next Article in Journal
Seaport Terminals Risks Prioritization Using a Structural Modeling-Based Approach: A Real Case Study
Next Article in Special Issue
AUV Formation Coordination Control Based on Transformed Topology under Time-Varying Delay and Communication Interruption
Previous Article in Journal
Supporting Predictive Maintenance of a Ship by Analysis of Onboard Measurements
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Track Pairs Collision Detection with Applications to Ship Collision Risk Assessment

1
Navigation College, Dalian Maritime University, Dalian 116026, China
2
Key Laboratory of Navigation Safety Guarantee of Liaoning Province, Dalian 116026, China
*
Author to whom correspondence should be addressed.
J. Mar. Sci. Eng. 2022, 10(2), 216; https://doi.org/10.3390/jmse10020216
Submission received: 7 January 2022 / Revised: 2 February 2022 / Accepted: 4 February 2022 / Published: 6 February 2022

Abstract

:
The port waterway network plays an important role in the organization and management of port ship traffic. Due to limited ship operations, conflicts, congestion, and safety issues often arise in port waters. Conflicts between ships can be predicted by collision detection between ships. A novel collision detection algorithm for trajectory pairs is proposed by introducing variable time interval variables. In addition, to improve the overall accuracy of trajectory compression and reduce redundant calculation in collision detection, a multi-factor Douglas-Peucker algorithm adapted to ship trajectory compression is proposed with the consideration of speed and turn constraints. The maximum speed difference of the algorithm is increased by 1.5–2.5%, and the average speed difference increased by 2.0–4.5%. Based on the method mentioned above, the risk assessment framework of maritime collision is established and the risk situation of the waters near Ningbo Zhoushan Port is evaluated and analyzed by using ship historical track data.

1. Introduction

As the last nautical mile of ship transportation, the safety of navigation for vessels in the port waters directly affects the trade exchanges between countries [1]. To strengthen the safety of navigation in port waters, the International Maritime Organization (IMO) has established a traffic separation system and a ship reporting system. At the same time, countries have established a Vessel Traffic System (VTS) supervision system based on the characteristics of the water area. Figure 1 shows the management area of VTS near Ningbo-Zhoushan port and Yang-shan port, where there are several functional areas such as anchorage, fairway, alert area, and pilot embarking or disembarking area (point), et al. Ningbo-Zhoushan port and Yang-shan port both have become one of the biggest port all over the world, where the vessel traffic flow is very complex constrained by narrow waterways, islands, bridges, et al. Serval managerial and navigational solutions such as the Traffic Separation Scheme (TSS) and VTS have been implemented in this area, however, the actual effectiveness of these solutions in generating navigational safety and the focus of subsequent navigational regulation still deserve further study.
Current ship collision risk evaluation theories and model studies can be classified according to macro and micro [2]. Macroscopic collision evaluation methods are mostly based on collision data, traffic flow statistics, and relevant hydro-meteorological data using statistical analysis to make a comprehensive evaluation of navigational safety in a certain range of waters. This study focuses on the evaluation of macroscopic collision in this water.
It can be subdivided into two categories, including the evaluation methods based on the number of collisions and collision rate and the evaluation methods based on the number of encounters and encounter rate. In [3], the authors had assessed the collision risk of the narrow waterway in Ningbo-Zhoushan with collision detection method and ship domain. This study first aligns and discretizes the trajectories, and then achieves collision detection of trajectory pairs by detecting collisions between isolated trajectory points. It is easy to find that the efficiency will be low and the process is complicated.
Regarding the study of conflict prediction methods within the channel, the authors of literature [4,5] classified conflicts into two categories, connecting segments and intersections, by introducing the concept of network. Literature [6] provides a literature review on the assessment of individual collision risk in ship navigation. Chai [7] proposed a quantitative risk assessment model with the consideration of human life loss and environmental pollution. Weng [8] made an estimation for the collision frequency of vessels in the Singapore Strait with the real-time vessel movement data. Goerlandt [9] developed a collision alert framework from the perspective of the concept of risk and the intended use of the model. Furthermore, the authors also introduced a method for measuring the ship collision risk, which was successfully applied to a case study. Debnath [10] made a study on the influential factors of collision risks and analyzed the associations between risks and geometric, traffic, and the regulation for controlling traffic in Singapore port fairways. Based on the ship domain safety areas derived by the VTS operator, Yoo [11] proposed a real-time collision risk assessment support system, whose validity was verified in the scenarios at Busan port.
Since macro collision risk evaluation is mostly obtained by statistical analysis based on historical big data, and the huge amount of data will bring some trouble to the research. In terms of compression of trajectory big data, the famous Douglas Peucker algorithm was proposed in [12]. Considering the importance of the velocity of a moving object in map-matching [13], mobility prediction, and moving pattern mining [14], Ying, et al. [15] proposed an algorithm for preserving velocity when simplifying trajectory.
The key of ship collision risk assessment is to calculate the number of conflicts between ships. At present, in the field of ship-based conflict detection research, most of the continuous collision detection is carried out through the discrete collision detection method. Firstly, the continuous state of the trajectory pair is discretized, and then approximately continuous collision detection is performed by judging whether there is a collision between the finite states of the trajectory pair. Although its understanding and implementation are simple, its efficiency is very low. The detail for real-time collision detection can be found in the literature [16]. Tang [17] presented a novel continuous collision detection algorithm by introducing deforming non-penetration filters.
The purpose of this work is to evaluate the collision risk for vessels in specific water. The main contribution is divided into two parts. Firstly, the previous research work is improved and a new trajectory simplification method is designed. Secondly, a collision detection method between trajectories is proposed, which can effectively improve the efficiency of collision detection and reduce the difficulty of algorithm implementation. The framework of ship collision risk assessment is shown in Figure 2.

2. Ship Collision Risk Assessment

2.1. AIS Compression Based on the Improved Douglas-Peucker Algorithm

Ship trajectory data is accumulated continuously through AIS equipment, and its broadcast interval can be as short as 5 s, so it will consume a lot of computing resources when analyzing ship collisions through massive AIS data. Especially for ships sailing in the waterway network, their navigation is limited by the direction of the waterway. According to statistics, in a ship trajectory, the key points to describe navigation behavior only account for about 8%. Therefore, we firstly compress the big AIS data. Therefore, our primary work is to compress AIS data.
Regarding the compression for AIS data, some works have been made in our previous work [18]. Considering that the historical trajectory of the ship contains information such as speed, position, heading, etc., this section improves the DP (Douglas-Peucker) algorithm by considering the influence of various factors on the compression algorithm. This section improves the DP algorithm considering the influence of ship speed based on Euclidean distance, realizes the compression and preprocessing of ship AIS data, reduces the number of AIS data, and improves the efficiency of ship collision detection.

2.1.1. Trajectory Preprocessing

We use the trajectory AIS data of Ningbo-Zhoushan, China in 2018, due to the original data is redundant, besides, the same ship may have many voyages into the same port, therefore, the AIS data should be preprocessed before grouping by voyage. The preprocessing steps are as follows (Algorithm 1).
Algorithm 1. Preprocessing AIS data
Require: AIS database aisdata.db, threshold_time = 30 min
1:  Select distinct(MMSI) from aisdata.db squry_mmsi
2:  For each sub_mmsi in squry_mmsi
3:     Select track from aisdata.db where MMSI = sub_mmsi
4:     If the number of track is less than 10 then continue
5:     Else For each point in track
6:         If the time difference time_diff between the next timestamp and the present timestamp is bigger than threshold_time then insert voyage into return_voyages, voyage set NULL
7:         Else insert point into voyage
8:         End If
9:         Insert voyage into return_voyages, voyage set NULL
10:       End for
11:     End If
12:  End for
13:  Return return_voyages
Due to the coordinate of trajectory are stored by longitude and latitude. Amis at the satisfaction of accuracy and efficiency of trajectory compression and clustering, there need to be transferred from geodetic coordinate system to Mercator. ( φ , λ ) refers to the geographical coordinates and Mercator coordinates is ( x , y ) , a refers to the long radius of the ellipse of the earth and e refers to the first eccentricity of earth ellipsoid that is the meridian parts at latitude, r 0 refers to the circle radius of base dimensions latitude. The formula for transferring coordinate from geographical to Mercator as Equation.
{ r 0 = ( a × cos φ ) 1 e 2 sin 2 φ q = ln tan ( π 4 + φ 2 ) e 2 * ln ( ( 1 + e sin φ ) ( 1 e sin φ ) ) x = r 0 × λ y = r 0 × q

2.1.2. Multi-Factor Douglas-Peucker Algorithm

A trajectory is approximated by discrete samples of the AIS coordinate point. It can be shown a series of chronologically coordinate points, e.g., T = { p 1 , p 2 , , p n } , where each point consists of an ID, a coordinate set, and timestamp such as P = { M M S I , L o n g i t u d e , A t i t u d e , C o u r s e , S p e e d , T i m e } . DP algorithm [12] as one of the most accurate line simplification algorithms is proposed in 1973. The core concept of the DP compression method is to reduce the original trajectory point with compression threshold the perpendicular Euclidean distance of each remaining point. Before simplifying the trajectory data from the berthing data, there need to certify the compression threshold τ . DP algorithm recursively divides the curve composed of line segments. Initially, it is given all the points between the first and the last point. Firstly, the first and last points are kept, and then find the furthest from the line segment with the first and last. If the point is closer than τ to the line segment, then all points can be discarded, otherwise, that point must be kept. The algorithm recursively calls itself with the first and the furthest and then with the furthest and the last. The Ramer-Douglas-Peucker (RDP) algorithm, also known as the Douglas-Peucker algorithm and iterative endpoint fit algorithm, so in the following, RDP algorithm refers to DP algorithm.
Based on the DP algorithm and combined with speed and turning limitation, Multi-Factor Douglas-Peucker (MFDP) algorithm (Algorithm 2) is proposed. Before introducing the MFDP algorithm, we first describe the formal definitions for illustrating the parameters:
Definition 1.
The perpendicular Euclidean distance d ( P i ) of each remaining point P i is the perpendicular Euclidean distance  d ( P i , L j ) of a point P i to line L. It can be calculated by the following Equation (2).
d ( P i ) = d ( P i , L j )
Definition 2.
The azimuth C i , j of each remaining point P i is the difference related to P j . The azimuth difference d θ ( P i ) is the difference between the adjacent points. It is an important item to a unified coordinate system. Based on transformation by Equation(3) between geodetic coordinate system and Mercator, the formula of d θ ( P i ) is shown as Equation (4). In Equation (4),  M P refers to meridian parts at latitude φ .
{ M P = 7915.70447 lg [ tan ( π 4 + φ 2 ) e 2 ] C i , j = arctan ( λ j λ i M P ( φ j ) M P ( φ i ) )
d θ ( P i ) = | C i + 1 , i C i , i 1 |
Definition 3.
The speed difference d s ( P i ) of each remaining point P i is defined as follows. V i 1 , V i , V i + 1 refer to the speed of P i 1 , P i , P i + 1 respectively. t i 1 , t i , t i + 1 refer to the time of P i 1 , P i , P i + 1 , respectively.
{ f i ( θ ) = ( ( arctan ( V i + 1 V i t i + 1 t i ) ) ( arctan ( V i V i 1 t i t i 1 ) ) ) * 180 / π d s ( P i ) = { f i ( θ ) + 360 360 < f i ( θ ) < 180 | f i ( θ ) | 180 < f i ( θ ) < 180 360 f i ( θ ) 180 < f i ( θ ) < 360
Definition 4.
MFDP algorithm aims at considerable and universal, so the above distance is part of structure distance d i s t i ( P i ) , which can be calculated by the following Equation (6), w d , w θ , and w s refer to the weight of the perpendicular Euclidean distance, the angle distance, and the speed distance, respectively.
d i s t i ( P i ) = w d d ( P i ) + w θ d θ ( P i ) + w s d s ( P i )
Due to the existence of weight of d i s t i ( P i ) for simplification, therefore, the weight should be fixed when using. Under the certain necessities, the weight can be fixed differently as we need.
Algorithm 2. MFDP algorithm
Input: Track points list D = [ P i , , P j ] , threshold ε = [ ε d , ε θ , ε s ] , weight w = [ w d , w θ , w s ]
Output:
1:  Function MFDP (D, i, j, e)
2:  Find the point with maximum distance related to the baseline as d
3:  If d > dmax, split the track point list into two parts D = [ P i , , P d ] , D = [ P d , , P j ] , recursive call
4:    MFDP (D, i, d, e)
5:    MFDP (D, d, j, e)
6:  Else output P i , P j
7:  End if

2.1.3. Error Analysis

The main measurements for the experimental evaluations are as follows, accuracy by turning angle S E D C and speed S E D V . The way of measuring the speed and turning angle difference between AIS trace and its compressed version is to measure the Synchronized Euclidean Distance (SED). SED measures the distance for any point in the track, this is the distance between its actual location and its synchronous position estimated via interpolation between its predecessor and successor points in the compressed track shown in Figure 3.
The Synchronous Euclidean Distance error of turning angle S E D C i and speed S E D V i for synchronization points are defined as follows.
S E D V i = ( v t i v t i ) 2 + ( t i t i ) 2 ( i = 1 , 2 , , n )
S E D C i = ( c t i c t i ) 2 + ( t i t i ) 2 ( i = 1 , 2 , , n )
The algorithm for generating synchronization points is presented in Algorithm 3.
Algorithm 3. Generating synchronization points
Subtrack D = [ P i , , P j ] is a points list of each trajectory in tracks; marklabel is the filtered point label list; id is each one filtered point label in Subtrack; Outtrack is the generated synchronization trajectory list; L is the number of points in Outtrack.
1:  For each submark in tracks do
2:   If the number of submark is less than 3 then continue
3:    Else
4:     For each id in marklabel do
5:      while L < id do
6:       L++
7:       Insert P L into Outtrack
8:      Calculate the coordinate of synchronization point P i d under interpolation principle
9:      Insert P i d into Outtrack
10:    End for
11:   End If
12:  End for
13:  Return Outtrack

2.2. Collision Risk Assessment

At present, the research methods for ship collision accidents in a specific research water area within a specific period mostly refer to the results in the road and aviation fields. In the aviation field, a collision event is defined as the number of collisions within a specific period, which is also referred to as the frequency of collisions f c o l l i s i o n . It is equal to the product of the number of collision incidents in a given time N c o n f l i c t and the accident-causing coefficient P c a u s a t i o n .
f c o l l i s i o n = N c o n f l i c t × P c a u s a t i o n
Note that the conflicts between ships can be divided into three categories based on the International Regulations for Preventing Collisions at Sea (COLREGS) [7], such as head-on, crossing, and overtaking conflict. All three types of conflict are distinguished through the course differences between ship pairs.
Overtaking conflict means that ship pairs are sailing on almost parallel courses and the course difference should not exceed 10°. Crossing conflict means that the course difference of ship pair falls in the range 10–170° or 190–350°. Head-on conflict means that the course difference of ship pair falls in the range 170–190° [7]. Furthermore, the necessary condition for the occurrence of the above three types of conflicts is that one ship will intrude into the safety domain of the other when both ships sail with their current course and speed.
Considering the three types of conflicts mentioned above, the collision risk can be assessed by the sum of the number of collisions that all types will encounter.
f c o l l i s i o n = N c o n f l i c t h × P c a u s a t i o n h + N c o n f l i c t c × P c a u s a t i o n c + N c o n f l i c t o × P c a u s a t i o n o
where N c o n f l i c t h , N c o n f l i c t c , and N c o n f l i c t o are respectively the number of conflicts occurring in each scenario of head-on, crossing, and overtaking between ship pairs in a specific area within a certain period, P c a u s a t i o n h , P c a u s a t i o n c , and P c a u s a t i o n o are respectively the causation probability for the head-on, crossing, and overtaking conflicts.
As mentioned, if we want to evaluate the number of collisions, we need firstly calculate the number of conflicts. The basis for whether there is a collision risk between two ships is whether there is an intrusion in the safety domain of the two ships under relative motion. Regarding the research on instantaneous collision detection, as mentioned in the first section, the most widely used currently is to evaluate the collision risk through the nearest encounter point or the ship domain.

2.2.1. Spatial-Temporal Alignment of Trajectory

In the past, collision detection was performed by judging the relative position between ship fields. Since the trajectory of a ship is composed of multiple position points, which can be regarded as a discrete object, therefore the collision detection between the trajectories needs to frequently detect the state of the two ships. To avoid unnecessary detection, it is necessary to filter out the state where there is no possibility of a collision at all. Traditional methods include hierarchical bounding boxes, space segmentation, and GJK, but most of the above methods are collision screening of isolated objects, and there is still a problem of low efficiency in trajectory collision detection. In this study, the spatiotemporal collision detection between trajectory pairs is simplified by time intervals.
Different from the previous study in [3,7,8], regarding the calculation for the number of vessel conflicts, we determined a self-adaptive time interval from the trajectory pairs as shown in Figure 4. In the previous study mentioned above, the trajectory was discrete by a fixed time interval, such as the value was set as three minutes in [7,8], and 10 s was taken in [3]. However, since the launch of an AIS device follows the principle of time division multiple access, the timestamps of each track are not consistent. Therefore, the collision detection based on fixed time intervals would need to interpolate the track to achieve the purpose of timestamp alignment. Furthermore, there is no guarantee that there is no waypoint when further collision detection is performed on the trajectory pair of each time interval through relative motion.
The steps for determining the time interval for collision detection of trajectory pairs are as follows:
(1)
Determine the time intersection interval of the trajectory pair, if it exists, continue, otherwise exit.
(2)
Find the larger value of the trajectory to the start time.
(3)
Compare the magnitude between the two trajectories that are larger than the smallest value.
(4)
Use the smaller value in (2) and (3) as the collision detection time interval.
(5)
Set the smaller value in (3) as the larger value in (2), and repeat steps (2)–(5) until the end.
For example, in Figure 4, there are two trajectories labeled with a number from 1 to 5 and 1 to 6, respectively. Each trajectory has a series of waypoints and the arrival time at each waypoint was expressed as t i , m , where m is the index of a waypoint. According to the mentioned process, due to the arrival time of vessel i is less than t j , 1 at waypoint 1, therefore, the left value is equal to t j , 1 . Then, we can derive the right value t i , 2 by comparing the next adjacent arrival time t i , 2 and t j , 2 . Finally, the time interval [ t j , 1 , t i , 2 ] is obtained, subsequently, with the repetition of processes, the time interval set T can be determined as T = { [ t j , 1 , t i , 2 ] , [ t i , 2 , t j , 2 ] , , [ t i , 5 , t j , 5 ] } . In this study, the radius of the safety domain R is various lengths of vessels, which is set as three times the length of vessel referred [8].

2.2.2. Trajectory Pairs Collision Detection

In the mentioned above, the time interval set has been determined, based on this foundation, we need to detect whether there is conflict at any time interval. Figure 5 is a conflict example for trajectory pair, where ship domain was utilized to detect collision between two vessels. If ship i enters the domain of another ship j during the time interval [ t i , 4 , t i , 5 ] , it is deemed as a conflict.
Whether an intrusion event occurs between ships i and j is determined here by the relative distance of movement and the size of the ship’s domain as shown in Figure 6 where D i , j t is the relative distance of ship pair at the time t , v i t is the speed of ship i at the time t , R i is the safety domain of ship i , the position of the ship i at the time t is presented as ( φ i t , λ i t ) , L i , j t is the distance of ship j can sail using the relative speed v i , j t within the time interval Δ t , θ i , j t is the angle between the relative speed and the connecting line of ship pairs. Some of the mentioned variables can be calculated as follows. Using Formula (1), we transferred the coordinate from geographical ( φ i t , λ i t ) to Mercator ( x i t , y i t ) .
D i , j t = ( x i t x j t ) 2 + ( y i t y j t ) 2
| v i j | = ( | v i | cos α i t | v j | cos α j t ) 2 + ( | v i | sin α i t | v j | sin α j t ) 2
θ i , j t = arccos ( 1 ( | v i | sin α i t | v j | sin α j t | v i , j t | x j t x i t D i , j t ) 2 + ( | v i | cos α i t | v j | cos α j t | v i , j t | y j t y i t D i , j t ) 2 2 )
Thus, the problem of ship collision risk assessment can be transformed into a statistical problem of the number of ship collisions in a specific area in a specific period. As shown in Figure 6, if D i , j t + Δ t is less than R i , then the ship j will intrude the safety domain of the ship i in the time interval Δ t .
D i , j t + Δ t R j ( x i t + Δ t x j t + Δ t ) 2 + ( y i t + Δ t y j t + Δ t ) 2 R j
Assuming that two ships maintain their current course and speed during the time interval, Formula (14) can be transformed to
( x i t x j t + | v i | sin α i t Δ t | v j | sin α j t Δ t ) 2 + ( y i t y j t + | v i | cos α i t Δ t | v j | cos α j t Δ t ) 2 R j D i , j t 2 + | v i , j t Δ t | 2 + 2 Δ t [ ( x i t x j t ) ( | v i | sin α i t | v j | sin α j t ) + ( y i t y j t ) ( | v i | cos α i t | v j | cos α j t ) ] R j L i , j t = D i , j t cos θ i , j t R j 2 ( D i , j t sin θ i , j t ) 2 | v i , j t | Δ t
Therefore, whether there is a conflict between ship pairs during a time interval Δ t or not can be determined by the Boolean variable B ( i , j , t ) .
B ( i , j , t ) { 1 , 0 < L i , j t | v i , j t | Δ t 0 , o t h e r w i s e
Furthermore, due to the large span of the researched data in time and geographic scope, to avoid the extra calculation amount between trajectories that do not exist in conflict, it can be realized utilizing spatial retrieval. Currently, mature spatial retrieval technologies include the R-tree index, grid index, quad-tree index, and so on. In practical applications, we can combine one or more of the above technologies and merge their characteristics to form a new spatial indexing technology.

3. Case Study and Results

In this section, a series of experiments for the performance evaluation of the proposed MFDP and path generation algorithm by a real trajectory set obtained from Ningbo, CHINA was conducted. All the experiments are implemented in python 3.6 on an AMD Ryzen 7 5800H with Radeon Graphics 3.20 GHz machine with 16.0 GB of memory running Microsoft Windows 11. Our experimental results are as following discussions.

3.1. Experimental Data Source

In this study, AIS data were obtained from the Transportation and Maritime Safety Administration of China which is responsible for storing and maintaining various ship information along the coast of China.
The experimental data was all the Class-A AIS messages, that were collected in Zhou Shan Islands from selected obtained from 1 January 2018, to 10 January 2018. The data had been pre-processed utilizing the study which was described in the work [3]. The description of the measurement is shown in Table 1. The number of ships and the number of trajectories of each type are listed in Table 2 and Table 3 gives the analysis of the ratio of ship length to width.

3.2. Comparison of AIS Compression Method

3.2.1. Threshold Analysis

From the described in Section 3.1, we can find that Compression accuracy depends on the compression threshold. The larger the compression threshold, the lower the compression accuracy. So, the threshold selection will directly determine the quality of the simplified data. However, the optimal threshold value is difficult to determine which was affected by many factors. For example, Zhao et al., 2018 used the own ship’s length in different trajectories as the threshold value for satisfying the adaptability. Due to the data was collected from different types of ships, the characteristics of the trajectory are affected by a number of parameters, such as the length, the width, and the ratio of length to width. For example, the same length of ship sail on the same water, the track is different depending on the width of ships.
Considering the different maneuverability of ships, the bigger the ship, the more difficult it is to operate, therefore the track of the big one would have fewer points deviate the overall shape of ship trajectory than the small one. Therefore, if the threshold is global and suitable for the big ship, then the important track point for the small ship would be discarded. Conversely, if the threshold is suitable for the small ship, then there would retain many points of redundancy, and this leads to the low compression rate.
Consequently, the ratio of the length to width of ships was considered in the selection of threshold values. The threshold for each trajectory is different from each ratio of ship length to width. In the following experiments, which equal to 10 times or 100 times the ratio of ship length to width. Furthermore, when setting the threshold value, the sailing waters also can be considered, for example, the ratio in the harbor water can be set smaller than the ratio in the open water. Our practical experience shows that the appropriate ratio in the harbor water usually ranges from 30 times to 80 times. For different users, the threshold value varies to the purpose, thus whose variability should be considered in the application of the DP algorithm. Under certain conditions, the threshold value can be fixed differently as needed.
From the results shown in the left one of Figure 7, there was a strong correlation between the retention rate of characteristic points and the course threshold when the threshold of the Euclidean Distance and angle difference remained constant. From the results shown in the right one of Figure 7, there was a strong correlation between the retention rate of characteristic points and speed threshold when the threshold of the Euclidean Distance and the course difference remained constant. This showed that the MCDP method can retain the characteristic points that we wanted, and more features can be extracted.
Figure 8a and Figure 9a showed a strong correlation between the course difference, course threshold, and speed threshold when the threshold of the Euclidean Distance remained constant in standard error and mean error, respectively. The results in Figure 8b and Figure 9b showed a strong correlation between the speed difference, speed threshold, and course difference when the threshold of the Euclidean Distance remained constant in standard error and mean error, respectively. Through analysis, it is also can be found that the MCDP method can retain the characteristic points that we wanted, and more features can be extracted.

3.2.2. Experimental Comparison

To further verify quality improvement, the samples were tested with the provided method. The experimental results of the proposed method are shown in Table 2, Table 3 and Table 4. The results of the number of critical points deleted by RDP incorrectly are presented in Table 2, where RDP was tested with the threshold of Euclidean Distance, and MCDP was tested with the threshold of Euclidean Distance, angle constraint, and course constraint. From this, we can find that the provided method could retain more characteristic points than RDP. Table 3 and Table 4 is the speed loss and course loss of RDP and MCDP on the dataset with different thresholds. In comparing the results, MCDP (rather than RDP) improved 1.5–2.5% in the average course synchronization error of trajectories and 2–4.5% in the average speed synchronization error of trajectories. From the comparison shown in Figure 10, it is apparent that the trajectory after simplification using MCDP was more similar to the original trajectory than when using RDP for the course difference and speed difference. Table 1 is the measurement, unit, and description of the measurement.
Figure 11 shows the Synchronized Euclidean Distance of MCDP and RDP. In Figure 10, the various Synchronous Euclidean Distance error of speed and course of RDP is greater than MCDP, in other words, the trends both in speed and course simplified with MCDP algorithm were more accurate than that of RDP algorithm by SEDtotv, SEDavgv, SEDtotc, and SEDavgc. The reason is that the RDP algorithm just considered the spatial and temporal information and not the speed and course status. Our MCDP method fully and deeply tackled multiple constraints, e.g., velocity, course, and position, for improving the accuracy of the trajectory simplification. The Synchronous Euclidean Distance error of both methods increases with an increasing threshold value.
Figure 12 shows the comparison in terms of the number of the remained critical points in different times of ship length to width. From this, we can find that the number of the remained critical points of both methods decreases with increasing threshold and MCDP can remain much more critical points than RDP.
To verify the advantages of MCDP, one trajectory was utilized to conduct the parallel experiments. In this experiment, the threshold value was set as 50 times the ship length to width. In Figure 13, the blue and red lines represent the original algorithm and the improved algorithm, respectively. The improved algorithm at each peak and valley can better preserve the original properties. The results reveal that the MCDP method can preserve more attributes than RDP methods in terms of speed attribute and course attribute. Figure 14 shows the compression results of a vessel trajectory in the parallel experiment. Through analysis of this figure, it can be found that there will have more critical points been remained as denoted with an arrow in Figure 14. Therefore, the MCDP method is superior to the RDP method in retaining the attribute characteristics of trajectory including shape, speed, and course.

3.3. Collision Risk Analysis

In this section, collision risk was assessed with different meeting situations, such as head-on, crossing, overtaking situation. Therefore, we derived from existing studies [2,3,7], the causation probability values for head-on, crossing, and overtaking conflicts were set as p h = 4.90 × 10 5 , p c = 6.83 × 10 5 , and p o = 4.90 × 10 5 , respectively.
Table 5 presents the results of the collision risk assessment for the study area, where the collision frequency of the first officer is bigger than others, and the collision frequency of the other two duty periods is the same. In addition, there is a positive correlation between the on-duty stage and the frequency of conflict.
Table 6 shows the comparison of collision frequency in different encounter situations using analysis of variance (ANOVA) [3,8]. The F value and p-value in ANOVA are used to analyze the differences between and within groups. F value corresponds to a p-value, and the larger the F value, the smaller the p-value. If the F value is equal to 1, the difference between groups is consistent with the difference within groups. It can be found that F values of the three conflict types differ greatly and p values are small, therefore, the differences between groups are large, and it can be concluded that the collision frequency is affected by the driver’s experience or duty period.
Figure 15 reveals that the difference between the relative speed distribution of ships involved in different conflict types. The major relative speed in head-on, crossing, and overtaking conflicts is range 11–12, 15–18, and 6–8 knots, respectively.
Figure 16 indicates that the length distribution for vessels involved in different types of conflict. From three types of distribution, it can be found that the majority of length in head-on and crossing conflicts is around 100 m. In overtaking conflict situation, the majority of length ranges from 100 to 150 m.
Figure 17, Figure 18 and Figure 19 illustrate the spatio-temporal distribution under head-on, crossing, and overtaking conflicts for the specific time of the studied waters respectively. Where the red, yellow, and brown colors represent the conflicts at different watch stages, respectively. The red color represents the duty stage of the first officer from 00:00 to 04:00 and 12:00 to 16:00. The yellow color represents the duty stage of the second officer from 04:00 to 08:00 and 16:00 to 20:00. The brown color represents the duty stage of the third officer from 08:00 to 12:00 and 20:00 to 24:00.

4. Discussion

Combining the water layout map in Figure 1 and the spatio-temporal trajectory distribution map, it can be found that the intersection of the Jin-Tang Bridge and the north side of the Xi Hou-men Bridge are both high incidence areas in the three types of conflicts. Furthermore, there are few conflicts in the Yang-shan harbor of Shang-hai port, the reason caused the conflict here should be limited by the narrow waters here so that the encounter distance between the ship pairs is less than the safety domain applied in this study. The width of the channel here is about 400 m, therefore, if there exists an encounter situation in the channel, it is very easy to form a conflict. In addition, during the maintenance and dredging of this channel, some engineering vessels are continuously operating in the channel, and other vessels may easily form conflicts with such vessels.
From the three spatio-temporal results, it can be found that conflicts are easy to occur in the traffic separation system (TSS) of Ningbo Zhoushan Port, especially at the intersection and the waterway in the south. The reasons for this phenomenon may be the following two aspects, on the one hand, the ship traffic flow is large, on the other hand, it may be due to there is a difference of the size of safety domain between the applied in this study and the actual in TSS.
Due to the simple pre-processing of the ship trajectory data in the early stage, the calculated collision frequency values will be smaller than the actual situation. The collision correlation between different ship types and ship sizes can be further analyzed in the later stage to further the risk evaluation of the waters.

5. Conclusions

The purpose of this study is to provide a framework for the design of a ship collision risk assessment system in port waters. In this paper, based on the estimated data of ships in port waters, we proposed a novel track pairs collision detection algorithm for evaluating the risk of ship collision in the port water. Based on considering the influence of ship speed on trajectory compression, the DP algorithm is improved, which not only ensures the compression ratio but also retains more key behavior points. The framework has great flexibility and can realize collision risk assessment in different waters, which is helpful to identify high-risk areas in port waters and improve ship management.
In the future, we can study the collision probability of this water by collecting collision accident data and determine a more suitable ship safety domain through comparative experiments.

Author Contributions

Conceptualization, J.S.; software, J.S.; supervision, Z.L.; writing—original draft, J.S.; writing—review and editing, J.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number (51579025, 51709165) and the Fundamental Research Funds for the Central Universities (3132020134, 3132020139).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Some or all data, models, or code generated or used during the study are available from the corresponding author by request.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. UNCTAD. Review of Maritime Transport, 2021. No. UNCTAD/RMT/2021; United Nations Publications: New York, NY, USA, 2021. [Google Scholar]
  2. Xu, Q.; Wang, N. A survey on ship collision risk evaluation. Promet-Traffic Transp. 2014, 26, 475–486. [Google Scholar] [CrossRef]
  3. Liu, D.; Shi, G. Ship collision risk assessment based on collision detection algorithm. IEEE Access 2020, 8, 161969–161980. [Google Scholar] [CrossRef]
  4. Fan, H.S.L.; Cao, J.M. Sea space capacity and operation strategy analysis system. Transp. Plan. Technol. 2000, 24, 49–63. [Google Scholar] [CrossRef]
  5. Li, Q.; Lam, J.S.L. Conflict resolution for enhancing shipping safety and improving navigational traffic within a seaport: Vessel arrival scheduling. Transp. A Transp. Sci. 2017, 13, 727–741. [Google Scholar] [CrossRef]
  6. Ozturk, U.; Cicek, K. Individual collision risk assessment in ship navigation: A systematic literature review. Ocean. Eng. 2019, 180, 130–143. [Google Scholar] [CrossRef]
  7. Chai, T.; Weng, J.; Xiong, D.-Q. Development of a quantitative risk assessment model for ship collisions in fairways. Saf. Sci. 2017, 91, 71–83. [Google Scholar] [CrossRef]
  8. Weng, J.; Meng, Q.; Qu, X. Vessel collision frequency estimation in the Singapore Strait. J. Navig. 2012, 65, 207–221. [Google Scholar] [CrossRef] [Green Version]
  9. Goerlandt, F.; Montewka, J.; Kuzmin, V.; Kujala, P. A risk-informed ship collision alert system: Framework and application. Saf. Sci. 2015, 77, 182–204. [Google Scholar] [CrossRef] [Green Version]
  10. Debnath, A.K.; Chin, H.C.; Haque, M.M. Modelling port water collision risk using traffic conflicts. J. Navig. 2011, 64, 645–655. [Google Scholar] [CrossRef] [Green Version]
  11. Yoo, Y.; Lee, J.-S. Collision Risk Assessment Support System for MASS RO and VTSO Support in Multi-Ship Environment of Vessel Traffic Service Area. J. Mar. Sci. Eng. 2021, 9, 1143. [Google Scholar] [CrossRef]
  12. Douglas, D.H.; Peucker, T.K. Algorithms for the reduction of the number of points required to represent a digitized line or its caricature. Cartogr. Int. J. Geogr. Inf. Geovisualization 1973, 10, 112–122. [Google Scholar] [CrossRef] [Green Version]
  13. Lou, Y.; Zhang, C.; Zheng, Y.; Xie, X.; Wang, W.; Huang, Y. Map-matching for low-sampling-rate GPS trajectories. In Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, Seattle, WA, USA, 4–6 November 2009; pp. 352–361. [Google Scholar]
  14. Xiang, L.; Wu, T.; Gong, J. A Geo-spatial Information Oriented Trajectory Model and Spatio-temporal Pattern Querying. Acta Geod. Cartogr. Sin. 2014, 43, 982–988. [Google Scholar]
  15. Ying, J.J.; Su, J. On velocity-preserving trajectory simplification. In Proceedings of the Asian Conference on Intelligent Information and Database Systems, Da Nang, Vietnam, 14–16 March 2016; Springer: Berlin/Heidelberg, Germany, 2016; pp. 241–250. [Google Scholar]
  16. Ericson, C. Real-Time Collision Detection; CRC Press: Boca Raton, FL, USA, 2004. [Google Scholar]
  17. Tang, M.; Manocha, D.; Tong, R. Fast continuous collision detection using deforming non-penetration filters. In Proceedings of the 2010 ACM SIGGRAPH Symposium on Interactive 3D Graphics and Games, Washington, DC, USA, 19–21 February 2010. [Google Scholar]
  18. Shi, J.; Liu, Z. Deep Learning in Unmanned Surface Vehicles Collision-Avoidance Pattern Based on AIS Big Data with Double GRU-RNN. J. Mar. Sci. Eng. 2020, 8, 682. [Google Scholar] [CrossRef]
Figure 1. The study area for collision risk assessment.
Figure 1. The study area for collision risk assessment.
Jmse 10 00216 g001
Figure 2. Framework of collision risk assessment.
Figure 2. Framework of collision risk assessment.
Jmse 10 00216 g002
Figure 3. Synchronous Euclidean Distance. Where segment AC is the successive track points after compressing and point list [ A , B , C ] is the successive actual location of the track. Point B is the generated synchronization point. S E D ( A , B , C ) is the Synchronous Euclidean distance between the point B and B . The velocity of the Synchronized point B is v B = v A + v AC × ( t B t A ) / ( t C t A ) , the time is t B = t B and the course is c B = c A + c AC × ( t B t A ) / ( t C t A ) .
Figure 3. Synchronous Euclidean Distance. Where segment AC is the successive track points after compressing and point list [ A , B , C ] is the successive actual location of the track. Point B is the generated synchronization point. S E D ( A , B , C ) is the Synchronous Euclidean distance between the point B and B . The velocity of the Synchronized point B is v B = v A + v AC × ( t B t A ) / ( t C t A ) , the time is t B = t B and the course is c B = c A + c AC × ( t B t A ) / ( t C t A ) .
Jmse 10 00216 g003
Figure 4. Determining the time interval from trajectory pairs.
Figure 4. Determining the time interval from trajectory pairs.
Jmse 10 00216 g004
Figure 5. A conflict example for trajectories pairs.
Figure 5. A conflict example for trajectories pairs.
Jmse 10 00216 g005
Figure 6. Relative motion of vessel pair at a time interval and ship conflicts.
Figure 6. Relative motion of vessel pair at a time interval and ship conflicts.
Jmse 10 00216 g006
Figure 7. The relationship between retaining ratio and speed constraint or course constraint. (a) is the relationship between the retaining ratio and course constraint when the speed constraint is 87 and the threshold of the Euclidean distance is 100 times; (b) is the relationship between the retaining ratio and speed threshold when the course threshold is 27 and the threshold of the Euclidean distance is 100 times.
Figure 7. The relationship between retaining ratio and speed constraint or course constraint. (a) is the relationship between the retaining ratio and course constraint when the speed constraint is 87 and the threshold of the Euclidean distance is 100 times; (b) is the relationship between the retaining ratio and speed threshold when the course threshold is 27 and the threshold of the Euclidean distance is 100 times.
Jmse 10 00216 g007
Figure 8. Correlation between the standard error of the course difference threshold, the speed difference threshold, and the course difference or speed mean difference. (a) shows the course difference; (b) shows the speed difference.
Figure 8. Correlation between the standard error of the course difference threshold, the speed difference threshold, and the course difference or speed mean difference. (a) shows the course difference; (b) shows the speed difference.
Jmse 10 00216 g008
Figure 9. Correlation between the course difference threshold, speed difference threshold, and the mean error of course difference or speed difference. (a) shows the course difference; (b) shows the speed difference.
Figure 9. Correlation between the course difference threshold, speed difference threshold, and the mean error of course difference or speed difference. (a) shows the course difference; (b) shows the speed difference.
Jmse 10 00216 g009
Figure 10. The relative error between Synchronous Euclidean Distance error of speed and course of RDP and MCDP.
Figure 10. The relative error between Synchronous Euclidean Distance error of speed and course of RDP and MCDP.
Jmse 10 00216 g010
Figure 11. Comparison between Synchronous Euclidean Distance error of speed and course of RDP and MCDP. (ad) show the total SED error of speed, total SED error of course, average SED error of speed, and average SED error of course, respectively.
Figure 11. Comparison between Synchronous Euclidean Distance error of speed and course of RDP and MCDP. (ad) show the total SED error of speed, total SED error of course, average SED error of speed, and average SED error of course, respectively.
Jmse 10 00216 g011
Figure 12. Comparison between MCDP and RDP.
Figure 12. Comparison between MCDP and RDP.
Jmse 10 00216 g012
Figure 13. Charts comparing with the RDP algorithm and the MCDP algorithm (Our algorithm). (a) is the velocity simplification comparison with RDP, original, and MCDP; (b) is the course simplification comparison with RDP, original, and MCDP. The threshold was set as 50 times the ship length to width.
Figure 13. Charts comparing with the RDP algorithm and the MCDP algorithm (Our algorithm). (a) is the velocity simplification comparison with RDP, original, and MCDP; (b) is the course simplification comparison with RDP, original, and MCDP. The threshold was set as 50 times the ship length to width.
Jmse 10 00216 g013
Figure 14. Compression results of vessel trajectory in the parallel experiment. (a) Original trajectory; (b) The simplified trajectory by RDP; (c) The simplified trajectory by MCDP.
Figure 14. Compression results of vessel trajectory in the parallel experiment. (a) Original trajectory; (b) The simplified trajectory by RDP; (c) The simplified trajectory by MCDP.
Jmse 10 00216 g014
Figure 15. Relative speed distribution for vessels involving in different conflicts. (a) Head-on conflict; (b) Crossing conflict; (c) Overtaking conflict.
Figure 15. Relative speed distribution for vessels involving in different conflicts. (a) Head-on conflict; (b) Crossing conflict; (c) Overtaking conflict.
Jmse 10 00216 g015aJmse 10 00216 g015b
Figure 16. Length distribution for vessels involved in different conflicts. (a) Head-on conflict; (b) Crossing conflict; (c) Overtaking conflict.
Figure 16. Length distribution for vessels involved in different conflicts. (a) Head-on conflict; (b) Crossing conflict; (c) Overtaking conflict.
Jmse 10 00216 g016aJmse 10 00216 g016b
Figure 17. Spatio-temporal distribution of head-on conflicts.
Figure 17. Spatio-temporal distribution of head-on conflicts.
Jmse 10 00216 g017
Figure 18. Spatio-temporal distribution of crossing conflicts.
Figure 18. Spatio-temporal distribution of crossing conflicts.
Jmse 10 00216 g018
Figure 19. Spatio-temporal distribution of overtaking conflicts.
Figure 19. Spatio-temporal distribution of overtaking conflicts.
Jmse 10 00216 g019
Table 1. Description of the measurement.
Table 1. Description of the measurement.
MeasurementDescriptionUnit
SEDtotvTotal speed synchronization error of trajectoriesKnots
SEDavgvAverage speed synchronization error of trajectoriesKnots
SEDtotcTotal course synchronization error of trajectoriesDegree
SEDavgcAverage course synchronization error of trajectoriesDegree
THRThreshold (times the ratio of ship length to width)None
Table 2. Quantity analysis for comparison between the existing method and proposed method.
Table 2. Quantity analysis for comparison between the existing method and proposed method.
THRRDPMCDPDeleted Incorrectly
RemainedRatio (%)RemainedRatio (%)
10467,4510.577976077469,4250.5761939111974
20343,3020.690060227347,7030.6860869184401
30281,6840.745690165287,0160.7408763315332
40242,8890.780715051248,9490.7752439646060
50216,1430.804861864222,7590.798888816616
60196,8560.822274546203,5060.8162707956650
70181,8830.835792463188,8850.8294709217002
80170,0020.846518863177,0630.8401440547061
90160,1800.855386357167,1730.849072946993
Table 3. Speed loss of RDP and MCDP on the dataset with different thresholds.
Table 3. Speed loss of RDP and MCDP on the dataset with different thresholds.
THRRDPMCDP
SEDtotvSEDavgvSEDtotvSEDavgv
1093,320.93370.050690,102.90130.0496
20146,309.76290.0824139,549.07540.0803
30188,422.11340.1085178,265.47370.1052
40224,651.37450.1312210,409.25790.1266
50256,911.20680.15082239,676.80590.1453
60285,895.66880.1681265,157.20750.1616
70311,667.07480.1840288,211.90250.1763
80335,843.82890.1992309,908.02170.1907
90359,218.89100.2129329,731.61560.2036
Table 4. Course loss of RDP and MCDP on the dataset with different thresholds.
Table 4. Course loss of RDP and MCDP on the dataset with different thresholds.
THRRDPMCDP
SEDtotcSEDavgcSEDtotcSEDavgc
101,711,254.81411.08101,663,394.01281.0656
202,713,959.26321.75902,620,121.49881.7256
303,503,489.63102.31693,373,388.17812.2699
404,174,016.66522.81593,990,199.76712.7510
504,740,253.51913.21014,524,679.07143.1326
605,245,404.40853.57045,001,570.17203.4821
705,716,247.33303.89965,441,852.02863.8040
806,134,831.49634.19665,837,147.10614.0917
906,540,768.79904.47506,211,981.42994.3621
Table 5. Comparison of collision frequency in different duty periods.
Table 5. Comparison of collision frequency in different duty periods.
Collision TypeHead OnCrossingOvertakingOverall
First officer 6.52 × 10 3 5.18 × 10 2 5.14 × 10 2 1.097 × 10 1
Second officer 2.21 × 10 3 3.42 × 10 2 2.75 × 10 2 6.396 × 10 2
Third officer 4.90 × 10 3 3.61 × 10 2 4.59 × 10 2 8.689 × 10 2
Table 6. Comparison of collision frequency in different encounter situations.
Table 6. Comparison of collision frequency in different encounter situations.
Collision TypeMean/DayStd/DayFp-Value
Head-on
First officer 7.24 × 10 4 3.109 × 10 3 5.920.008
Second officer 2.45 × 10 4 6.49 × 10 4
Third officer 5.44 × 10 4 2.45 × 10 4
Crossing
First officer 5.75 × 10 3 9.64 × 10 2 2.680.089
Second officer 3.80 × 10 3 2.99 × 10 2
Third officer 4.01 × 10 3 4.25 × 10 2
Overtaking
First officer 5.72 × 10 3 1.36 × 10 1 3.690.040
Second officer 3.06 × 10 3 2.55 × 10 2
Third officer 5.10 × 10 3 1.27 × 10 1
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Shi, J.; Liu, Z. Track Pairs Collision Detection with Applications to Ship Collision Risk Assessment. J. Mar. Sci. Eng. 2022, 10, 216. https://doi.org/10.3390/jmse10020216

AMA Style

Shi J, Liu Z. Track Pairs Collision Detection with Applications to Ship Collision Risk Assessment. Journal of Marine Science and Engineering. 2022; 10(2):216. https://doi.org/10.3390/jmse10020216

Chicago/Turabian Style

Shi, Jiahui, and Zhengjiang Liu. 2022. "Track Pairs Collision Detection with Applications to Ship Collision Risk Assessment" Journal of Marine Science and Engineering 10, no. 2: 216. https://doi.org/10.3390/jmse10020216

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop