Next Article in Journal
Measuring the Multiple Functions and Tradeoffs among Streets: A New Framework Using the Deep Learning Method
Next Article in Special Issue
Research on the Evolution and Driving Factors of the Economic Spatial Pattern of the Guangdong–Hong Kong–Macao Greater Bay Area in the Context of the COVID-19 Epidemic
Previous Article in Journal
Internet in the Middle of Nowhere: Performance of Geoportals in Rural Areas According to Core Web Vitals
Previous Article in Special Issue
A Study of the Impact of COVID-19 on Urban Contact Networks in China Based on Population Flows
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Head/Tail Breaks-Based Approach to Characterizing Space-Time Risks of COVID-19 Epidemic in China’s Cities

1
School of Geography and Environment, Jiangxi Normal University, Nanchang 330022, China
2
Key Laboratory of Poyang Lake Wetland and Watershed Research, Ministry of Education, Jiangxi Normal University, Nanchang 330022, China
*
Author to whom correspondence should be addressed.
ISPRS Int. J. Geo-Inf. 2023, 12(12), 485; https://doi.org/10.3390/ijgi12120485
Submission received: 9 October 2023 / Revised: 23 November 2023 / Accepted: 27 November 2023 / Published: 29 November 2023
(This article belongs to the Collection Spatial Components of COVID-19 Pandemic)

Abstract

:
The novel coronavirus pneumonia (COVID-19) pandemic has caused enormous impacts around the world. Characterizing the risk dynamics for urgent epidemics such as COVID-19 is of great benefit to epidemic control and emergency management. This article presents a novel approach to characterizing the space-time risks of the COVID-19 epidemic. We analyzed the heavy-tailed distribution and spatial hierarchy of confirmed COVID-19 cases in 367 cities from 20 January to 12 April 2020, and population density data for 2019, and modelled two parameters, COVID-19 confirmed cases and population density, to measure the risk value of each city and assess the epidemic from the perspective of spatial and temporal changes. The evolution pattern of high-risk areas was assessed from a spatial and temporal perspective. The number of high-risk cities decreased from 57 in week 1 to 6 in week 12. The results show that the risk measurement model based on the head/tail breaks approach can describe the spatial and temporal evolution characteristics of the risk of COVID-19, and can better predict the risk trend of future epidemics in each city and identify the risk of future epidemics even during low incidence periods. Compared with the traditional risk assessment method model, it pays more attention to the differences in the spatial level of each city and provides a new perspective for the assessment of the risk level of epidemic transmission. It has generality and flexibility and provides a certain reference for the prevention of infectious diseases as well as a theoretical basis for government implementation strategies.

1. Introduction

The COVID-19 pandemic caused by the novel coronavirus (SARS-CoV-2) infection has constituted a global health security threat since 2020 [1,2,3]. As of June 2023, the COVID-19 pandemic has lasted over three years, posing great challenges to regional epidemic prevention and control [4]. Characterizing the risk dynamics of the COVID-19 epidemic is of great benefit to epidemic control and emergency management. It is urgent to establish an effective assessment system to identify space-time risks which can provide support for formulating epidemic control and prevention policies [5]. Since the COVID-19 outbreak, governments have launched various measures to combat the pandemic, such as quarantines, the closure of cities, and the suspension of large gatherings [6,7]. Meanwhile, researchers have conducted a large number of studies to combat the COVID-19 pandemic, focusing on the clinical characteristics [8], viral gene sequencing [9], epidemic transmission characteristics [10,11], spatiotemporal distribution patterns [12,13,14,15], space-time risk assessments [16,17], and others.
With the advent of the post-epidemic era, the study of space-time risk assessment of the COVID-19 epidemic has gradually become a focus of attention [18,19]. Various data sources related to the epidemic’s spread, e.g., social and environmental data, were used to identify the dynamics of epidemic risks. The population-related data played an important role in risk assessment, such as population movement data [20,21], residential population data [22], and population density data [23], which were usually correlated with reported cases to support risk modelling. In addition, spatiotemporal big data such as Tencent location-based big data can be applied to indicate the population flow and help construct an epidemic model to evaluate the risk of the epidemic [24]. Smartphone signaling data can also be used to indicate human mobility and support modelling the human-to-human transmission networks to assess the risk of COVID-19 infections [25]. Many scholars have assessed the risks of COVID-19 transmission in different countries and regions associated with various natural and social environmental factors [16,25], and results have shown that the COVID-19 epidemic spread was due to multiple environmental, economic, and social factors [26], in which population density was an important determinant of the infection.
On the other hand, many researchers have proposed corresponding modelling approaches for the spatiotemporal spread of COVID-19 epidemics. The most common approach is a series of models derived from the classical SIR model [27,28] based on the principles of infectious disease dynamics, including SEIR models [29,30,31], SEAIR models [32,33], etc., which simulate the COVID-19 epidemic situation in cities. Key epidemiological parameters such as the basic reproduction number (R0) [34,35] have found widespread application in monitoring the transmission dynamics and risk assessment of COVID-19. Additionally, scholars have applied various methods such as spatial stratified heterogeneity statistics [34], Kalman filtering [36,37,38], Bayesian maximum entropy [39], head/tail breaks [40], and more [41,42,43,44,45]. These methods have been employed for various applications in modelling the spatiotemporal spread of COVID-19, including identifying spatiotemporal distribution patterns [46,47], mapping spatiotemporal disease distributions [48], describing spatiotemporal characteristics of the epidemic [49,50,51], and assessing its impact on production and daily life in China [52,53,54]. These studies cover different stages of epidemic transmission.
China, with its vast territory and large population, exhibits significant regional disparities. Risk assessment and early warning require hierarchical zoning, with different regions at various risk levels adopting different prevention and control measures. However, research in this area remains limited. The factors influencing the transmission and spread of the COVID-19 virus are multifaceted and include not only the intrinsic biological factors of the infected [55,56,57] but also natural and social factors [47,58] such as temperature [58], population mobility [59], and economic development [60,61]. Relevant studies have indicated that the spatial hierarchy of epidemic cases is correlated with the spatial hierarchy of the underlying population. Comparing the spatial hierarchies of COVID-19 cases and the population, the results show that the pandemic is largely determined by the underlying population, with an R2 value as high as 0.82 [40]. In simple terms, the more the population, the more cases there are. For a specific study area, the cases and the population are distributed across different units, such as different cities, forming different spatial hierarchies. Therefore, we can identify the potential risk of epidemic transmission by comparing the differences in their relative hierarchies.
With this consideration in mind, our research endeavors to introduce a novel approach to characterizing the space-time risk of COVID-19 outbreaks in Chinese cities. We employ a “head/tail breaks approach” [62,63,64] to delineate the spatial hierarchy of the COVID-19 cases and populations within each city for every date or specified period (e.g., weekly). Subsequently, we compare the disparities in their relative hierarchies. We then define a risk assessment metric to ascertain the risk levels for each city. This approach enables the examination of the dynamic risk of epidemic spread in both the spatial and temporal domains. Unlike traditional classification and stratification methods, the head/tail breaks method applies to data with heavy tail distribution [62,65] and can better reveal the inherent hierarchical structures of things, which can effectively portray regional differences in the space-time risk of the COVID-19 epidemic. At present, although there have been studies applying the head/tail breaks approach to the space-time mapping of the COVID-19 epidemic [40], there are few, if any, studies on the assessment of the risk level of COVID-19 epidemics. Our study provides a direct method for describing the space-time risk of epidemic transmission, offering a fresh perspective on the dynamics of epidemic risk assessment. Importantly, the methodology we propose not only elucidates the inherent hierarchy of epidemic cases and populations at a macroscopic level but also allows for the characterization of risk dynamics at a local level, such as within specific regions or individual cities. This provides a theoretical basis for government policy implementation, offers insights for domestic and international efforts in the prevention and control of COVID-19, and imparts valuable lessons for enhancing the utilization of healthcare services in other regions.

2. Data and Methodology

2.1. Study Area and Data Sources

This paper, using China as a case study, examines the space-time evolution of COVID-19 risk. To account for the typicality of the COVID-19 pandemic, this study acquired daily confirmed COVID-19 cases for 367 municipal-level administrative units in China from 20 January 2020 to 12 April 2020, from sources including the National Health Commission of the People’s Republic of China (www.nhc.gov.cn, accessed on 1 April 2020) and official websites of municipal health commissions. As of 12 April, China had reported 108 new confirmed cases, bringing the total cumulative confirmed cases to 82,160. The development of the COVID-19 situation in China exhibits distinct phases. Based on the trend in daily new confirmed cases (as shown in Figure 1a), the pandemic in China can be divided into three stages: a rapid growth phase (20 January–12 February), a controlled reduction phase (13 February–1 March), and a stable maintenance phase (2 March–12 April). It is noteworthy that, on 12 February, there was a peak in the number of new confirmed cases due to a change in the statistical criteria, which included clinical diagnosis cases as confirmed cases. This change aided in controlling the outbreak, increasing the recovery rate, and reducing the mortality rate. Spatially (as depicted in Figure 1b), the confirmed cases are concentrated in the eastern regions, particularly in areas surrounding Wuhan, such as Huanggang, Xiaogan, Chongqing, and other cities, which were significantly affected by the pandemic. In contrast, the northwestern and southwestern regions experienced a relatively lower impact, with fewer new cases of COVID-19. The spread of the pandemic exhibits periodic characteristics, and we will explore the space-time evolution of the COVID-19 risk in various Chinese cities over 12 weeks, using 7-day intervals.
We collected 2019 population data and administrative unit area data for each city from the 2019 Statistical Yearbook. Subsequently, we calculated the incidence rate and population density for each city. Population mobility data were obtained from Baidu Migration Data (https://qianxi.baidu.com/#/, accessed on 1 July 2020), which includes metrics such as intra-city travel intensity and a migration scale index for inflow and outflow. Given that population inflow is a significant influencing factor in the spread of the epidemic, we selected the inflow intensity index to represent population mobility data in our study.

2.2. Methodology

2.2.1. Heavy-Tailed Distributions

Heavy-tailed distributions [66,67,68] are characterized by a split between the mean of the data and the majority of the small values in the tail of the distribution, with only a minority of the large values in the head. The imbalance between the head and tail of heavy-tailed distributions can be conceptualized as the dominance of the small values over the large ones. They exhibit a feature of having a “fat head” and a “long tail”, in statistical terms. A common visualization tool for exploring heavy-tailed distributions is the rank-size distribution plot (as shown in Figure 2), where ranks are plotted on the x-axis and corresponding values or sizes on the y-axis. Initially used in studies of word frequencies and city sizes [62], this visualization provides a straightforward and intuitive way to investigate heavy-tailed distributions. Common forms of heavy-tailed distributions include power-law distributions, log-normal distributions, exponential distributions, and others.
For a given dataset, specific statistical methods, such as power-law fitting [67], can be employed to detect whether it follows a heavy-tailed distribution. Power-law distributions reveal scaling relationships within data, indicating that small values occur much more frequently than large ones. The expression for a power-law distribution is given by:
y = c x α
In this context, α represents the power-law exponent or scaling parameter (1 < α < 3), and c denotes a constant. Typically, a higher power-law exponent, α , indicates greater spatial heterogeneity. In other words, a power-law exponent of 3 implies more heterogeneity compared to an exponent of 1 [40].
As for power-law testing, the simplest approach involves taking the logarithm of both sides of Equation (1). The resulting function expression is as follows, and allows us to assess whether it forms a straight line on a double-logarithmic graph.
ln   y = α ln x + ln c
However, this method has limitations and relies on a substantial amount of data for reliability, being suitable primarily for discrete data. In this study, we employed a robust maximum likelihood approach and the Kolmogorov–Smirnov test to assess power-law distributions. The indicators of power-law fitting are represented by the estimated exponent α ^ and goodness-of-fit index p A higher p value indicates a better fit to the power-law distribution. A p value approaching 1 suggests that the data closely follow a power-law distribution, with p values exceeding 0.1 considered acceptable for fitting.

2.2.2. Head/Tail Breaks and ht Index

The head/tail breaks method was initially introduced by scholar Jiang Bin in 2013 as an approach to classify or stratify data with characteristics of heavy-tailed distributions [62], such as power-law distributions. It enables rapid data categorization and reveals the latent hierarchical structure of the dataset. The head/tail segmentation is a recursive function, as shown in Figure 3, used to iteratively deduce the head and tail for all levels (scales) and recursively unveil the inherent hierarchical structure of the dataset. Based on a descending order of data sorting, the original data are split into the head, consisting of values higher than the mean, and the tail, consisting of values lower than the mean. Since the tail exhibits minimal variation, there is no need to segment values below the mean. This process continues to iteratively split the head until the concept that small values are significantly more abundant than large values is violated, signifying that the head no longer conforms to a heavy-tailed distribution, at which point the segmentation stops.
The structural hierarchy levels obtained using the head/tail break method can be quantified using the ht index [69], revealing the number of levels in the hierarchy. The calculation formula involves adding 1 to the number of recursive segmentations. There are two fundamental principles governing hierarchical structures: scaling laws, where smaller levels are far more numerous than larger ones within the hierarchy, and Tobler’s law, which stipulates the presence of varying degrees of similarity within each level of the rank structure. In other words, spatial heterogeneity and spatial homogeneity, or spatial dependency, can be simultaneously observed in the hierarchical structure. A higher ht index signifies a higher level of hierarchy within the structure.
We also employed the aforementioned set of 1001 data points (1, 1/2, 1/3, …, 1/1001) to further illustrate this classification scheme (Figure 3). The average of the 1001 data points is 0.0075, which divides the 1001 data points into two categories: the head, consisting of data points greater than the average (1, 1/2, …, 1/133), and the tail, consisting of data points lower than the average (1/134, 1/135, …, 1/1001). For the data points in the head category (1, 1/2, …, 1/133), their average is 0.0411. This value further divides the head data into two subcategories: the upper head (1, 1/2, …, 1/24) and the lower head (1/25, 1/26, …, 1/133). We continue this process with the upper head data, obtaining a third average value of 0.1573, which further divides the upper head data into two subcategories. Subsequently, we obtain a fourth average value of 0.4083, which further divides the six data points into two categories. After this, the data no longer follow a heavy-tailed distribution, and the segmentation process stops. In total, the data iterated (was segmented) four times, and the ht index is expressed as the number of recursive segmentations (as stated on the previous page) + 1, meaning the ht index is 5. The data are classified into five categories: [1/134, 1/135, …, 1/1001], [1/25, 1/26, …, 1/133], [1/7, 1/8, …, 1/24], [1/3, 1/4, …, 1/6], and [1, 1/2]. These correspond to five hierarchical structure levels: L1, L2, L3, L4, and L5. Furthermore, we can calculate the hierarchical level of each city in the total number of levels, i.e., L i N = ht index rank, where L i represents hierarchical structure levels, and N is the ht index.

2.2.3. Epidemic Risk Measurement

This study provides a dynamic perspective for identifying the space-time risk of an epidemic in the space-time domain. We examine the disparity between the relative levels of cases and the population at each date for each specific region across the entire study area. The space-time risk refers to the likelihood of an outbreak around a specific region at a certain date. In this study, we define a risk assessment indicator, R i , t , to characterize the space-time risk of COVID-19.
R i , t = L i , t ( c a s ) N t ( c a s ) L i p o p N p o p
where N t ( c a s ) denotes the ht index of COVID-19 confirmed cases at the moment t , L i , t ( c a s ) denotes the hierarchical level of COVID-19 confirmed cases at the moment t , N ( p o p ) is the ht index of the population density, L i , t ( c a s ) N t ( c a s ) denotes the t , which is the time rank of COVID-19 confirmed cases; L i ( p o p ) denotes the hierarchical level of the urban population density and L i ( p o p ) N ( p o p ) represents the rank of the urban population. R i , t denotes the risk value for a certain region i at time t . From Equation (3), we can obtain three possible outcomes: R i , t > 0 indicates high risk, R i , t = 0 indicates medium risk, and R i , t < 0 indicates low risk. Obviously, if a region has a lower level of population density but a higher level of confirmed cases, the risk of epidemic spread exists. Specifically, while a region has a relatively low level of cases but a relatively high level of population density, R i , t is negative, which indicates low risk. While the two relative levels of cases and population density are identical, R i , t equals zero and indicates medium risk. Further, while the relative level of the population density is low but the relative level of cases is high, R i , t is positive, which indicates high risk. Thus, if the risk assessment indicator, R i , t , is greater than 0, it indicates the epidemic risk of a certain city i at a certain date t (a higher value of R i , t indicates increased risk). It is worth noting that this study does not consider the situations of medium and low risk, focusing on the spatiotemporal variations in high-risk areas. The positive value of the risk assessment indicator represents the epidemic risk across the space-time domain. Its greater value indicates a higher risk. Specifically, when a region exhibits the highest hierarchical level of cases at a certain date, and meanwhile exhibits the lowest level of population density, the epidemic risk reaches the highest level (approaching 1). According to Equation (3), the highest risk value in this study was calculated as up to 0.84.

3. Results

3.1. Exploration of Heavy-Tailed Distribution

We first examined the heavy-tailed distributions of COVID-19 cases and population density. The results indicated that both exhibited heavy-tailed distributions. According to statistics, the total population density of 367 Chinese cities in 2019 was 148,161.80 individuals per square kilometre, with an average population density of 403.71 individuals per square kilometre. Among these, the highest population density was 6727.91 individuals per square kilometre, while the lowest was 0.35 individuals per square kilometre. Table 1 displays the results of population density head/tail break calculations, involving five iterations of the population density data. Figure 4a,b depict the top three stratification levels of the population density and the magnitude ranking of the confirmed COVID-19 cases in the first week, respectively, using embedded insets. The head data are represented in red, while the tail data are in blue. Two smaller insets are added for each dataset, providing a visual representation of the heavy-tailed distribution characteristics of the data and further illustrating the role of the data hierarchy structure. As shown in Figure 4a, the largest plot encompasses population density data for all 367 cities. Dark red indicates the first head (126 cities), while dark blue represents the first tail (241 cities). Replotting the population density rank-size plot for the 126 cities in the first head reveals that dark red denotes 30 cities with a population density in the head, while dark blue represents 96 cities with a population density in the tail. The smallest plot is for the population density rank-size plot within the second head, consisting of 30 cities, and it also exhibits heavy-tailed characteristics in the population density data.
Throughout the power-law fitting process, except for the first week, the power-law exponent α remained around 2, with goodness-of-fit (p) values consistently exceeding 0.1. We did a power-law estimation test for the number of confirmed COVID-19 cases at 12 weeks (Figure 4c–e), and as shown in Figure 4d,e, the maximum value of the power-law estimation index for COVID-19 cases (α = 2.02) occurred in the first week, while the minimum value of the goodness-of-fit p (p = 0.048 (p < 0.05)) also occurred in this time period, and it did not pass the power-law test. The COVID-19 confirmed cases did not show a power-law distribution in the first week. However, the pattern of power-law distribution was presented after only the second week, and the power-law distribution of the COVID-19 confirmed cases showed a trend of gradually decreasing significance with increasing time (Figure 4c: change from dark blue to light blue).
In the context of the dynamic development of the epidemic, the power-law estimation index α underwent an overall decreasing and then increasing evolution, with an inverted “V” shape (Figure 4d). The most robust spatial heterogeneity was observed for COVID-19 cases in the second week (α = 2.02, p > 0.05) and the smallest in the seventh week (α = 1.58, p > 0.05). We can find that, as the power law estimation index decreases, the scale effect of the epidemic spread decreases and so does the spatial heterogeneity. At the beginning of the epidemic, the local outbreak was at the “low level of spread” stage, the scale effect of the epidemic spread was not obvious, and there was no power-law distribution. With the spread of the epidemic, the COVID-19 cases gradually showed a power-law distribution. It is worth clarifying that the confirmed cases of COVID-19 did not show a power-law distribution pattern in the first week, but it is clear from the result of their head/tail breaks (Table 1) and rank-size plot (Figure 4b) that they also satisfy the heavy-tailed distribution characteristics.

3.2. The ht Index and Spatial Hierarchy of COVID-19 Confirmed Cases and Population Density

We can use head/tail breaks to derive the hierarchical structures for population densities and the numbers of confirmed COVID-19 cases (See Appendix A for full details). The ht index indicates the spatial hierarchy structure. For example, the population density ht index = 6, implying the presence of six hierarchical levels, namely L1 (0–404), L2 (405–861), L3 (862–1668), L4 (1669–3151), L5 (3152–4666), and L6 (4667–6728). From Figure 5, it can be found that, for the population densities, the hierarchical structure had a stable ht index value of 6. Or, we can say that the spatial hierarchy of the population densities had a total of six levels. We employed choropleth symbolization to visualize the spatial hierarchy levels of the population density for each city. The darker the color, the higher the spatial hierarchy level, indicating that L1 (pop: 0–404) < L2 (pop: 405–861) < L3 (pop: 862–1668) < L4 (pop: 1669–3151) < L5 (pop: 3152–4666) < L6 (pop: 4667–6728). As depicted in Figure 6, cities with a higher population density are generally located in the eastern regions, particularly in some coastal areas such as Shenzhen and Shanghai. For the epidemic cases (Figure 5), the ht index value varied from 3 to 5. The first week had the largest ht index value of 5. At first, there were a total of five levels in the spatial hierarchy of the cases, and then, the ht index varied around 3 and 4 during the entire period.
To explore the inherent spatial hierarchy of COVID-19 cases, we further generated spatial hierarchy maps for the first week (ht index = 5), the sixth week (ht index = 3), and the 12th week (ht index = 4) in relation to the number of COVID-19 cases (Figure 6b–d). Darker colors indicate higher spatial hierarchy levels of the cities. In Figure 6b, during the first week, the cities with higher levels of COVID-19 cases were concentrated around Wuhan, with five cities being at the highest level, namely Wuhan (500 cases), Huanggang (142 cases), Chongqing (110 cases), Xiaogan (100 cases), and Jingmen (90 cases). By the sixth week (Figure 6c), the cities with higher COVID-19 case numbers remained concentrated around Wuhan, but the hierarchy structure of the case numbers decreased to three levels, indicating a reduction in the heterogeneity of the pandemic spread. After nearly three months of epidemic prevention and control efforts, in the final week of the study (Figure 6d), the ht index for the spatial hierarchy structure of the cases was 4. Overall, the COVID-19 case levels in the eastern region were higher than those in the western region, and the hierarchy structure changed over time. For instance, from the first week to the sixth week, the levels decreased from Wuhan, then in surrounding cities, and then in other areas. By the 12th week, the cities with higher levels were distributed in the northeast region.

3.3. Assessment of the Risk of Transmission of COVID-19 Cases in China

Based on the hierarchical structures of the population density and COVID cases, we can calculate the risk assessment indicators for all cities over 12 weeks and display the spatiotemporal dynamics of high-risk cities. Figure 7 illustrates the spatial distribution of high-risk cities from week 1 to week 12. The color red represents risk, with darker shades indicating a higher risk. The evolution of the COVID-19 space-time risk exhibits significant regional disparities. During the early stages of the study, i.e., from week 1 to week 5 (Figure 7a–d), the high-risk cities were predominantly concentrated around Wuhan, displaying a larger “single-core” structure. However, in the middle weeks, from week 6 to week 8 (Figure 7f–h), the distribution of the high-risk cities became more scattered, with only a few cities exhibiting relatively higher infection risks. Increases in population mobility may contribute to higher epidemic risks. In the final four weeks, i.e., from week 9 to week 12 (Figure 7i–k), the high-risk cities were primarily located in the eastern coastal regions and a few central-western areas, such as Beijing and Chongqing. These cities exhibit high population density, intense population migration, and vibrant socioeconomic activity, resulting in elevated risk values.
To delineate more detailed information about the pandemic risk, we primarily investigate the changes in the number of cities at risk from week 1 to week 12, along with the statistical metrics of the average, standard deviation, and maximum risk values per week. The average is the sum of the space-time risks of all cities divided by the number of cities in each week, which reflects the trend of the space-time risk concentration. Standard deviation mathematically refers to the arithmetic square root of the arithmetic mean of the squared deviation from the mean (i.e., variance). In this study, computing the standard deviation of the risk values over the 12 weeks serves as a measure of uncertainty, representing the precision of the risk estimates. The maximum risk value per week is the one with the largest weekly temporal risk for all cities. Over the entire study period, the number of high-risk cities decreased from 57 in the first week to 6 in the last week (Figure 8). However, it is worth noting that, in the last four weeks, the average value and standard deviation of the risk assessment indicator noticeably increased. While the number of high-risk cities decreased significantly during the study period, greater attention should be paid to the later stages of epidemic control, especially in several cities with extremely high-risk values. It is noteworthy that the maximum risk value occurred in the last three weeks.

3.4. Application of Outbreak Risk Assessment Models

Finally, we selected six representative cities: Beijing, Guangzhou, Wuhan, Shanghai, Wenzhou, and Chongqing, to explore their local-level risk dynamics and compared their 84-day risk assessment values with their incidence rates and population migration intensities (Figure 9). We observed that the risk evolution in these six cities varies, and their relationship with the incidence rate and population migration intensity differs as well.
An increase in the incidence rate contributes to a higher infection risk in a region, which is particularly evident in the early stages of the epidemic (the rapid growth phase and the control reduction phase), as seen in Wenzhou from 26 January to 11 February (Figure 9e) and Chongqing from 21 January to 26 January (Figure 9f). In the middle and later stages of the epidemic, a city’s epidemic risk is mainly influenced by its population mobility. When the incidence rate is low, a city may still face a high risk due to factors such as population mobility. Beijing (Figure 9a) showed a higher population migration rank from 28 March to 10 April, increasing the local epidemic risk as the influx of people grew. From 18 March to 9 April, Wenzhou experienced a gradual rise in its population inflow intensity, reaching a high of 0.42 during that period. Chongqing exhibited similar characteristics from 20 February to 23 February. Furthermore, due to the virus’s high destructiveness, high infection rate, and rapid transmission, the risk persists even with the implementation of government-enforced lockdown measures. This is particularly evident in the Wuhan region (Figure 9c), where the risk remained consistently high, especially from 20 January to 17 March.
It is worth noting that, due to the lag in epidemic risk assessment, situations can occur where risks are not immediately recognized during periods of high incidence rates and high population migration ranks. Instead, risks may manifest later, especially as observed in Guangzhou from 26 January to 6 February (Figure 9b) and Shanghai from 25 January to 4 February (Figure 9d). Therefore, in epidemic control efforts, vigilance should not be relaxed even when the incidence rate is high and the population influx is significant, to prevent potential epidemic rebounds.
In summary, the risk dynamics in these six cities vary, and their relationships with incidence rates and population migration intensity differ as well. The approach we have presented allows for a better description of the risk dynamics in each city. Specifically, an increase in the incidence rate can elevate the risk for a city. However, it is crucial to recognize that, even with lower incidence rates, other factors such as population mobility can still place a city at a high risk.

4. Discussion

Epidemic risk assessment has been a focal point of research in post-pandemic China and other countries. In recent years, the head/tail break method has been widely applied across various domains, including urban studies [62,64], environmental pollution studies [70], and more, offering functions such as image texture extraction and data visualization. In epidemiology, the head/tail break method is commonly employed as a classification approach to visualize the spread patterns of diseases, such as the mapping of the global COVID-19 pandemic [40].
This study employed the head/tail segmentation method to investigate the space-time evolution of COVID-19 risks in China. The COVID-19 pandemic is influenced by underlying populations, and the spatial hierarchies of epidemic cases in different regions correlate with the spatial hierarchy of the base population. By comparing the differences in the relative hierarchy of the population density for confirmed COVID-19 cases, a risk measurement model was constructed. This proposed risk measurement model quantifies the impact of population density on infectious disease transmission and is feasible for identifying areas with a high risk of infection in urban epidemic surveillance. Important research results were achieved, providing valuable references for cities facing increasing cases during the pandemic.
In the early stages of the new coronary pneumonia outbreak, many researchers have predicted the potential outbreak size and duration through various models [71,72,73]. For instance, the basic reproduction number (R0), serving as a crucial indicator of infectious disease transmission, elucidates the extent of COVID-19 case dissemination. In real-time studies of epidemic transmission risk, the effective reproduction number (Rt) is employed to signify the current epidemic transmission index, which studies have used to provide some useful clues for health authorities in China and other countries or regions. In contrast, one of the significant contributions of our study is the introduction of a method to directly depict epidemic risks. Based on two parameters, namely newly confirmed COVID-19 cases and population density, we constructed a risk measurement model that dynamically assesses the risk of epidemic spread through spatially stratified ranking. Compared with other epidemic risk assessment models such as SEIR models and Kalman filtering, the risk assessment indicator proposed in this paper, R i , t , is more straightforward and user-friendly.
Our study of one-week units reveals the risk of COVID-19 transmission in China. It is worth stating that the virus does not adhere to or respect either aggregation definition. Hence, different time periods (e.g., 10-day or 14-day) would likely produce different results. Moreover, a different spatial structure (e.g., at a coarser administrative level) would also produce different results. In other words, the results produced by this article are only valid under the assumption of a 7-day week and the municipal administrative district. Other temporal frameworks or enumeration unit boundary structures for the same period (12 weeks) and geographic area (China) will result in different outcomes. Thus, the effects of the modifiable areal unit problem (MAUP) cannot be ignored. We observed that areas with higher pre-epidemic levels may have a higher risk of infection. The number of high-risk cities increased sharply, mainly concentrated in densely populated coastal cities, central and western regions, and the Beijing-Tianjin-Hebei region. This is because these areas have high population densities and active socio-economic activities. Therefore, the timely identification of densely populated areas should be considered one of the key measures in controlling the spread of the disease [74,75]. After the outbreak, various regions implemented a series of targeted infection prevention and control measures, including isolating confirmed cases and suspected cases, breaking the transmission chain, and reducing the spread of COVID-19 in various regions. As a result, the epidemic in China was well-controlled within a short period (6–8 weeks).
In addition, the results of the risk assessment not only reflect the inherent hierarchy of the epidemiological and demographic context at the macro level but also portray the dynamics of risk evolution at the micro level for local areas or individual cities. The analysis of the daily risk assessment values of six cities, Beijing, Guangzhou, Wuhan, Shanghai, Wenzhou, and Chongqing, compared with their COVID-19 incidence rates and the ranking of their population movement intensity, reveals that the model has a certain degree of foresight and can effectively assess the risk of an area in the future. When the incidence rate rises, the risk value will also rise. The risk of an epidemic is influenced by population movement, and an increase in the rank of the population inflow will bring about the risk of an epidemic, and the risk value will fluctuate and change. To a certain extent, this provides the basis for the government to adjust and update its strategy and shorten the time for China to prevent and control the epidemic, which provides important support for the rapid resumption of production in February and March.
Another scientific contribution of this study lies in the pioneering application of the head/tail breaks approach to the space-time risk research of epidemics. The proposed method offers a better depiction of the risk evolution in each city. The study visually mapped the space-time risk evolution of each city over 12 weeks. In terms of spatial distribution, the high-risk cities were concentrated around Wuhan in the early stages, forming a “large core,” and were later distributed in coastal areas, forming a “striped” pattern. To validate the relationship between the space-time risk changes in each city and the actual situation, we compared the incidence rates, the ht index ranking of the population influx intensity, and the R i , t values of six cities. We found that the risk assessment index has a certain usability and can be applied to the evaluation of urban epidemic risk. This model has provided a new perspective on the assessment of epidemic transmission risk levels and a new approach to the analysis of space-time changes in epidemic risk.
Although we have revealed spatial and temporal variation in areas at risk of outbreaks in China, further research is needed. As many stringent interventions have been implemented in some cities, more potential variables (preventive and control measures, vaccination, virus transmission characteristics, etc.) should be considered to refine the identification of areas at a high risk of transmission of confirmed COVID-19 cases. Secondly, longer-period datasets of the COVID-19 outbreak in China could be incorporated to further enhance the depth and significance of our research, and time-series data of the COVID-19 outbreaks in other countries or regions should be collected to validate the study in this paper. Exploring data on the impact of multiple factors on the spread of the epidemic and long-time series epidemics is an important research component to improve the accuracy of risk assessment and prediction.

5. Conclusions

This study, taking China as an example, analyzes the spatiotemporal characteristics of the evolution of COVID-19 risk over 12 weeks from 20 January to 12 April 2020, in 367 prefecture-level cities, focusing on the perspective of spatial hierarchy differences. We propose a COVID-19 risk assessment index based on the head/tail break method to identify the spatiotemporal distribution of high-risk cities. In this research, we observed that both COVID-19 cases and the population density exhibit heavy-tailed distribution characteristics. We found that the hierarchy structure of COVID-19 cases in China consistently approached but never surpassed the hierarchy structure of the population density. Consequently, the spatiotemporal transmission of COVID-19 is influenced by the underlying population. Furthermore, the calculated risk assessment values have practical significance, as they increase with rising incidence rates. It is worth noting that, even with lower incidence rates, the influence of other potential factors like population mobility can place an area at high risk.
In summary, these findings enhance the overall understanding of the space-time risk of COVID-19 in China and provide a scientific basis for policymakers. The risk measurement model we propose can describe the spatiotemporal patterns of COVID-19 risk and offers a new perspective for assessing the risk level of epidemic transmission, which holds practical significance. However, this study did not delve into more detailed influencing factors of epidemic risk, and the time series is relatively short. Therefore, future research should focus on longer time series and a comprehensive consideration of factors influencing changes in epidemic risk.

Author Contributions

Conceptualization, Bisong Hu and Shuhua Qi; methodology, Bisong Hu; software, Tingting Wu and Bisong Hu; validation, Bisong Hu and Jin Luo; formal analysis, Tingting Wu, Jin Luo and Bisong Hu; investigation, Shuhua Qi and Jin Luo; resources, Shuhua Qi and Bisong Hu; data curation, Tingting Wu, Bisong Hu and Jin Luo; writing—original draft preparation, Tingting Wu; writing—review and editing, Tingting Wu and Bisong Hu; visualization, Tingting Wu, Jin Luo and Shuhua Qi; supervision, Bisong Hu; funding acquisition, Bisong Hu. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (grant number 42061075) and the Graduate Innovation Fund of Jiangxi Normal University, China (No. YJS2022013). The funders have no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Data Availability Statement

The spatiotemporal data of daily new confirmed cases were collected from multiple official and publicly available sources, and had been comparatively verified the epidemic data through the public platform of the 2019-nCoV-infected pneumonia epidemic (http://2019ncov.chinacdc.cn/2019-nCoV/). The dataset of China’s urban populations is owned by China Population Census Yearbook 2020 (http://www.stats.gov.cn/tjsj/pcsj/rkpc/7rp/zk/indexce.htm). The head/tail breaks software can be found at https://en.wikipedia.org/wiki/head/tail_breaks.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Appendix A

This appendix includes a statistical table of the results of head/tail breaks calculations for 12-week COVID-19 cases.
Table A1. Results of head/tail break COVID-19 cases calculations.
Table A1. Results of head/tail break COVID-19 cases calculations.
DateCityMeanHeadTail%Head
Week13676.967029719%
7029.99175324%
1787.0651229%
5188.401420%
Week236739.313832910%
38312.7173118%
711551614%
Week336762.88303378%
30669.2332710%
347981233%
Week436785.75153524%
151971.331146%
Week536718.73163514%
16412.631156%
Week63677.8463612%
64671517%
Week73671.9393582%
9781811%
Week83670.31153524%
157.4731220%
328.671233%
Week93670.633633110%
366.4243211%
Week103671.02283398%
2812.6862221%
Week113670.593733010%
375.8482922%
820.253538%
Week123671.04123553%
1230.833925%
3107.331233%

References

  1. Boldog, P.; Tekeli, T.; Vizi, Z.; Dénes, A.; Bartha, F.A.; Röst, G. Risk Assessment of Novel Coronavirus COVID-19 Outbreaks Outside China. J. Clin. Med. 2020, 9, 571. [Google Scholar] [CrossRef] [PubMed]
  2. Carballada, A.M.; Balsa-Barreiro, J. Geospatial Analysis and Mapping Strategies for Fine-Grained and Detailed COVID-19 Data with GIS. ISPRS Int. J. Geo-Inf. 2021, 10, 602. [Google Scholar] [CrossRef]
  3. Chen, S.; Yang, J.; Yang, W.; Wang, C.; Bärnighausen, T. COVID-19 Control in China during Mass Population Movements at New Year. Lancet 2020, 395, 764–766. [Google Scholar] [CrossRef] [PubMed]
  4. Wu, F.; Zhao, S.; Yu, B.; Chen, Y.-M.; Wang, W.; Song, Z.-G.; Hu, Y.; Tao, Z.-W.; Tian, J.-H.; Pei, Y.-Y. A New Coronavirus Associated with Human Respiratory Disease in China. Nature 2020, 579, 265–269. [Google Scholar] [CrossRef] [PubMed]
  5. Navinya, C.; Patidar, G.; Phuleria, H.C. Examining Effects of the COVID-19 National Lockdown on Ambient Air Quality across Urban India. Aerosol Air Qual. Res. 2020, 20, 1759–1771. [Google Scholar] [CrossRef]
  6. Leung, K.; Wu, J.T.; Liu, D.; Leung, G.M. First-Wave COVID-19 Transmissibility and Severity in China Outside Hubei after Control Measures, and Second-Wave Scenario Planning: A Modelling Impact Assessment. Lancet 2020, 395, 1382–1393. [Google Scholar] [CrossRef]
  7. Zhang, C.; Pei, Y.; Li, J.; Qin, Q.; Yue, J. Application of Luojia 1-01 Nighttime Images for Detecting the Light Changes for the 2019 Spring Festival in Western Cities, China. Remote Sens. 2020, 12, 1416. [Google Scholar] [CrossRef]
  8. Wei, Y.; Lu, Y.; Xia, L.; Yuan, X.; Li, G.; Li, X.; Liu, L.; Liu, W.; Zhou, P.; Wang, C.-Y. Analysis of 2019 Novel Coronavirus Infection and Clinical Characteristics of Outpatients: An Epidemiological Study from a Fever Clinic in Wuhan, China. J. Med. Virol. 2020, 92, 2758–2767. [Google Scholar] [CrossRef]
  9. Chan, J.F.-W.; Yuan, S.; Kok, K.-H.; To, K.K.-W.; Chu, H.; Yang, J.; Xing, F.; Liu, J.; Yip, C.C.-Y.; Poon, R.W.-S. A Familial Cluster of Pneumonia Associated with the 2019 Novel Coronavirus Indicating Person-to-Person Transmission: A Study of a Family Cluster. Lancet 2020, 395, 514–523. [Google Scholar] [CrossRef]
  10. Backer, J.A.; Klinkenberg, D.; Wallinga, J. Incubation Period of 2019 Novel Coronavirus (2019-nCoV) Infections among Travellers from Wuhan, China, 20–28 January 2020. Eurosurveillance 2020, 25, 2000062. [Google Scholar] [CrossRef]
  11. Wu, J.T.; Leung, K.; Leung, G.M. Nowcasting and Forecasting the Potential Domestic and International Spread of the 2019-nCoV Outbreak Originating in Wuhan, China: A Modelling Study. Lancet 2020, 395, 689–697. [Google Scholar] [CrossRef] [PubMed]
  12. Liu, Y.; Yang, D.Y.; Dong, G.P.; Zhang, H.; Miao, C.H. The Spatio-Temporal Spread Characteristics of COVID-19 and Risk Assessment Based on Population Movement in Henan Province: Analysis Based on 1243 Case Reports. Econ. Geogr. 2020, 40, 24–32. [Google Scholar] [CrossRef]
  13. Yang, K.; Qiao, G.; Li, J.; Chen, H.; Xin, X.; Zhu, Y. Epidemic of Coronavirus Disease 2019 in Zhejiang Province, China: Characteristics of Spatial Diffusion and Impact Factors. Chin. J. Virol. 2021, 37, 9. [Google Scholar] [CrossRef]
  14. Wang, J.; Du, D.; Wei, Y.; Yang, H.H. The Development of COVID-19 in China: Spatial Diffusion and Geographical Pattern. Geogr. Res. 2020, 39, 1450–1462. [Google Scholar] [CrossRef]
  15. Yang, D.; Shi, W.; Yu, Y.; Chen, L.; Chen, R. Analysis of the Spatial Distribution and Associated Factors of the Transmission Locations of COVID-19 in the First Four Waves in Hong Kong. ISPRS Int. J. Geo-Inf. 2023, 12, 111. [Google Scholar] [CrossRef]
  16. Hu, J.X.; Liu, T.; Xiao, J.P.; He, G.H.; Rong, Z.H.; Yin, L.H.; Wan, D.H.; Zeng, W.L.; Gong, D.X.; Guo, L.C. Risk Assessment and Early Warning of Imported COVID-19 in Guangdong Province. Chin. J. Epidemiol. 2020, 41, 657–661. [Google Scholar] [CrossRef]
  17. Pang, S.; Xiao, J.; Fang, Y. Risk Assessment Model and Application of COVID-19 Virus Transmission in Closed Environments at Sea. Sustain. Cities Soc. 2021, 74, 103245. [Google Scholar] [CrossRef]
  18. Ranjan, A.K.; Patra, A.K.; Gorai, A.K. Effect of Lockdown Due to SARS COVID-19 on Aerosol Optical Depth (AOD) over Urban and Mining Regions in India. Sci. Total Environ. 2020, 745, 141024. [Google Scholar] [CrossRef]
  19. Chirisa, I.; Mutambisi, T.; Chivenge, M.; Mabaso, E.; Matamanda, A.R.; Ncube, R. The Urban Penalty of COVID-19 Lockdowns across the Globe: Manifestations and Lessons for Anglophone Sub-Saharan Africa. GeoJournal 2022, 87, 815–828. [Google Scholar] [CrossRef]
  20. He, X.; Zhou, C.; Wang, Y.; Yuan, X. Risk Assessment and Prediction of COVID-19 Based on Epidemiological Data from Spatiotemporal Geography. Front. Environ. Sci. 2021, 9, 634156. [Google Scholar] [CrossRef]
  21. Luo, M.; Qin, S.; Tan, B.; Cai, M.; Yue, Y.; Xiong, Q. Population Mobility and the Transmission Risk of the COVID-19 in Wuhan, China. ISPRS Int. J. Geo-Inf. 2021, 10, 395. [Google Scholar] [CrossRef]
  22. Gomes, D.S.; Andrade, L.A.; Ribeiro, C.J.N.; Peixoto, M.V.S.; Lima, S.V.M.A.; Duque, A.M.; Cirilo, T.M.; Góes, M.A.O.; Lima, A.G.C.F.; Santos, M.B.; et al. Risk Clusters of COVID-19 Transmission in Northeastern Brazil: Prospective Space–Time Modelling. Epidemiol. Infect. 2020, 148, e188. [Google Scholar] [CrossRef] [PubMed]
  23. Coccia, M. An Index to Quantify Environmental Risk of Exposure to Future Epidemics of the COVID-19 and Similar Viral Agents: Theory and Practice. Environ. Res. 2020, 191, 110155. [Google Scholar] [CrossRef] [PubMed]
  24. Du, Z.; Wang, L.; Cauchemez, S.; Xu, X.; Wang, X.; Cowling, B.J.; Meyers, L.A. Risk for Transportation of Coronavirus Disease from Wuhan to Other Cities in China. Emerg. Infect. Dis. 2020, 26, 1049. [Google Scholar] [CrossRef] [PubMed]
  25. Hâncean, M.-G.; Perc, M.; Lerner, J. Early Spread of COVID-19 in Romania: Imported Cases from Italy and Human-to-Human Transmission Networks. R. Soc. Open Sci. 2020, 7, 200780. [Google Scholar] [CrossRef] [PubMed]
  26. Bontempi, E.; Vergalli, S.; Squazzoni, F. Understanding COVID-19 Diffusion Requires an Interdisciplinary, Multi-Dimensional Approach. Environ. Res. 2020, 188, 109814. [Google Scholar] [CrossRef]
  27. Chen, Y.-C.; Lu, P.-E.; Chang, C.-S.; Liu, T.-H. A Time-Dependent SIR Model for COVID-19 with Undetectable Infected Persons. IEEE Trans. Netw. Sci. Eng. 2020, 7, 3279–3294. [Google Scholar] [CrossRef]
  28. Cooper, I.; Mondal, A.; Antonopoulos, C.G. A SIR Model Assumption for the Spread of COVID-19 in Different Communities. Chaos Solitons Fractals 2020, 139, 110057. [Google Scholar] [CrossRef]
  29. Lai, S.; Ruktanonchai, N.W.; Zhou, L.; Prosper, O.; Luo, W.; Floyd, J.R.; Wesolowski, A.; Santillana, M.; Zhang, C.; Du, X. Effect of Non-Pharmaceutical Interventions to Contain COVID-19 in China. Nature 2020, 585, 410–413. [Google Scholar] [CrossRef]
  30. Chang, S.; Pierson, E.; Koh, P.W.; Gerardin, J.; Redbird, B.; Grusky, D.; Leskovec, J. Mobility Network Models of COVID-19 Explain Inequities and Inform Reopening. Nature 2021, 589, 82–87. [Google Scholar] [CrossRef]
  31. Annas, S.; Pratama, M.I.; Rifandi, M.; Sanusi, W.; Side, S. Stability Analysis and Numerical Simulation of SEIR Model for Pandemic COVID-19 Spread in Indonesia. Chaos Solitons Fractals 2020, 139, 110072. [Google Scholar] [CrossRef] [PubMed]
  32. Basnarkov, L. SEAIR Epidemic Spreading Model of COVID-19. Chaos Solitons Fractals 2021, 142, 110394. [Google Scholar] [CrossRef] [PubMed]
  33. Chan, T.-L.; Yuan, H.-Y.; Lo, W.-C. Modeling Covid-19 Transmission Dynamics with Self-Learning Population Behavioral Change. Front. Public Health 2021, 9, 2136. [Google Scholar] [CrossRef] [PubMed]
  34. Hu, B.; Ning, P.; Qiu, J.; Tao, V.; Devlin, A.T.; Chen, H.; Wang, J.; Lin, H. Modeling the Complete Spatiotemporal Spread of the COVID-19 Epidemic in Mainland China. Int. J. Infect. Dis. 2021, 110, 247–257. [Google Scholar] [CrossRef] [PubMed]
  35. Mandal, M.; Jana, S.; Nandi, S.K.; Khatua, A.; Adak, S.; Kar, T.K. A Model Based Study on the Dynamics of COVID-19: Prediction and Control. Chaos Solitons Fractals 2020, 136, 109889. [Google Scholar] [CrossRef] [PubMed]
  36. Yang, Q.; Yi, C.; Vajdi, A.; Cohnstaedt, L.W.; Wu, H.; Guo, X.; Scoglio, C.M. Short-Term Forecasts and Long-Term Mitigation Evaluations for the COVID-19 Epidemic in Hubei Province, China. Infect. Dis. Model. 2020, 5, 563–574. [Google Scholar] [CrossRef]
  37. Hasan, A.; Putri, E.R.; Susanto, H.; Nuraini, N. Data-Driven Modeling and Forecasting of COVID-19 Outbreak for Public Policy Making. ISA Trans. 2022, 124, 135–143. [Google Scholar] [CrossRef]
  38. Song, J.; Xie, H.; Gao, B.; Zhong, Y.; Gu, C.; Choi, K.-S. Maximum Likelihood-Based Extended Kalman Filter for COVID-19 Prediction. Chaos Solitons Fractals 2021, 146, 110922. [Google Scholar] [CrossRef]
  39. Alotaibi, N.; Elbatal, I.; Almetwally, E.M.; Alyami, S.A.; Al-Moisheer, A.S.; Elgarhy, M. Truncated Cauchy Power Weibull-G Class of Distributions: Bayesian and Non-Bayesian Inference Modelling for COVID-19 and Carbon Fiber Data. Mathematics 2022, 10, 1565. [Google Scholar] [CrossRef]
  40. Jiang, B.; de Rijke, C. A Power-Law-Based Approach to Mapping COVID-19 Cases in the United States. Geo-Spat. Inf. Sci. 2021, 24, 333–339. [Google Scholar] [CrossRef]
  41. Ren, H.; Zhao, L.; Zhang, A.; Song, L.; Liao, Y.; Lu, W.; Cui, C. Early Forecasting of the Potential Risk Zones of COVID-19 in China’s Megacities. Sci. Total Environ. 2020, 729, 138995. [Google Scholar] [CrossRef] [PubMed]
  42. Pang, S.; Wu, J.; Lu, Y. Thermodynamic Imaging Calculation Model on COVID-19 Transmission and Epidemic Cities Risk Level Assessment—Data from Hubei in China. Pers. Ubiquitous Comput. 2023, 27, 715–731. [Google Scholar] [CrossRef] [PubMed]
  43. Oh, Y.; Park, S.; Ye, J.C. Deep Learning COVID-19 Features on CXR Using Limited Training Data Sets. IEEE Trans. Med. Imaging 2020, 39, 2688–2700. [Google Scholar] [CrossRef] [PubMed]
  44. Lalmuanawma, S.; Hussain, J.; Chhakchhuak, L. Applications of Machine Learning and Artificial Intelligence for Covid-19 (SARS-CoV-2) Pandemic: A Review. Chaos Solitons Fractals 2020, 139, 110059. [Google Scholar] [CrossRef] [PubMed]
  45. Dandekar, R.; Rackauckas, C.; Barbastathis, G. A Machine Learning-Aided Global Diagnostic and Comparative Tool to Assess Effect of Quarantine Control in COVID-19 Spread. Patterns 2020, 1, 100145. [Google Scholar] [CrossRef] [PubMed]
  46. Kraemer, M.U.; Yang, C.-H.; Gutierrez, B.; Wu, C.-H.; Klein, B.; Pigott, D.M.; Open COVID-19 Data Working Group; Du Plessis, L.; Faria, N.R.; Li, R.; et al. The Effect of Human Mobility and Control Measures on the COVID-19 Epidemic in China. Science 2020, 368, 493–497. [Google Scholar] [CrossRef]
  47. Jia, J.S.; Lu, X.; Yuan, Y.; Xu, G.; Jia, J.; Christakis, N.A. Population Flow Drives Spatio-Temporal Distribution of COVID-19 in China. Nature 2020, 582, 389–394. [Google Scholar] [CrossRef]
  48. Habibi, R.; Alesheikh, A.A.; Bayat, S. An Event-Based Model and a Map Visualization Approach for Spatiotemporal Association Relations Discovery of Diseases Diffusion. Sustain. Cities Soc. 2022, 87, 104187. [Google Scholar] [CrossRef]
  49. Moftakhar, L.; Mozhgan, S.; Safe, M.S. Exponentially Increasing Trend of Infected Patients with COVID-19 in Iran: A Comparison of Neural Network and ARIMA Forecasting Models. Iran. J. Public Health 2020, 49, 92. [Google Scholar] [CrossRef]
  50. Xiong, Y.; Wang, Y.; Chen, F.; Zhu, M. Spatial Statistics and Influencing Factors of the COVID-19 Epidemic at Both Prefecture and County Levels in Hubei Province, China. Int. J. Environ. Res. Public Health 2020, 17, 3903. [Google Scholar] [CrossRef]
  51. Tang, W.; Liao, H.; Marley, G.; Wang, Z.; Cheng, W.; Wu, D.; Yu, R. The Changing Patterns of Coronavirus Disease 2019 (COVID-19) in China: A Tempogeographic Analysis of the Severe Acute Respiratory Syndrome Coronavirus 2 Epidemic. Clin. Infect. Dis. 2020, 71, 818–824. [Google Scholar] [CrossRef] [PubMed]
  52. Chen, H.; Huang, X.; Li, Z. A Content Analysis of Chinese News Coverage on COVID-19 and Tourism. Curr. Issues Tour. 2022, 25, 198–205. [Google Scholar] [CrossRef]
  53. Liu, Q.; Sha, D.; Liu, W.; Houser, P.; Zhang, L.; Hou, R.; Lan, H.; Flynn, C.; Lu, M.; Hu, T. Spatiotemporal Patterns of COVID-19 Impact on Human Activities and Environment in Mainland China Using Nighttime Light and Air Quality Data. Remote Sens. 2020, 12, 1576. [Google Scholar] [CrossRef]
  54. Iqbal, N.; Fareed, Z.; Shahzad, F.; He, X.; Shahzad, U.; Lina, M. The Nexus between COVID-19, Temperature and Exchange Rate in Wuhan City: New Findings from Partial and Multiple Wavelet Coherence. Sci. Total Environ. 2020, 729, 138916. [Google Scholar] [CrossRef] [PubMed]
  55. Wang, W.; Tang, J.; Wei, F. Updated Understanding of the Outbreak of 2019 Novel Coronavirus (2019-nCoV) in Wuhan, China. J. Med. Virol. 2020, 92, 441–447. [Google Scholar] [CrossRef] [PubMed]
  56. Lighter, J.; Phillips, M.; Hochman, S.; Sterling, S.; Johnson, D.; Francois, F.; Stachel, A. Obesity in Patients Younger than 60 Years is a Risk Factor for COVID-19 Hospital Admission. Clin. Infect. Dis. 2020, 71, 896–897. [Google Scholar] [CrossRef] [PubMed]
  57. Deng, X.; Yang, J.; Wang, W.; Wang, X.; Zhou, J.; Chen, Z.; Li, J.; Chen, Y.; Yan, H.; Zhang, J. Case Fatality Risk of the First Pandemic Wave of Coronavirus Disease 2019 (COVID-19) in China. Clin. Infect. Dis. 2021, 73, e79–e85. [Google Scholar] [CrossRef]
  58. Paez, A.; Lopez, F.A.; Menezes, T.; Cavalcanti, R.; da Rocha Pitta, M.G. A Spatio-temporal Analysis of the Environmental Correlates of COVID-19 Incidence in Spain. Geogr. Anal. 2021, 53, 397–421. [Google Scholar] [CrossRef]
  59. Zhu, Y.; Xie, J.; Huang, F.; Cao, L. The Mediating Effect of Air Quality on the Association between Human Mobility and COVID-19 Infection in China. Environ. Res. 2020, 189, 109911. [Google Scholar] [CrossRef]
  60. Sinha, I.P.; Lee, A.R.; Bennett, D.; McGeehan, L.; Abrams, E.M.; Mayell, S.J.; Harwood, R.; Hawcutt, D.B.; Gilchrist, F.J.; Auth, M.K. Child Poverty, Food Insecurity, and Respiratory Health during the COVID-19 Pandemic. Lancet Respir. Med. 2020, 8, 762–763. [Google Scholar] [CrossRef]
  61. Oronce, C.I.A.; Scannell, C.A.; Kawachi, I.; Tsugawa, Y. Association between State-Level Income Inequality and COVID-19 Cases and Mortality in the USA. J. Gen. Intern. Med. 2020, 35, 2791–2793. [Google Scholar] [CrossRef] [PubMed]
  62. Jiang, B. Head/Tail Breaks: A New Classification Scheme for Data with a Heavy-Tailed Distribution. Prof. Geogr. 2013, 65, 482–494. [Google Scholar] [CrossRef]
  63. Jiang, B.; Ren, Z. Geographic Space as a Living Structure for Predicting Human Activities Using Big Data. Int. J. Geogr. Inf. Sci. 2019, 33, 764–779. [Google Scholar] [CrossRef]
  64. Jiang, B. Head/Tail Breaks for Visualization of City Structure and Dynamics. Cities 2015, 43, 69–77. [Google Scholar] [CrossRef]
  65. Gao, P.; Liu, Z.; Liu, G.; Zhao, H.; Xie, X. Unified Metrics for Characterizing the Fractal Nature of Geographic Features. Ann. Am. Assoc. Geogr. 2017, 107, 1315–1331. [Google Scholar] [CrossRef]
  66. Ma, D.; Osaragi, T.; Oki, T.; Jiang, B. Exploring the Heterogeneity of Human Urban Movements Using Geo-Tagged Tweets. Int. J. Geogr. Inf. Sci. 2020, 34, 2475–2496. [Google Scholar] [CrossRef]
  67. Clauset, A.; Shalizi, C.R.; Newman, M.E. Power-Law Distributions in Empirical Data. SIAM Rev. 2009, 51, 661–703. [Google Scholar] [CrossRef]
  68. Jiang, B.; Liu, X. Scaling of Geographic Space from the Perspective of City and Field Blocks and Using Volunteered Geographic Information. Int. J. Geogr. Inf. Sci. 2012, 26, 215–229. [Google Scholar] [CrossRef]
  69. Jiang, B.; Yin, J. Ht-Index for Quantifying the Fractal or Scaling Structure of Geographic Features. Ann. Assoc. Am. Geogr. 2014, 104, 530–540. [Google Scholar] [CrossRef]
  70. Liu, X.; Huang, H.; Jiang, Y.; Wang, T.; Xu, Y.; Abbaszade, G.; Schnelle-Kreis, J.; Zimmermann, R. Assessment of German Population Exposure Levels to PM10 Based on Multiple Spatial-Temporal Data. Environ. Sci. Pollut. Res. 2020, 27, 6637–6648. [Google Scholar] [CrossRef]
  71. Sun, K.; Chen, J.; Viboud, C. Early Epidemiological Analysis of the Coronavirus Disease 2019 Outbreak Based on Crowdsourced Data: A Population-Level Observational Study. Lancet Digit. Health 2020, 2, e201–e208. [Google Scholar] [CrossRef] [PubMed]
  72. Hellewell, J.; Abbott, S.; Gimma, A.; Bosse, N.I.; Jarvis, C.I.; Russell, T.W.; Munday, J.D.; Kucharski, A.J.; Edmunds, W.J.; Sun, F.; et al. Feasibility of Controlling COVID-19 Outbreaks by Isolation of Cases and Contacts. Lancet Glob. Health 2020, 8, e488–e496. [Google Scholar] [CrossRef] [PubMed]
  73. Wang, Z.; Yao, M.; Meng, C.; Claramunt, C. Risk Assessment of the Overseas Imported COVID-19 of Ocean-Going Ships Based on AIS and Infection Data. ISPRS Int. J. Geo-Inf. 2020, 9, 351. [Google Scholar] [CrossRef]
  74. Phelan, A.L.; Katz, R.; Gostin, L.O. The Novel Coronavirus Originating in Wuhan, China: Challenges for Global Health Governance. JAMA 2020, 323, 709–710. [Google Scholar] [CrossRef]
  75. Anderson, R.M.; Heesterbeek, H.; Klinkenberg, D.; Hollingsworth, T.D. How Will Country-Based Mitigation Measures Influence the Course of the COVID-19 Epidemic? Lancet 2020, 395, 931–934. [Google Scholar] [CrossRef]
Figure 1. Development of the COVID-19 epidemic in China ((a) shows the time-series variation in new cases of COVID-19 in China; (b) shows the spatial distribution of the cumulative number of confirmed cases of COVID-19 as of 12 April 2020).
Figure 1. Development of the COVID-19 epidemic in China ((a) shows the time-series variation in new cases of COVID-19 in China; (b) shows the spatial distribution of the cumulative number of confirmed cases of COVID-19 as of 12 April 2020).
Ijgi 12 00485 g001
Figure 2. Rank-size plot (note: the x-axis is the sort and the y-axis is the size distribution of the corresponding sorted data values. The mean is m and the corresponding sort is R(m), with the head (e.g., 13%) and the tail (e.g., 87%)).
Figure 2. Rank-size plot (note: the x-axis is the sort and the y-axis is the size distribution of the corresponding sorted data values. The mean is m and the corresponding sort is R(m), with the head (e.g., 13%) and the tail (e.g., 87%)).
Ijgi 12 00485 g002
Figure 3. The recursive flow of head/tail break (the 1001 numbers [1, 1/2, …, 1/1001] are divided into 5 levels, [1/134, 1/135, …, 1/1001], [1/25, 1/26, …, 1/133], [1/7, 1/8 …, 1/24], [1/3, 1/4, …, 1/6], [1, 1/2], with 5 inherent levels: L1, L2, L3, L4, L5).
Figure 3. The recursive flow of head/tail break (the 1001 numbers [1, 1/2, …, 1/1001] are divided into 5 levels, [1/134, 1/135, …, 1/1001], [1/25, 1/26, …, 1/133], [1/7, 1/8 …, 1/24], [1/3, 1/4, …, 1/6], [1, 1/2], with 5 inherent levels: L1, L2, L3, L4, L5).
Ijgi 12 00485 g003
Figure 4. Population density and the heavy-tailed distribution of COVID-19 confirmed cases ((a,b) show the first three levels of rank-size plots for population density and first-week COVID-19 confirmed case head/tail break results, respectively; (c) shows the power-law variation for 12 weeks of COVID-19 confirmed cases; (d,e) show the trend of power-law index α and goodness-of-fit p, respectively).
Figure 4. Population density and the heavy-tailed distribution of COVID-19 confirmed cases ((a,b) show the first three levels of rank-size plots for population density and first-week COVID-19 confirmed case head/tail break results, respectively; (c) shows the power-law variation for 12 weeks of COVID-19 confirmed cases; (d,e) show the trend of power-law index α and goodness-of-fit p, respectively).
Ijgi 12 00485 g004
Figure 5. Time-series variation in population density and ht index of COVID-19 confirmed cases.
Figure 5. Time-series variation in population density and ht index of COVID-19 confirmed cases.
Ijgi 12 00485 g005
Figure 6. Geographical distributions of the spatial hierarchy of population density and COVID-19 confirmed cases ((a) shows the hierarchical structure of the spatial hierarchy of population density; (bd) show the change in the hierarchical structure of the spatial hierarchy in the 1st, 6th, and 12th weeks, respectively).
Figure 6. Geographical distributions of the spatial hierarchy of population density and COVID-19 confirmed cases ((a) shows the hierarchical structure of the spatial hierarchy of population density; (bd) show the change in the hierarchical structure of the spatial hierarchy in the 1st, 6th, and 12th weeks, respectively).
Ijgi 12 00485 g006
Figure 7. Geographical distributions of high-risk cities (note: (al) represent the spatial distributions of high-risk cities from week 1 to week 12, respectively; the darker the colour, the higher the risk).
Figure 7. Geographical distributions of high-risk cities (note: (al) represent the spatial distributions of high-risk cities from week 1 to week 12, respectively; the darker the colour, the higher the risk).
Ijgi 12 00485 g007
Figure 8. Numbers of high-risk cities and statistics of the risk assessment indicator during 12 weeks (note: (ad) show the number of cities at risk, mean, standard deviation, and maximum risk value changes per week, respectively).
Figure 8. Numbers of high-risk cities and statistics of the risk assessment indicator during 12 weeks (note: (ad) show the number of cities at risk, mean, standard deviation, and maximum risk value changes per week, respectively).
Ijgi 12 00485 g008
Figure 9. Time-series variation in risk, incidence, and the ht index rank of population migration intensity in six cities ((af) represent: Beijing, Guangzhou, Wuhan, Shanghai, Wenzhou, Chongqing; blue indicates risk values R i , t , red indicates prevalence, black indicates population migration intensity ht index rank).
Figure 9. Time-series variation in risk, incidence, and the ht index rank of population migration intensity in six cities ((af) represent: Beijing, Guangzhou, Wuhan, Shanghai, Wenzhou, Chongqing; blue indicates risk values R i , t , red indicates prevalence, black indicates population migration intensity ht index rank).
Ijgi 12 00485 g009
Table 1. Population density and calculation of head/tail breaks of confirmed COVID-19 cases in week 1.
Table 1. Population density and calculation of head/tail breaks of confirmed COVID-19 cases in week 1.
CityMeanHeadTail%Head
Population density367403.7112624134%
126861.24309624%
301668.0582227%
83150.713538%
34666.171233%
First week COVID-19 confirmed case3676.967029719%
7029.99175324%
1787.0651229%
5188.41420%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wu, T.; Hu, B.; Luo, J.; Qi, S. A Head/Tail Breaks-Based Approach to Characterizing Space-Time Risks of COVID-19 Epidemic in China’s Cities. ISPRS Int. J. Geo-Inf. 2023, 12, 485. https://doi.org/10.3390/ijgi12120485

AMA Style

Wu T, Hu B, Luo J, Qi S. A Head/Tail Breaks-Based Approach to Characterizing Space-Time Risks of COVID-19 Epidemic in China’s Cities. ISPRS International Journal of Geo-Information. 2023; 12(12):485. https://doi.org/10.3390/ijgi12120485

Chicago/Turabian Style

Wu, Tingting, Bisong Hu, Jin Luo, and Shuhua Qi. 2023. "A Head/Tail Breaks-Based Approach to Characterizing Space-Time Risks of COVID-19 Epidemic in China’s Cities" ISPRS International Journal of Geo-Information 12, no. 12: 485. https://doi.org/10.3390/ijgi12120485

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop