Next Article in Journal
Periodic Waves and Ligaments on the Surface of a Viscous Exponentially Stratified Fluid in a Uniform Gravity Field
Next Article in Special Issue
Proving, Refuting, Improving—Looking for a Theorem
Previous Article in Journal
Stability Analysis of Delayed COVID-19 Models
Previous Article in Special Issue
Does Set Theory Really Ground Arithmetic Truth?
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Spatial Fuzzy C-Means Clustering Analysis of U.S. Presidential Election and COVID-19 Related Factors in the Rustbelt States in 2020

Department of International Business, Chung Yuan Christian University, Taoyuan City 320314, Taiwan
Axioms 2022, 11(8), 401; https://doi.org/10.3390/axioms11080401
Submission received: 29 June 2022 / Revised: 7 August 2022 / Accepted: 10 August 2022 / Published: 15 August 2022
(This article belongs to the Special Issue 10th Anniversary of Axioms: Logic)

Abstract

:
The rustbelt states play a key role in determining the vote turnout in the U.S. elections. The current study attempts to utilize the spatial fuzzy C-means method to analyze the U.S. presidential election in the rustbelt states in 2020. We intend to explore that the U.S. presidential election had related factors, including COVID-19-related factors, such as the mask-wearing percentage and the COVID-19 death tolls in each county of the rust belt states. Contrary to the related literature, the study uses education level, number of house units, unemployment rate, household income, COVID-19-related factors and the share of Republican’s votes in the presidential election. The results indicate that spatial generalized fuzzy C-means analysis has better clustering results than the C-means clustering method. Moreover, the COVID-19 death toll in each county did not affect the Republican’s vote share in the rustbelt states, while the mask-wearing behavior in some regions had a negative impact on the Republican’s vote share.
MSC:
03B52; 03C45

1. Introduction

The U.S. presidential election in 2020 was influenced by the COVID-19 pandemic, including increasing infections, death tolls, and lockdowns. The previous literature indicated that political polarization was aggravated due to intense fear during the disaster [1,2]. People tended to search for assuage by insisting on their conservative political viewpoints and supporting the ruling party, while other scholars believed that some voters would punish the political elite for worse management during the natural or man-made disaster. Since COVID-19-related policies were created in a very short period of time, without full deliberation, it was possible to arouse public discontent [3]. People were more supportive of their governments during the early stage of the COVID-19 pandemic [4]. However, the evaluations of the policies about the pandemic were influenced by two polarized mindsets. Some voters chose to punish the politicians for the conditions caused by the pandemic, which were out of their control, while some voters were attentive to the political elites’ reactions and determined their feelings accordingly [5].
The previous literature about the U.S. presidential election in 2020 focused on the effects of COVID-19 on the U.S. presidential election results. Hart (2021) stated that the COVID-19 pandemic seemed to have decreased the support for Trump among the Democrats, while it increased for independent voters [6]. Baccini et al. [7] pointed out that COVID-19-related factors negatively affected Donald Trump’s re-election, and the effect was stronger in urban areas. They also observed that COVID-19 had a positive effect on the voters’ mobilization for Joe Biden. The rustbelt states are traditionally “swing states” in the U.S. presidential elections, including Illinois, Wisconsin, Indiana, Michigan, Ohio, West Virginia, Pennsylvania, and New York. Geographical and racial divergences increased in the counties of rustbelt states in the past five years [8]. The geographical factors enable these divergences to become more visible, and people tend to live in more politically polarized conditions [9]. The voting results of rustbelt states have a pivotal influence on the whole country. However, there are fewer instances in the literature about the voting results’ analysis of the rustbelt states. Gimpel [10] pointed out that some counties in rustbelt states changed their support to the Democrats in the presential election in 2020. The influencing factors of the voting results need to be examined. In order to analyze the topic more thoroughly, we attempt to analyze the COVID-19 pandemic effects along with the regional factors’ influence, the related economic variables, and the Republican’s support rate in the 2020 U.S. presidential election.
The structure of this research is as follows: the Research Method Section presents our research design and related descriptive statistics of the variables. The Discussion Section presents the results of the research model. The research findings are listed in the Conclusions Section.

2. Methodology

2.1. Research Method

The current study used the spatial fuzzy C-means clustering method to analyze the influencing factors of COVID-19 on the U.S. presidential election. In order to explore the impacts of COVID-19 and other factors, such as social and geographical factors, as the mentioned in the Introduction, the study also used educational level, number of house units, unemployment rate, and household income variables to create the clustering. The previous literature utilized daily experience sampling (ESM) to analyze the impact of COVID-19 on employee uncertainty [11]. Di Nardo et al. (2019) utilized the literature review method to provide useful information about COVID-19 infection on neonates and children [12]. Regarding the fuzzy clustering approach, Indelicato et al. (2022) used the method with the fuzzy TOPSIS model to analyze the determinants of immigrants in Cuenca, Ecuador [13]. Compared to the COVID-19-related research about its effects on U.S. elections, the study considered spatial factors and attempted to describe the regional differences under the influence of these variables.

2.2. Data Description

The study explored the influencing factors of the pandemic on the 2020 U.S presidential election. The study used the Republican’s voting share (X1) in the U.S. presidential election in 2020 as one of the variables related to the U.S. presidential election. The data were obtained from the web repository (https://github.com/tonmcg/US_County_Level_Election_Results_08-20 (accessed on 6 August 2022)); it collected the 2020 election results at the county level, which were scraped from the results published by Fox News, Politico, and the New York Times.
In order to measure mask-wearing behavior in the rustbelt states (X2), the study used the dataset collected by the survey firm, Dynata. Dynata surveyed 250 thousand respondents in the U.S. between 2 and 14 July 2020. The survey asked the respondents whether or not they wore face masks often in public. The responses included “always”, “frequently”, “sometimes”, “rarely”, and “never”, according to the descending frequency.
The variables (X3, X4, X5, X6) were obtained from the dataset of the U.S. Census Bureau. These variables were released on a flow basis throughout each year.
The study also used the death toll (X7) before the U.S. presidential election as a COVID-19-related variable. Other variables included education level and household economic condition. The descriptive statistics of all the variables are listed in Table 1 and Table 2:

2.3. C-Means Clustering

Initially, the study used the classical C-means method to create the fuzzy unsupervised classification. The fuzziness degree (m) was set at 1.5 in order to obtain the satisfied results. The classical C-means method includes the following two equations. The first equation is the updated values of membership in each iteration of uik [14]:
u i k = ( | | x k v i | | 2 ) 1 m 1 j = 1 c ( | | x k v j | | 2 ) 1 m 1
The center of the cluster is as follows:
v i = k = 1 N u i k m ( x k ) k = 1 N u i k m
In Equations (1) and (2), xk represents the observation of k’s value, vi is the value of the center of the cluster i, c is the cluster number, and m is the index of fuzziness.

2.4. Fuzzy C-Means Clustering

Fuzzy C-means clustering is an algorithm that permits a data point to pertain to two or more clusters. Let X = {x1, x2, …, xn} represent an image with n pixels, where xi is the gray value of the ith pixel. The objective function of the standard FCM algorithm is as follows:
J = k = 1 K i = 1 n u k i m | | x i v k | | 2
In Equation (3), the center of the kth cluster is vk (1 ≤ k ≤ K), and uki (1 ≤ k ≤ K, 1 ≤ i ≤ n) is the membership degree function value of the ith pixel, which pertains to the kth cluster. uki also needs to meet the requirements of the following constraints:
k = 1 K u k i = 1 ,       u k i [ 0 , 1 ] ,       0 i = 1 n u k i n
In Equation (3), the distance between xi and vk is used in the Euclidean form, and parameter m (m > 1) is a weighting parameter that relates to the level of fuzziness and the resulting partition. The minimization of the objective function in Equation (3) can obtain the updated equations of the membership degree function uki and the cluster center vk as follows:
u k i = 1 i = 1 k ( | | x i v k | | 2 | | x i v l | | 2 ) 1 m 1
v k = i = 1 n u k i m x i i = 1 n u k i m
The goal of these functions is to obtain suitable clusters for the data points.

2.5. Spatial Fuzzy C-Means Clustering

Fuzzy C-means clustering (FCM) has shortcomings due to its sensitivity to noise. Some algorithms were developed to overcome this shortcoming by utilizing the spatial information obtained from the neighborhood window around each pixel. Mean spatial information and median spatial information are two prevalent types of local information. The mean spatial information of the ith pixel is denoted as follows [15]:
δ i = 1 | S i | p ϵ S i x p
In Equation (7), Si is the set of neighboring pixels in a window centered at the ith pixel, and |Si| represents its cardinality. The median spatial information can be represented as:
ε i = m e d i a n   { x p } ,   p ϵ S i
Most of the FCM algorithms utilize the above-mentioned local spatial information in the objective function; however, FCM algorithms with local spatial information can obtain a better image segmentation performance with a low noise level. The local spatial information obtained from the near pixels of a pixel is not efficient due to possible contamination. In fact, there are many pixels with a similar neighborhood configuration in an image. It is more beneficial to utilize pixels with a similar neighborhood configurations to the given pixel to obtain the spatial information than only using the neighboring pixels of the given pixel. Such types of spatial information can be taken as non-local spatial information. The non-local spatial information for the ith pixel x i ¯ is calculated by the following equation [16]:
x ¯ i = j w i r w i j x j
In Equation (9), ω i r represents the r × r search window centered at the ith pixel. The non-local spatial information of the ith pixel is computed by using the pixels in the window. The weight between the ith and jth pixels can be denoted as wij ( j w i r ) , 0 ≤ wij ≤ 1 and j w i r w i j = 1 . The weight wij is defined as follows:
w i j = 1 Z i e x p ( | | x ( N i ) x ( N j ) | | 2 , σ 2 / h 2 )
In Equation (10), h means the filtering degree parameter and directs the decreasing weight function wij, and Z i = j w i r e x p ( | | x ( N i ) x ( N j ) | | 2 , σ 2 / h 2 ) is the normalizing constant. The weight wij depends on the similarity between the ith and jth pixels. The similarity is computed by the Gaussian weighted Euclidean distance | | x ( N i ) x ( N j ) | | 2 , σ 2   . The positive term σ is the Euclidean distance, which means the standard deviation of the Gaussian kernel. x(Ni) is the gray level vector with an s × s square neighborhood Ni centered at ith pixel.
Fuzzy clustering algorithm with spatial information uses the spatial information for individual pixels to determine the spatial constant term, and then obtains the spatial constraint to the objective function of FCM.

3. Results

3.1. Fuzzy C-Means and Generalized Fuzzy C-Means Clustering

The study used the classical K-means to determine the number of clusters. According to Figure 1, the four clusters can explain almost 40% of the original data variance.
Then, the study used the “fclust” package of R language to analyze the quality of the classification [17]. The study also utilized the “geocmeans” package of the R language to compute the generalized version of the c-means algorithm [18]. The algorithm can accelerate convergence and obtain less fuzzy results by adjusting the membership matrix at each iteration. It needs an extra beta parameter controlling the effectiveness of the modification. The modification only influences the formula updating the membership matrix.
u k i = ( | | x k v j | | 2 β k ) 1 m 1 i = 1 c ( | | x k v j | | 2 β k ) 1 m 1
In Equation (11), β k = m i n ( | | x k v | | 2 ) and 0 ≤ β ≤ 1. In order to choose an adequate value for this parameter, the study sought all the possible values between 0 and 1 with a step of 0.05. The results of the related index were obtained according to the ascending β values in Table 3.
According to Table 1, the study chose beta = 0.8, maintained a satisfied silhouette index, increased the Xie and Beni index, and explained inertia. The results of GFCM (generalized version of fuzzy C-means clustering) and FCM are listed in Table 4.
The results indicate that the GFCM provides a less fuzzy solution (with higher explained inertia and lower partition entropy), but keeps a good silhouette index and a lower Xie and Beni index. The study created two membership matrices maps and the most likely group for each observation. The study used the function map clusters from geocmeans in R language. We set a threshold of 0.45. If an observation only obtained values below this probability in a membership matrix, it was marked as “undecided” (represented by transparency on the map).
In Figure 2, the left-hand-side graph was the fuzzy C-means clustering result. The right-hand-side graph was the generalized fuzzy C-means clustering result. We can observe that the right-hand-side graph had fewer undecided parts.

3.2. Spatial C-Means and Generalized C-Means

The study used the SFCM function of R language to execute spatial c-means clustering. The first step was to determine a spatial weight matrix indicating the observations that were neighbors and the strength of their relationship. The study attempted to use a basic queen neighbor matrix (built with the spdep package of R language). The matrix should be row-standardized to ensure that the interpretation of all the parameters remains clear.
The two following equations indicate how the functions renewing the condition of the membership matrix and the centers of the clusters are modified.
u i k = ( | | x k v i | | 2 + α | | x k ¯ v i | | 2 ) 1 m 1 j = 1 c ( | | x k v i | | 2 + α | | x k ¯ v i | | 2 ) 1 m 1
v i = k = 1 N u i k m ( x k + α x k ¯ ) ( 1 + α ) k = 1 N u i k m
In Equations (12) and (13), x ¯ is the lagged version of x, and α ≥ 0.
The SFCM (spatial fuzzy C-means) can be taken as a spatially smoothed version of the classical c-means, and alpha controls the degree of spatial smoothness. This smoothing can be taken as an attempt to reduce the spatial overfitting of the classical c-means.
The study chose the best alpha value in order to reduce spatial inconsistency as much as possible and to maintain a good classification quality. The relationship between the spatial inconsistency and alpha value is shown in Figure 3.
In Figure 3, the increasing alpha value results in the decrease in the spatial inconsistency.
In Figure 4, the explained inertia decreased when the alpha value increased and again followed an inverse function. The classification searched for a compromise between the original and lagged values. However, the loss was only 3% between alpha = 0 and alpha = 2.
According to Figure 5 and Figure 6, as a larger silhouette index means a better classification, and a smaller Xie and Beni index represents a better classification, the study intended to retain the alpha = 0.25 value to provide a good balance between spatial consistency and classification quality.

3.3. Spatial Generalized Fuzzy C-Means (SGFCM)

In order to facilitate the clustering process of the SGFCM method, we needed to determine the alpha and beta values of the following equation regarding the center of the clusters.
u i k = ( | | x k v i | | 2 β k + α | | x k ¯ v i | | 2 ) 1 m 1 j = 1 c ( | | x k v i | | 2 β k + α | | x k ¯ v i | | 2 ) 1 m 1
The study attempted to use the multiprocessing approach to select the suitable alpha and beta values. The impact of alpha and beta values on the various indices is shown as follows:
Figure 7 and Figure 8 indicate that some specific combinations of alpha and beta values generate good results in the range of 0.3 < alpha < 0.7 and 0.4 < beta < 0.6. Figure 9 shows that the selection of beta has no impact on spatial consistency.
Regarding Figure 7, Figure 8 and Figure 9, the study selected beta = 0.5 and alpha = 0.25, which obtained better results for all the indices considered. Based on the alpha and beta values, the study acquired the results of the SFCM and SGFCM results (see Table 5).
The results of the SGFCM are better concerning the semantic and spatial aspects due to the lower partition entropy, Xie Beni index, and Fukuyama Sugeno index, and higher values of other indices.
The SFCM and SGFCM clustering maps are listed as follows.
According to Figure 10, the right-hand-side graph is the SGFCM clustering map. The left-hand-side graph is the SFCM clustering map. We can observe that the undecided units are less on the SGFCM clustering map.

3.4. Comparison of the Four Algorithms

The study attempted to perform a thorough spatial analysis and compare the spatial consistency of the four classifications (FCM, GFCM, SFCM, SGFCM) (see Table 6).
The Moran I value according to the membership matrices were higher for SFCM and SGFCM, representing strongaer spatial structures in the classifications.
The study also checked that the values of spatial inconsistency for SGFCM were significantly lower than those of SFCM. The study used the previously mentioned 250 values obtained by permutations; we could calculate a pseudo p-value = 0.032 > 1/250 = 0.004. This means that the SGFCM algorithm did not have a predominant advantage over the SFCM algorithm. However, the SGFCM clustering map indicated that the undecided points were fewer than that of the SFCM.
We can observe that the undecided parts were fewer as compared with Figure 2 and Figure 10.

4. Discussion

The study attempted to utilize the spatial fuzzy C-means clustering method to analyze the relationship among COVID-19-related factors and the vote share of Republicans in the U.S. presidential election in the rustbelt states in 2020. The study found that spatial generalized fuzzy C-means clustering (SGFCM) produced better results compared to the other three algorithms according to Table 3. The study also found the SGFCM clustering graph in Figure 10 presented better results because the uncertain parts (areas that did not belong to any cluster) were fewer compared to the other clustering results shown in Figure 2.
The descriptive statistics of the four clusters (Table A1, Table A2, Table A3 and Table A4) are listed in the Appendix A. According to the four tables, we can conclude the four clusters are as follows:
(1)
First cluster: the cluster had lower X1 (mean < 0.5), higher X2, higher X4, lower X5, and higher X6 values. Other variables did not seem obvious. We can conclude that people in this region were not inclined to support the Republican candidate, often wore masks, had more high-school graduates or above, had a lower unemployment rate, and a higher income. The first cluster included a little part of southeastern Pennsylvania, New York state and other scatter parts of the rustbelt states.
(2)
Second cluster: The cluster had higher X1 (mean > 0.5), higher X2, lower X4, lower X5, and higher X6 values. Other variables did not seem obvious. We can conclude that people in this region were inclined to support the Republican candidate, often wore masks, had less high-school graduates, a lower unemployment rate, and higher income. The second cluster included the larger part of New York state, most part of Michigan and northern Illinois.
(3)
Third cluster: The cluster had higher X1 (mean > 0.5), lower X2, lower X4, higher X5, lower X6, and higher X7 values. This means that people in this region tended to support the Republican candidate, wore masks less frequently, had less high-school graduates or above, a higher unemployment rate, lower income, and higher COVID-19 death toll. The cluster included some parts of Kentucky, West Virginia and Ohio and other scatter parts of the rustbelt states.
(4)
Fourth cluster: The cluster had higher X1 (mean > 0.5), lower X2, lower X4, lower X5, higher X6, and higher X7 values. This means that people in this region tended to support the Republican candidate, wore masks less frequently, had less high-school graduates or above, a lower unemployment rate, higher income, and higher COVID-19 death toll. The cluster included the larger part of Indiana, Ohio and part of Illinois.
The results seem to slightly contrast with the previous literature. Warshaw et al. (2020) found that COVID-19 fatalities decreased the support for Donald Trump in the 2020 presidential election [19]. However, our results show that the third and fourth clusters in the rustbelt states have higher COVID-19 death tolls with higher Republican vote shares and residents less inclined to wear face masks. Meanwhile, the second cluster had higher Republican vote shares and the residents there often wore face masks, while the COVID-19 death toll seemed unimportant. We can conclude that the COVID-19 death toll in each county did not affect the Republican vote shares in the rustbelt states, while the mask-wearing behavior in some regions had a negative impact on the Republican vote shares.
According to Figure 11, we can observe that cluster 2 accounts for the largest area in the rustbelt states. Cluster 1 accounts for the smallest area. The clustering results indicate that the U.S. presidential election-related factors and COVID-19-related factors are closely related to the clustering results. It enables the researchers in the related field to conduct further studies.

5. Conclusions

The present study intended to use the spatial fuzzy C-means clustering to analyze the related factors of COVID-19 and the U.S. presidential election in the rustbelt states in 2020. The study found that the spatial generalized fuzzy C-means (SGFCM) method produced better clustering results. The SGFCM method divided the rustbelt states into four areas. The results imply that the COVID-19 death toll in each county did not affect the Republican vote shares in the rustbelt states, while the mask-wearing behavior in some regions had a negative impact on the Republican vote shares. It is worth conducting further research.

Funding

This research was supported by the Preliminary Research Resource Funding from Chung Yuan Christian University.

Institutional Review Board Statement

Not Applicable.

Informed Consent Statement

Not Applicable.

Data Availability Statement

The COVID-19-related data for the U.S. can be downloaded from https://github.com/nytimes/COVID-19-data (accessed on 6 August 2022). The U.S. presidential election results in each county can be downloaded from https://github.com/tonmcg/US_County_Level_Election_Results_08-20/blob/f9b5f335ad1c66a7eba681539db49eec0c22787b/2020_US_County_Level_Presidential_Results.csv (accessed on 6 August 2022). The education level and household economic condition can be downloaded from https://www.census.gov/ (accessed on 6 August 2022).

Acknowledgments

The author would like to show their gratitude for the preliminary research fund received by Chung Yuan Christian University.

Conflicts of Interest

The author declares no conflict of interest.

Appendix A

Table A1. Descriptive statistics for cluster 1.
Table A1. Descriptive statistics for cluster 1.
X1X2X3X4X5X6X7
Q50.2220.514289872.62.946,288.27
Q100.270.5496716,480.83.249,51519
Q250.370.64119841,7643.458,22245
Q500.4460.742379132,1273.866,27081
Q750.5330.788501211,5974.286,108103
Q900.6140.82596347,971.44.994,521153
Q950.6780.842632522,0615.4100,887165
Mean0.4480.71342.689168,655.43.90170,973.1580.35
Std0.1340.107187.555199,205.90.79317,662.7648.11
Table A2. Descriptive statistics for cluster 2.
Table A2. Descriptive statistics for cluster 2.
X1X2X3X4X5X6X7
Q50.4170.44949.44579.63.143,1189
Q100.4630.487895836.63.346,26215
Q250.5390.5419811,1163.849,76737
Q500.6050.61236820,2044.453,90177
Q750.6740.72351041,2294.960,121115
Q900.7290.7960868,5505.566,521155
Q950.7620.827633.6109,4625.773,006.8174.2
Mean0.5990.627356.19335,019.474.41555,596.3579.845
Std0.1050.119186.09862,713.130.8999592.19452.128
Table A3. Descriptive statistics for cluster 3.
Table A3. Descriptive statistics for cluster 3.
X1X2X3X4X5X6X7
Q50.5760.34129.424153.8430,9507
Q100.6240.3685632974.233,21813
Q250.6930.40915550724.938,17143
Q500.7470.47534183545.643,45781
Q750.7870.5451813,6706.448,182129
Q900.830.61160425,2217.451,812.2169
Q950.8560.641631.634,390.88.355,443.8195
Mean0.7340.481334.75714,615.755.74343,457.8488.095
Std0.0880.097199.11247,465.071.3778146.73857.705
Table A4. Descriptive statistics for cluster 4.
Table A4. Descriptive statistics for cluster 4.
X1X2X3X4X5X6X7
Q50.5550.285353184.22.741,799.29
Q100.6020.33604188.82.9844,91321
Q250.6730.39214669123.348,34249
Q500.7280.46228911,761452,79897
Q750.760.52947318,6894.557,705145
Q900.7890.58458533,791.45.163,827.4175.4
Q950.8090.627629.645,4965.4667,758193
Mean0.7070.459308.85618,494.644.00953,761.2798.385
Std0.0840.106190.40146,181.820.918948.19258.712

References

  1. Greenberg, J.; Pyszczynski, T.; Solomon, S.; Rosenblatt, A.; Veeder, M.; Kirkland, S.; Lyon, D. Evidence for terror management theory II: The effects of mortality salience on reactions to those who threaten or bolster the cultural worldview. J. Pers. Soc. Psychol. 1990, 58, 308. [Google Scholar] [CrossRef]
  2. Kosloff, S.; Greenberg, J.; Weise, D.; Solomon, S. The effects of mortality salience on political preferences: The roles of charisma and political orientation. J. Exp. Soc. Psychol. 2010, 46, 139–145. [Google Scholar] [CrossRef]
  3. Altiparmakis, A.; Bojar, A.; Brouard, S.; Foucault, M.; Kriesi, H.; Nadeau, R. Pandemic politics: Policy evaluations of government responses to COVID-19. West Eur. Polit. 2021, 44, 1159–1179. [Google Scholar] [CrossRef]
  4. Bol, D.; Giani, M.; Blais, A.; Loewen, P. The effect of COVID-19 lockdowns on political support: Some good news for democracy? Eur. J. Polit. Res. 2020, 60, 497–505. [Google Scholar] [CrossRef]
  5. Gasper, J.T.; Reeves, A. Make it rain? Retrospection and the attentive electorate in the context of natural disasters. Am. J. Pol. Sci. 2011, 55, 340–355. [Google Scholar] [CrossRef]
  6. Hart, J. Did the COVID-19 pandemic help or hurt Donald Trump’s political fortunes? PLoS ONE 2021, 16, e0247664. [Google Scholar] [CrossRef] [PubMed]
  7. Baccini, L.; Brodeur, A.; Weymouth, S. The COVID-19 pandemic and the 2020 US presidential election. J. Popul. Econ. 2021, 34, 739–767. [Google Scholar] [CrossRef] [PubMed]
  8. Panos, A. Reading about geography and race in the rural rustbelt: Mobilizing dis/affiliation as a practice of whiteness. Linguist. Educ. 2021, 65, 100955. [Google Scholar] [CrossRef]
  9. Gugushvili, A.; Koltai, J.; Stuckler, D.; Mckee, M. Populism, and pandemics. Int. J. Public Health 2020, 65, 721–722. [Google Scholar] [CrossRef] [PubMed]
  10. Gimpel, J.G. The 2020 election campaign was over quickly. Polit. Geogr. 2021, 102430. [Google Scholar] [CrossRef]
  11. Yoon, S.; McClean, S.T.; Chawla, N.; Kim, J.K.; Koopman, J.; Rosen, C.C.; Trougakos, J.P.; McCarthy, J.M. Working through an “infodemic”: The impact of COVID-19 news consumption on employee uncertainty and work behaviors. J. Appl. Psychol. 2021, 106, 501–517. [Google Scholar] [CrossRef] [PubMed]
  12. Di Nardo, M.; van Leeuwen, G.; Loreti, A.; Barbieri, M.A.; Guner, Y.; Locatelli, F.; Ranieri, V.M. A literature review of 2019 novel coronavirus (SARS-CoV2) infection in neonates and children. Pediatr. Res. 2021, 89, 1101–1108. [Google Scholar] [CrossRef] [PubMed]
  13. Martin, J.C.; Bustamante-Sánchez, N.S.; Indelicato, A. Analyzing the Main Determinants for Being an Immigrant in Cuenca (Ecuador) Based on a Fuzzy Clustering Approach. Axioms 2022, 11, 74. [Google Scholar] [CrossRef]
  14. Cai, W.; Chen, S.; Zhang, D. Fast and robust fuzzy c-means clustering algorithms incorporating local information for image segmentation. Pattern Recognit. 2007, 40, 825–838. [Google Scholar] [CrossRef]
  15. Zhao, F.; Jiao, L.; Liu, H. Kernel generalized fuzzy c-means clustering with spatial information for image segmentation. Digit. Signal Process. A Rev. J. 2013, 23, 184–199. [Google Scholar] [CrossRef]
  16. Zhao, F.; Liu, H.; Fan, J. A multiobjective spatial fuzzy clustering algorithm for image segmentation. Appl. Soft Comput. J. 2015, 30, 48–57. [Google Scholar] [CrossRef]
  17. Ferraro, M.B.; Giordani, P.; Serafini, A. Fclust: An R Package for Fuzzy Clustering. R J. 2019, 11, 1–18. [Google Scholar] [CrossRef]
  18. Gelb, J.; Apparicio, P. Apport de la classification floue c-means spatiale en géographie: Essai de taxinomie socio-résidentielle et environnementale à Lyon. Cybergeo 2021, 1–26. [Google Scholar] [CrossRef]
  19. Warshaw, C.; Vavreck, L.; Baxter-King, R. Fatalities from COVID-19 are reducing Americans’ support for Republicans at every level of federal office. Sci. Adv. 2020, 6, eabd8564. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Impact of the number of groups on the explained variance.
Figure 1. Impact of the number of groups on the explained variance.
Axioms 11 00401 g001
Figure 2. FCM and GFCM clusters.
Figure 2. FCM and GFCM clusters.
Axioms 11 00401 g002
Figure 3. Link between alpha value and spatial inconsistency.
Figure 3. Link between alpha value and spatial inconsistency.
Axioms 11 00401 g003
Figure 4. The relationship between the alpha value and explained inertia.
Figure 4. The relationship between the alpha value and explained inertia.
Axioms 11 00401 g004
Figure 5. Link between alpha and Xie and Beni index.
Figure 5. Link between alpha and Xie and Beni index.
Axioms 11 00401 g005
Figure 6. Link between alpha value and silhouette index.
Figure 6. Link between alpha value and silhouette index.
Axioms 11 00401 g006
Figure 7. Influence of beta and alpha values on silhouette index.
Figure 7. Influence of beta and alpha values on silhouette index.
Axioms 11 00401 g007
Figure 8. Influence of beta and alpha values on Xie and Beni index.
Figure 8. Influence of beta and alpha values on Xie and Beni index.
Axioms 11 00401 g008
Figure 9. Influence of beta and alpha values on spatial inconsistency.
Figure 9. Influence of beta and alpha values on spatial inconsistency.
Axioms 11 00401 g009
Figure 10. Most likely cluster and undecided units of SFCM and SGFCM.
Figure 10. Most likely cluster and undecided units of SFCM and SGFCM.
Axioms 11 00401 g010
Figure 11. Final cluster of SGFCM.
Figure 11. Final cluster of SGFCM.
Axioms 11 00401 g011
Table 1. All variables used for clustering.
Table 1. All variables used for clustering.
VariableMeaning
X1Republican’s share of votes in U.S. presidential election
X2The share of respondents who thought they wore face masks often
X3The number of housing units
X4The number of residents who were high-school graduates or above
X5Unemployment rate
X6Household income
X7Death toll of COVID-19 cases
Table 2. Descriptive statistics of all variables.
Table 2. Descriptive statistics of all variables.
StatisticNMeanSt.DevMin.Max.
X16690.6620.1270.1200.900
X26690.5360.1390.1900.880
X366952,630.64135,268.211072,204,019
X466934,032.2384,810.536161,314,995
X56694.5911.2732.40013.00
X666952,867.0712,235.3126,278115,301
X766971.175306.1705517
Table 3. Some indices with ascending β values.
Table 3. Some indices with ascending β values.
BetaSilhouette IndexXie and Beni IndexExplained Inertia
00.2872.4760.161
0.050.292.2820.171
0.10.2942.1130.181
0.150.2981.9640.191
0.20.31.830.201
0.250.3031.7060.212
0.30.3071.5840.223
0.350.3131.470.235
0.40.3151.3740.247
0.450.3151.2920.26
0.50.2921.4780.265
0.550.2891.410.277
0.60.2861.3490.289
0.650.2831.2950.301
0.70.2811.2490.313
0.750.2771.2110.325
0.80.2731.1820.337
0.850.2681.1630.349
0.90.2591.1570.361
0.950.2491.1720.371
10.2351.2960.374
Table 4. Comparison of the indices between GFCM and FCM.
Table 4. Comparison of the indices between GFCM and FCM.
GFCMFCM
Silhouette index0.2730.287
Partition entropy0.3230.951
Partition coeff0.8370.486
XieBeni index1.1822.476
Fukuyama Sugeno index1096.841706.23
Explained inertia0.3370.161
Table 5. Comparison of the indices between SFCM and SGFCM.
Table 5. Comparison of the indices between SFCM and SGFCM.
SFCMSGFCM
Silhouette index0.2190.319
Partition entropy1.0430.682
Partition coeff0.4310.633
XieBeni index5.0081.394
Fukuyama Sugeno index1824.581290.69
Explained inertia0.1340.248
sp consistency0.2760.262
Table 6. Moran I index for the columns of the membership matrices among the four algorithms.
Table 6. Moran I index for the columns of the membership matrices among the four algorithms.
FCMGFCMSFCMSGFCM
Cluster 10.6420.6020.7690.696
Cluster 20.3490.1870.5010.66
Cluster 30.6910.5950.8090.823
Cluster 40.2050.140.6740.73
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Wu, S. Spatial Fuzzy C-Means Clustering Analysis of U.S. Presidential Election and COVID-19 Related Factors in the Rustbelt States in 2020. Axioms 2022, 11, 401. https://doi.org/10.3390/axioms11080401

AMA Style

Wu S. Spatial Fuzzy C-Means Clustering Analysis of U.S. Presidential Election and COVID-19 Related Factors in the Rustbelt States in 2020. Axioms. 2022; 11(8):401. https://doi.org/10.3390/axioms11080401

Chicago/Turabian Style

Wu, Shianghau. 2022. "Spatial Fuzzy C-Means Clustering Analysis of U.S. Presidential Election and COVID-19 Related Factors in the Rustbelt States in 2020" Axioms 11, no. 8: 401. https://doi.org/10.3390/axioms11080401

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop