Next Article in Journal
Removal of Brilliant-Green Dye Using Carbon-Loaded Zinc Oxide Nanoparticles: A Comparative Isotherm Study
Previous Article in Journal
Recent Progress to Address the Challenges of Conductive Inks for E-Textiles
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Proceeding Paper

Optimizing Police Locations around Football Stadiums Based on a Multicriteria Unsupervised Clustering Analysis †

by
Antonio Marcos de Lima
1,*,
Thyago C. C. Nepomuceno
1,2,*,
Isaac Pergher
1,
Victor D. H. de Carvalho
3 and
Thiago Poleto
4
1
Núcleo de Tecnologia, Centro Acadêmico do Agreste, Federal University of Pernambuco, Caruaru 50670-901, Brazil
2
Dipartimento di Ingegneria Informatica, Automatica e Gestionale, Sapienza University of Rome, 00185 Rome, Italy
3
Technologies Axis, Campus do Sertão, Federal University of Alagoas, Delmiro Gouveia 57480-000, Brazil
4
Department of Business Administration, Institute for Applied Social Sciences, Federal University of Pará, Belém 66075-110, Brazil
*
Authors to whom correspondence should be addressed.
Presented at the 4th International Electronic Conference on Applied Sciences, 27 October–10 November 2023; Available online: https://asec2023.sciforum.net/.
Eng. Proc. 2023, 56(1), 275; https://doi.org/10.3390/ASEC2023-15230
Published: 26 October 2023
(This article belongs to the Proceedings of The 4th International Electronic Conference on Applied Sciences)

Abstract

:
This work proposes a methodology based on multicriteria decision aid (MCDA) and a cluster analysis to identify ideal locations for the installation of police facilities or vehicle parking and policing around stadiums in Recife, Brazil, during potential violent sports events (criminal occurrences from football supporters or fanbases). A K-means unsupervised clustering algorithm is used to group criminal data into homogeneous clusters based on their characteristics. Each type of criminal occurrence is linked to a single cluster. The optimal location is addressed based on the PROMETHEE method (Preference Ranking Organization Method for Enrichment Evaluation), allowing clusters to be organized into a hierarchy based on the number of facilities (N), the average distance (D) from the criminal occurrence to the associated cluster, and the coverage level (C), which is the proportion of crime occurring in a location less than 500 m from the associated cluster. Through a data analysis of crimes and violence in the region, this study seeks to identify patterns of criminal behaviour and high-risk areas to determine the most strategic locations for police units and enhance the public security decision-making process. The choice for the k parameters ranged from 1 to 30, incorporating all regions of the analysis, with a computational cost of 43 min of running time using an Intel Core i3-3217U (1800 GHz and 10 GB of RAM). This approach and methodology can be useful for supporting public security policies in the region and can contribute to reducing violence around stadiums. The empirical application can help guide public managers’ decisions regarding resource allocation and the implementation of more effective security policies, with the aim of ensuring a safer environment for fans and residents in the areas near stadiums.

1. Introduction

Football (soccer) is a social event that brings together people from different social, economic, and cultural backgrounds, representing a synthesis of multiple passions and objective and subjective determinations [1]. Nevertheless, not randomly, such events result in episodes of violence and misdemeanors in stadiums and their surroundings, especially in countries with a deep-rooted football culture [2]. Areas close to sports venues are often the scene of conflicts between rival fans, in addition to vandalism, theft, and other forms of violence. Some interesting works on the topic include [3,4] studies on the violent behavior of football supporters in Brazil.
The proper location of police units can be an important factor in preventing and reducing violence around stadiums. When there is a police unit close to the stadium, the police can act more quickly in case of conflicts or acts of violence. This can help ensure the safety of fans and residents in areas close to the stadium. Furthermore, a police presence can have a deterrent effect on potential offenders, reducing the risk of violence. This work seeks to identify strategic locations for installing police facilities, considering their quantity and using a hybrid spatial and multicriteria analysis method. In this work, a police vehicle containing up to four police officers is considered a police facility to provide more efficient and rapid responses to possible demands.
This work proposes a hybrid method using the PROMETHEE and K-means tools to identify possible strategic locations for installing police facilities depending on their quantity, improving public security and mitigating violence in regions around stadiums in Recife, Brazil. Such a hybrid method has a well-founded statistical base and is widely available in different languages and software. The justification for using unsupervised clustering is the need for an exploratory analysis without predefined labels or categories. In the case of crime data in Recife, defining explicit categories or classes may be challenging as criminal activities can be multifaceted and dynamic. In addition, unsupervised clustering methods are flexible and adaptable to different data types. In Recife, where several types of crimes are considered, this clustering approach identifies commonalities and differences across different crime categories and locations.
The next section offers a theoretical foundation for the employed methods. Section 3 is dedicated to the results and a discussion of the empirical application. Section 4 concludes the work with a summary of the results and discussion.

2. Methods

The facility location problem (FLP) is a topic of great interest in areas such as logistics, transport, and public service management where there is a need to decide where to install new service units or equipment. The objective is to decide in the best possible way where to locate a certain number of services to minimize costs and maximize the efficiency of the service provided [5].
The FLP is a complex problem involving several factors, such as the distance between facilities and customers, customer characteristics, facility capabilities, and the demand for services. The specialized literature presents several approaches to the solution of the problem, from mathematical models that seek an exact solution to the problem to heuristic algorithms that seek approximate solutions in a reasonable time. Many studies have considered methods to address the facility selection problem through the use of mathematical analyses and computer simulations [6].
The FLP solution can bring many benefits to society, such as improving the quality of services provided, reducing costs, optimizing resource distribution, and improving users’ well-being. Ideally, undesirable events (such as robberies, assaults, and vandalism) should be as far away from society as possible, but we have no control over them. Therefore, bringing desirable services closer to society is of interest. Many quantitative tools and information technologies are adequate for addressing socioeconomic issues and the problem of public security [7,8,9,10]. For the present study, the facility location problem is addressed by combining K-means clustering with multicriteria decision aid (the PROMETHEE model).

2.1. K-Means

K-means is a widely used unsupervised clustering algorithm for grouping data into homogeneous groups based on their characteristics. It is one of the most popular cluster classification algorithms due to its simplicity and efficiency [11].
This algorithm is iterative, starting with a random choice of k centroids representing each cluster’s center. Then, each data point is assigned to the cluster whose centroid is closest. Once all points have been assigned to a group, the centroids are recalculated based on the data points assigned to each group. This process is repeated until there are no more changes in assigning data points to groups or until the maximum number of iterations is reached.
Although K-means is a simple and fast algorithm, it has some limitations, such as sensitivity to choosing the number of k groups and the random initialization of centroids. However, there are techniques to deal with these problems, such as multiple random initialization and hierarchical sampling initialization. We combined this algorithm with the PROMEHTEE methodology for a more robust classification of vulnerable urban spaces.

2.2. PROMETHEE

The PROMETHEE method (Preference Ranking Organization Method for Enrichment Evaluation) is an approach that guides the decision-making process, allowing options to be organized in a hierarchy based on the preferences of the decision maker [12]. Developed by Brans et al. [13], this technique helps classify actions according to established criteria, allowing for a more precise and structured analysis of the available possibilities. Non-compensatory decision-making models such as PROMETHEE have been used well in the literature to evaluate job satisfaction and resource allocation [14,15].
The PROMETHEE method can be defined in two steps. In the first step, a comparison is made using preference functions of different types, such as difference functions and similarity functions. These comparisons generate a pre-order matrix that indicates the preference relationship between the alternatives in relation to each criterion individually. In a second step, the person responsible for making the decision assigns weights to the criteria according to priorities so that the most relevant criteria receive more significant weights.
It is necessary to assign weights to each criterion such that each criterion i must have a weight p i . Thus, the degree of overclassification given by the equation can be calculated as follows:
π a , b = i = 1 n p i F i a , b
where
n = decision criteria;
π a , b = outranking of alternative a over alternative b;
F i = preference function for criteria i ;
p i = weight for criteria i .
Based on the outranking degree of all alternatives, flows of alternatives can be calculated. There are three types of flow: positive flow, negative flow, and net flow. The positive flow + a measures how much an action is preferred over the other n − 1 alternatives. Analogously, the negative flow ( a ) tells us how preferable the n − 1 alternatives are. The net flow a subtracts the negative flow from the positive flow. This is basically how the alternative performs compared to the others. The greater the net flow, the more preferable the alternative. The equations used to calculate the flows are as follows:
+ a = 1 n 1 b a π a , b
a = 1 n 1 b a π b , a
a = + a a

3. Results and Discussion

The results of applying data-driven analytics and virtual learning technologies are crucial in optimizing resource provision and allocation, with valuable insights into the socioeconomic landscape [7,9,16]. By analyzing diverse datasets, including crime patterns, demographic information, and economic indicators, policymakers can make informed decisions to address specific challenges and allocate resources effectively.
The SEPLAG (State Planning and Management Secretariat) made available a database of CVP cases (property crimes) and their respective geolocations throughout Pernambuco. This base covers cases from 2018 to 2021 across the state. Among the crimes present in this database are thefts: theft occurring in bank institutions, theft on public transport, mugging and robberies, theft of cargo, burglary, and commercial burglary. See Nepomuceno and Costa [17], De Carvalho and Costa [8], and Borba et al. [18] for a better description of these geographic-referenced crimes and the context in which they operate.
Even without other types of crimes such as physical, sexual, psychological, or moral crimes, the existence of robbery data in general can be a useful indicator of occurrences of violence in urban areas. This is because robberies are often associated with acts of violence, such as physical aggression, threats, and intimidation. Figure 1 illustrates a plot of crimes in Recife through the ArcGis version 10.8 software, using OpenStreetMap as a base map.
The following Table, Table 1, reports the proportion of occurrences for each stadium based on the type of crime.
In order to minimize the problem of the initial choice of centroids, the multiple random initialization technique was adopted. This technique is the default in the Scikit library. The algorithm is executed with seeds from different centroids according to the specified number of times. The seed that generates the clusters with the smallest sum of the squared distances of the points to the cluster centroid is chosen. For this study, each cluster was executed with 5000 different seeds. This was achieved using a notebook equipped with an Intel(R) Core(TM) i3-3217U processor at a frequency of 1.80 GHz and with 10 GB of RAM memory, taking 1 hour and 43 minutes to complete.
Scenarios were created to identify the best allocation of facilities based on the number of facilities (N), average distance (D), and coverage level (C). The weights for each were provided by the ROC technique [19], considering the order of preference (under the criteria). Table 2 below reports the number of facilities (locations) for the five top-ranked alternatives for each stadium considering all six scenarios: the first scenario (C, D, N), second scenario (C, N, D), third scenario (D, C, N), fourth scenario (N, C, D), fifth scenario (D, N, C), and the sixth scenario (N, D, C).

4. Conclusions

The results report interesting prospects considering the three criteria and six options in order of preference (scenarios). Among the scenarios, only two obtained a balanced number of facilities. The remaining scenarios are not recommended options for a decision maker in any of the stages since these scenarios imply a high or insufficient number of facilities. Among the recommended scenarios at the Ilha do Retiro stadium, the second-best alternative solved the problem of intersections with rivers.
The proposed method has the advantage of simultaneously considering multiple criteria, such as socioeconomic factors, demographic data, and crime rates. This integration enhances the identification of patterns and relationships within crime data, aiding in comprehensive decision making for law enforcement and policymakers. Combining unsupervised clustering with multicriteria decision aid addresses subjectivity by balancing subjective criteria with data-driven insights. It also assists in identifying trade-offs in crime-prevention strategies and uncovers complex interactions within the data, leading to more effective crime analysis. This integration ensures a consistent and informed decision-making process in addressing crime challenges.
Applying this methodology in other Brazilian stadiums and considering other criteria not addressed in this work could be an interesting extension of the current analysis in addition to using data standardization to verify the impact on the generation of clusters. Furthermore, the possibility of applying a multicriteria system that serves multiple decision makers can also add value for the current approach.

Author Contributions

Conceptualization, A.M.d.L. and T.C.C.N.; methodology, A.M.d.L. and T.C.C.N.; software, A.M.d.L. and T.C.C.N.; validation, A.M.d.L. and T.C.C.N.; formal analysis, A.M.d.L. and T.C.C.N.; investigation, A.M.d.L. and T.C.C.N.; resources A.M.d.L. and T.C.C.N.; data curation, A.M.d.L. and T.C.C.N.; writing—original draft preparation, A.M.d.L. and T.C.C.N.; writing—review and editing, A.M.d.L. and T.C.C.N.; visualization, A.M.d.L. and T.C.C.N.; supervision, T.C.C.N., I.P., V.D.H.d.C. and T.P.; project administration, A.M.d.L. and T.C.C.N.; funding acquisition, T.C.C.N. All authors have read and agreed to the published version of the manuscript.

Funding

This research received support from the Brazilian National Council for Scientific and Technological Development (Conselho Nacional de Desenvolvimento Científico e Tecnoógico, CNPq).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data can be provided upon request to the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Murad, M. A violência e o Futebol: Dos Estudos Clássicos aos Dias de Hoje; Editora FGV: Rio de Janeiro, Brazil, 2007. [Google Scholar]
  2. Palhares, M.F.S.; Schwartz, G.M.; Teruel, A.P.; Santiago, D.R.P.; Trevisan, P.R.T.D.C. Lazer, agressividade e violência: Considerações sobre o comportamento das torcidas organizadas. Mot. Rev. De Educ. Física 2012, 18, 186–199. [Google Scholar] [CrossRef]
  3. Nepomuceno, T.C.C.; de Moura, J.A.; e Silva, L.C.; Costa, A.P.C.S. Alcohol and violent behavior among football spectators: An empirical assessment of Brazilian’s criminalization. Int. J. Law Crime Justice 2017, 51, 34–44. [Google Scholar] [CrossRef]
  4. Nepomuceno, T.C.C.; de Carvalho, V.D.H.; Silva, L.C.E.; de Moura, J.A.; Costa, A.P.C.S. Exploring the Bedouin syndrome in the football fan culture: Addressing the hooliganism phenomena through networks of violent behavior. Int. J. Environ. Res. Public Health 2022, 19, 9711. [Google Scholar] [CrossRef] [PubMed]
  5. Fernandes, S.; Captivo, M.E.; Clımaco, J. Um Sistema de apoio a Decisão para Análise de Problemas de Localizaçao Bicritério; Working Paper 2/05; Centro de Investigaçao Operacional, Faculdade de Ciências, Universidade de Lisboa: Lisbon, Portugal, 2005. (In Portuguese) [Google Scholar]
  6. Adeleke, O.J.; Olukanni, D.O. Facility location problems: Models, techniques, and applications in waste management. Recycling 2020, 5, 10. [Google Scholar] [CrossRef]
  7. Nepomuceno, T.C.C. Data-driven Analytics for Socioeconomic Challenges in a Contemporary World. Socioecon. Anal. 2023, 1, 1–4. [Google Scholar] [CrossRef]
  8. de Carvalho, V.D.H.; Costa, A.P.C.S. Exploring Text Mining and Analytics for Applications in Public Security: An in-depth dive into a Systematic Literature Review. Socioecon. Anal. 2023, 1, 5–55. [Google Scholar] [CrossRef]
  9. Poleto, T.; de Carvalho, V.D.H.; de Oliveira, R.C.P. Applying Spatial Decision Support for Maternal Mortality Analysis in a Brazilian State. Socioecon. Anal. 2023, 1, 92–102. [Google Scholar] [CrossRef]
  10. Boezi, G.; Cottarelli, A.; Manimendra, I.; Pagmozzi, D. The Efficiency of European Universities and the Role of Exogenous Factors. Socioecon. Anal. 2023, 1, 103–125. [Google Scholar] [CrossRef]
  11. Cabrera-Barona, P.F.; Bayón, M.; Durán, G.; Bonilla, A.; Mejía, V. Generating and mapping Amazonian urban regions using a geospatial approach. ISPRS Int. J. Geo-Inf. 2020, 9, 453. [Google Scholar] [CrossRef]
  12. Mareschal, B.; De Smet, Y. Visual PROMETHEE: Developments of the PROMETHEE & GAIA multicriteria decision aid methods. In Proceedings of the 2009 IEEE International Conference on Industrial Engineering and Engineering Management, Hong Kong, China, 8–11 December 2009; pp. 1646–1649. [Google Scholar]
  13. Brans, J.P.; Vincke, P.; Mareschal, B. How to select and how to rank projects: The PROMETHEE method. Eur. J. Oper. Res. 1986, 24, 228–238. [Google Scholar] [CrossRef]
  14. Nepomuceno, T.C.; Daraio, C.; Costa, A.P.C. Multicriteria ranking for the efficient and effective assessment of police departments. Sustainability 2021, 13, 4251. [Google Scholar] [CrossRef]
  15. Nepomuceno, T.C.C.; Daraio, C.; Costa, A.P.C.S. Combining multicriteria and directional distances to decompose non-compensatory measures of sustainable banking efficiency. Appl. Econ. Lett. 2020, 27, 329–334. [Google Scholar] [CrossRef]
  16. Ji, S.; Dai, M.; Wen, H.; Zhang, H.; Zhang, Z.; Xia, Z.; Zhu, J. An improved virtual metrology method in chemical vapor deposition systems via multitask gaussian processes and adaptive active learning. Int. J. Adv. Manuf. Technol. 2022, 122, 3149–3159. [Google Scholar] [CrossRef]
  17. Nepomuceno, T.C.C.; Costa, A.P.C.S. Spatial visualization on patterns of disaggregate robberies. Oper. Res. 2019, 19, 857–886. [Google Scholar] [CrossRef]
  18. Borba, B.F.D.C.; de Gusmão, A.P.H.; Clemente, T.R.N.; Nepomuceno, T.C.C. Optimizing Police Facility Locations Based on Cluster Analysis and the Maximal Covering Location Problem. Appl. Syst. Innov. 2022, 5, 74. [Google Scholar] [CrossRef]
  19. Barron, F.H. Selecting a best multiattribute alternative with partial information about attribute weights. Acta Psychol. 1992, 80, 91–103. [Google Scholar] [CrossRef]
Figure 1. CVP (property crime) occurrences: (a) total occurrences in Recife; (b) occurrences around stadiums (up to 1.5 km).
Figure 1. CVP (property crime) occurrences: (a) total occurrences in Recife; (b) occurrences around stadiums (up to 1.5 km).
Engproc 56 00275 g001
Table 1. Proportion of CVPs (property crimes).
Table 1. Proportion of CVPs (property crimes).
Types of CVPsArrudaIlha Do RetiroAflitos
Robberies13.5%2.9%6.7%
Mugging76.9%88.4%81.3%
Burglary3.8%1.4%0.0%
Theft and Robberies in Bus3.8%0.0%1.3%
Commercial Burglary1.9%5.9%10.7%
Theft and Robberies in Public Transport0%1.4%0%
Table 2. Best alternatives in each scenario.
Table 2. Best alternatives in each scenario.
Scenarios
Stadiums123456Ranking
Aflitos30 Locations11 Locations30 Locations7 Locations26 Locations1 Local1st
26 Locations10 Locations29 Locations8 Locations28 Locations2 Locations2nd
29 Locations12 Locations28 Locations9 Locations27 Locations7 Locations3rd
28 Locations13 Locations27 Locations10 Locations29 Locations3 Locations4th
27 Locations14 Locations26 Locations1 Locations30 Locations6 Locations5th
Ilha do retiro30 Locations10 Locations30 Locations7 Locations30 Locations1 Local1st
29 Locations11 Locations29 Locations6 Locations29 Locations2 Locations2nd
28 Locations12 Locations28 Locations8 Locations26 Locations6 Locations3rd
27 Locations9 Locations27 Locations9 Locations28 Locations7 Locations4th
26 Locations14 Locations26 Locations5 Locations27 Locations3 Locations5th
Arruda30 Locations8 Locations30 Locations6 Locations30 Locations4 Locations1st
28 Locations10 Locations29 Locations7 Locations28 Locations1 Local2nd
29 Locations11 Locations28 Locations5 Locations29 Locations5 Locations3rd
27 Locations7 Locations27 Locations4 Locations27 Locations6 Locations4th
26 Locations9 Locations26 Locations8 Locations26 Locations7 Locations5th
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

de Lima, A.M.; Nepomuceno, T.C.C.; Pergher, I.; de Carvalho, V.D.H.; Poleto, T. Optimizing Police Locations around Football Stadiums Based on a Multicriteria Unsupervised Clustering Analysis. Eng. Proc. 2023, 56, 275. https://doi.org/10.3390/ASEC2023-15230

AMA Style

de Lima AM, Nepomuceno TCC, Pergher I, de Carvalho VDH, Poleto T. Optimizing Police Locations around Football Stadiums Based on a Multicriteria Unsupervised Clustering Analysis. Engineering Proceedings. 2023; 56(1):275. https://doi.org/10.3390/ASEC2023-15230

Chicago/Turabian Style

de Lima, Antonio Marcos, Thyago C. C. Nepomuceno, Isaac Pergher, Victor D. H. de Carvalho, and Thiago Poleto. 2023. "Optimizing Police Locations around Football Stadiums Based on a Multicriteria Unsupervised Clustering Analysis" Engineering Proceedings 56, no. 1: 275. https://doi.org/10.3390/ASEC2023-15230

Article Metrics

Back to TopTop