Application of Machine Learning in Data Analysis and Process

A special issue of Applied Sciences (ISSN 2076-3417). This special issue belongs to the section "Computing and Artificial Intelligence".

Deadline for manuscript submissions: closed (30 November 2023) | Viewed by 1776

Special Issue Editor


E-Mail Website
Guest Editor
School of Computer Data and Mathematical Sciences, Western Sydney University, Penrith, NSW, Australia
Interests: advanced data modeling and analysis

Special Issue Information

Dear Colleagues,

The application of machine learning (ML) in data analysis and data processing has transformed the way insights are extracted and decisions are made from complex datasets. ML techniques, including predictive analytics, anomaly detection, natural language processing, image analysis, recommendation systems, and process optimization, offer valuable capabilities that complement traditional statistical modeling approaches.

ML algorithms enhance predictive analytics by leveraging historical data to make accurate forecasts and support proactive decision making. Anomaly detection powered by ML enables the identification of outliers and patterns indicative of fraud, network intrusions, and quality control issues. ML's integration with natural language processing facilitates sentiment analysis, text classification, and information extraction from unstructured text sources.

In image analysis, ML algorithms excel at tasks such as object recognition, facial recognition, and content moderation, with applications across healthcare, security, and multimedia content management. ML-based recommendation systems provide personalized suggestions, enhancing user experiences and engagement. Additionally, ML optimization techniques improve process efficiency by identifying patterns and optimizing workflows, resource allocation, and cost reduction.

The strengths of ML lie in its ability to handle complex datasets, process unstructured data types, and automate decision-making processes. However, ongoing research is needed to address challenges related to interpretability, fairness, scalability, and adapting to new data sources.

In summary, the integration of ML with data analysis and processing offers a powerful approach to extract insights, make accurate predictions, and optimize processes. ML complements traditional statistical modeling techniques, expanding their capabilities and enabling advanced analyses. The continuous advancement of ML holds immense potential for unlocking deeper insights, driving data-driven decision making, and addressing the challenges posed by complex and ever-growing datasets.

Dr. Kenan Matawie
Guest Editor

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Applied Sciences is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Published Papers (2 papers)

Order results
Result details
Select all
Export citation of selected articles as:

Research

18 pages, 2474 KiB  
Article
Examining Commercial Crime Call Determinants in Alley Commercial Districts before and after COVID-19: A Machine Learning-Based SHAP Approach
by Hyun Woo Kim, Dakota McCarty and Minju Jeong
Appl. Sci. 2023, 13(21), 11714; https://doi.org/10.3390/app132111714 - 26 Oct 2023
Viewed by 709
Abstract
Although several previous studies have examined factors influencing crime at a specific point in time, limited research has assessed how factors influencing crime change in response to social disasters such as COVID-19. This study examines factors, along with their relative importance and trends [...] Read more.
Although several previous studies have examined factors influencing crime at a specific point in time, limited research has assessed how factors influencing crime change in response to social disasters such as COVID-19. This study examines factors, along with their relative importance and trends over time, and their influence on 112 commercial crime reports (illegal street vendors, dining and dashing, minor quarrels, theft, drunkenness, assault, vagrancy and disturbing the peace) in Seoul’s alley commercial districts between 2019 and 2021. Variables that may affect the number of commercial crime reports are classified into four characteristics (socioeconomic, neighborhood, park/greenery and commercial district attributes), explored using machine learning regression-based modeling and analyzed through the use of Shapley Additive exPlanations to determine the importance of each factor on crime reports. The Partial Dependence Plot is used to understand linear/non-linear relationships between key independent variables and crime reports. Among several machine learning models, the Extra Trees Regressor, which has the highest performance, is selected for the analysis. The results show a mixture of linear and non-linear relationships with the increasing crime rates, finding that store density, dawn sales ratio, the number of gathering facilities, perceived urban decline score, green view index and land appraisal value may play a crucial role in the number of commercial crimes reported, regardless of social trends. The findings of this study may be used as a basis for building a safe commercial district that can respond resiliently to social disasters. Full article
(This article belongs to the Special Issue Application of Machine Learning in Data Analysis and Process)
Show Figures

Figure 1

24 pages, 6646 KiB  
Article
Enhanced Particle Swarm Optimization Algorithm for Sea Clutter Parameter Estimation in Generalized Pareto Distribution
by Bin Yang and Qing Li
Appl. Sci. 2023, 13(16), 9115; https://doi.org/10.3390/app13169115 - 10 Aug 2023
Viewed by 784
Abstract
Accurate parameter estimation is essential for modeling the statistical characteristics of ocean clutter. Common parameter estimation methods in generalized Pareto distribution models have limitations, such as restricted parameter ranges, lack of closed-form expressions, and low estimation accuracy. In this study, the particle swarm [...] Read more.
Accurate parameter estimation is essential for modeling the statistical characteristics of ocean clutter. Common parameter estimation methods in generalized Pareto distribution models have limitations, such as restricted parameter ranges, lack of closed-form expressions, and low estimation accuracy. In this study, the particle swarm optimization (PSO) algorithm is used to solve the non-closed-form parameter estimation equations of the generalized Pareto distribution. The goodness-of-fit experiments show that the PSO algorithm effectively solves the non-closed parameter estimation problem and enhances the robustness of fitting the generalized Pareto distribution to heavy-tailed oceanic clutter data. In addition, a new parameter estimation method for the generalized Pareto distribution is proposed in this study. By using the difference between the statistical histogram of the data and the probability density function/cumulative distribution function of the generalized Pareto distribution as the target, an adaptive function with weighted coefficients is constructed to estimate the distribution parameters. A hybrid PSO (HPSO) algorithm is used to search for the best position of the fitness function to achieve the best parameter estimation of the generalized Pareto distribution. Simulation analysis shows that the HPSO algorithm outperforms the PSO algorithm in solving the parameter optimization task of the generalized Pareto distribution. A comparison with other traditional parameter estimation methods for generalized Pareto distribution shows that the HPSOHPSO algorithm exhibits strong parameter estimation performance, is efficient and stable, and is not limited by the parameter range. Full article
(This article belongs to the Special Issue Application of Machine Learning in Data Analysis and Process)
Show Figures

Figure 1

Back to TopTop