Data Science Methods in Big Data Era

A special issue of Applied Sciences (ISSN 2076-3417). This special issue belongs to the section "Computing and Artificial Intelligence".

Deadline for manuscript submissions: 20 June 2024 | Viewed by 1078

Special Issue Editors


E-Mail Website
Guest Editor
Department of Computer Science and Artificial Intelligence, Universidad de Granada, 18071 Granada, Spain
Interests: data mining; big data; machine learning; information retrieval; misinformation; correlation statistical measures; fuzzy logic and fuzzy sets theory; sentence quantification and fuzzy quantification; information fusion; energy efficiency; federated learning
Special Issues, Collections and Topics in MDPI journals

E-Mail Website
Guest Editor
Department of Computer Science and Artificial Intelligence, University of Granada, Granada, Spain
Interests: data science; text mining; big data; artificial intelligence; machine learning; knowledge management; federated learning; misinformation; sentimental analysis; social network analysis; natural language processing; food computing; energy efficiency; health

E-Mail Website
Guest Editor Assistant
1. Department of Computer Science and Artificial Intelligence, Universidad de Granada, 318011 Granada, Spain
2. UCL Department of Experimental Psychology, University College London, London WC1H 0AP, UK
Interests: data mining; big data; machine learning; misinformation; fuzzy association rules; information fusion; energy efficiency; explainable artificial intelligence
Special Issues, Collections and Topics in MDPI journals

E-Mail Website
Guest Editor Assistant
Department of Computer Science and Artificial Intelligence, Universidad de Granada, Granada, ‎Spain
Interests: multidimensional analysis; data mining; NLP; machine learning; deep learning; knowledge graphs

Special Issue Information

Dear Colleagues,

Today, the amount of data generated every day on the Internet, in social media channels or in economic transactions exceeds the usual limits for its analysis using conventional data mining and machine learning techniques. In the last decade, numerous approaches have been proposed in different fields such as security, economy, energy, health, tourism, biological processes, customer profiles, anomaly detection, emergency management, etc. Therefore, it is necessary to continue investigating new methodologies and approaches following the big data paradigm in order to improve the analysis and obtain valuable information from massive datasets.

This Special Issue aims to discuss critical issues and challenges that the development of analysis and learning methods may face when dealing with massive amounts of data. Therefore, it aims to collect works at the forefront of data mining or machine learning with a focus on big data applications.

Topics include but are not limited to:

  • Theoretical and/or technical application of data or text mining methods in big data;
  • Theoretical and/or technical application of machine learning methods in big data;
  • Cloud computing in big data analysis;
  • Semantic models and knowledge representation for big data mining;
  • Parallel and distributed algorithms for big data mining or machine learning;
  • Social media analysis or web mining;
  • Stream mining and time series analysis;
  • Big data in fuzzy sets;
  • Information summarization and/or visualization in big data;
  • Novel applications of big data algorithms in several ambits: security, economy, health, tourism, energy, biological process, customer profiles, anomaly detection, emergency management, situation recognition, etc.

Dr. M. Dolores Ruiz
Prof. Dr. Maria J. Martin Bautista
Guest Editors

Dr. Carlos Fernández-Basso
Dr. Karel Gutiérrez-Batista
Guest Editor Assistants

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Applied Sciences is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • data mining
  • machine learning
  • big data
  • stream mining
  • cloud computing

Published Papers (1 paper)

Order results
Result details
Select all
Export citation of selected articles as:

Research

30 pages, 1654 KiB  
Article
A Hierarchical Orthographic Similarity Measure for Interconnected Texts Represented by Graphs
by Maxime Deforche, Ilse De Vos, Antoon Bronselaer and Guy De Tré
Appl. Sci. 2024, 14(4), 1529; https://doi.org/10.3390/app14041529 - 14 Feb 2024
Viewed by 518
Abstract
Similarity measures play a pivotal role in automatic techniques designed to analyse large volumes of textual data. Conventional approaches, treating texts as paradigmatic examples of unstructured data, tend to overlook their structural nuances, leading to a loss of valuable information. In this paper, [...] Read more.
Similarity measures play a pivotal role in automatic techniques designed to analyse large volumes of textual data. Conventional approaches, treating texts as paradigmatic examples of unstructured data, tend to overlook their structural nuances, leading to a loss of valuable information. In this paper, we propose a novel orthographic similarity measure tailored for the semi-structured analysis of texts. We explore a graph-based representation for texts, where the graph’s structure is shaped by a hierarchical decomposition of textual discourse units. Employing the concept of edit distances, our orthographic similarity measure is computed hierarchically across all components in this textual graph, integrating precomputed similarity values among lower-level nodes. The relevance and applicability of the presented approach are illustrated by a real-world example, featuring texts that exhibit intricate interconnections among their components. The resulting similarity scores, between all different structural levels of the graph, allow for a deeper understanding of the (structural) interconnections among texts and enhances the explainability of similarity measures as well as the tools using them. Full article
(This article belongs to the Special Issue Data Science Methods in Big Data Era)
Show Figures

Figure 1

Back to TopTop