Journal Browser

► Journal Browser

Data Science Methods in Big Data Era

Share This Special Issue

Special Issue Editors

Dr. M. Dolores Ruiz

E-Mail Website
Guest Editor

Department of Computer Science and Artificial Intelligence, Universidad de Granada, 18071 Granada, Spain
Interests: data mining; big data; machine learning; information retrieval; misinformation; correlation statistical measures; fuzzy logic and fuzzy sets theory; sentence quantification and fuzzy quantification; information fusion; energy efficiency; federated learning
Special Issues, Collections and Topics in MDPI journals

Prof. Dr. Maria J. Martin Bautista

E-Mail Website
Guest Editor

Department of Computer Science and Artificial Intelligence, University of Granada, Granada, Spain
Interests: data science; text mining; big data; artificial intelligence; machine learning; knowledge management; federated learning; misinformation; sentimental analysis; social network analysis; natural language processing; food computing; energy efficiency; health

Dr. Carlos Fernández-Basso

E-Mail Website
Guest Editor Assistant

1. Department of Computer Science and Artificial Intelligence, Universidad de Granada, 318011 Granada, Spain
2. UCL Department of Experimental Psychology, University College London, London WC1H 0AP, UK
Interests: data mining; big data; machine learning; misinformation; fuzzy association rules; information fusion; energy efficiency; explainable artificial intelligence
Special Issues, Collections and Topics in MDPI journals

Dr. Karel Gutiérrez-Batista

E-Mail Website
Guest Editor Assistant

Department of Computer Science and Artificial Intelligence, Universidad de Granada, Granada, ‎Spain
Interests: multidimensional analysis; data mining; NLP; machine learning; deep learning; knowledge graphs

Special Issue Information

Dear Colleagues,

Today, the amount of data generated every day on the Internet, in social media channels or in economic transactions exceeds the usual limits for its analysis using conventional data mining and machine learning techniques. In the last decade, numerous approaches have been proposed in different fields such as security, economy, energy, health, tourism, biological processes, customer profiles, anomaly detection, emergency management, etc. Therefore, it is necessary to continue investigating new methodologies and approaches following the big data paradigm in order to improve the analysis and obtain valuable information from massive datasets.

This Special Issue aims to discuss critical issues and challenges that the development of analysis and learning methods may face when dealing with massive amounts of data. Therefore, it aims to collect works at the forefront of data mining or machine learning with a focus on big data applications.

Topics include but are not limited to:

Theoretical and/or technical application of data or text mining methods in big data;
Theoretical and/or technical application of machine learning methods in big data;
Cloud computing in big data analysis;
Semantic models and knowledge representation for big data mining;
Parallel and distributed algorithms for big data mining or machine learning;
Social media analysis or web mining;
Stream mining and time series analysis;
Big data in fuzzy sets;
Information summarization and/or visualization in big data;
Novel applications of big data algorithms in several ambits: security, economy, health, tourism, energy, biological process, customer profiles, anomaly detection, emergency management, situation recognition, etc.

Dr. M. Dolores Ruiz
Prof. Dr. Maria J. Martin Bautista
Guest Editors

Dr. Carlos Fernández-Basso
Dr. Karel Gutiérrez-Batista
Guest Editor Assistants

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Applied Sciences is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

data mining
machine learning
big data
stream mining
cloud computing

Published Papers (1 paper)

Order results

Result details

Show export options Show export options

Select all

Export citation of selected articles as:

Research

30 pages, 1654 KiB

Open AccessArticle

A Hierarchical Orthographic Similarity Measure for Interconnected Texts Represented by Graphs

by Maxime Deforche, Ilse De Vos, Antoon Bronselaer and Guy De Tré

Appl. Sci. 2024, 14(4), 1529; https://doi.org/10.3390/app14041529 - 14 Feb 2024

Viewed by 518

Abstract

Similarity measures play a pivotal role in automatic techniques designed to analyse large volumes of textual data. Conventional approaches, treating texts as paradigmatic examples of unstructured data, tend to overlook their structural nuances, leading to a loss of valuable information. In this paper, we propose a novel orthographic similarity measure tailored for the semi-structured analysis of texts. We explore a graph-based representation for texts, where the graph’s structure is shaped by a hierarchical decomposition of textual discourse units. Employing the concept of edit distances, our orthographic similarity measure is computed hierarchically across all components in this textual graph, integrating precomputed similarity values among lower-level nodes. The relevance and applicability of the presented approach are illustrated by a real-world example, featuring texts that exhibit intricate interconnections among their components. The resulting similarity scores, between all different structural levels of the graph, allow for a deeper understanding of the (structural) interconnections among texts and enhances the explainability of similarity measures as well as the tools using them. Full article

(This article belongs to the Special Issue Data Science Methods in Big Data Era)

► Show Figures

Journal Menu

Journal Browser

Data Science Methods in Big Data Era

Share This Special Issue

Special Issue Editors

Special Issue Information

Keywords

Published Papers (1 paper)

Research

Further Information

Guidelines

MDPI Initiatives

Follow MDPI