Information

Journal Browser

► Journal Browser

Text Mining: Classification, Clustering, and Summarization

Share This Special Issue

Special Issue Editor

Prof. Dr. Duke Taeho Jo

E-Mail Website
Guest Editor

School of Game, Hongik University, Seoul, Korea
Interests: text mining; neural networks; machine learning; information retrieval

Special Issue Information

Dear Colleagues,

Text mining is defined as the process of extract implicit knowledge from textual data, as a special type of data mining. Main instances of text mining are text classification, text clustering, text summarization, and text segmentation. The text classification means the process of classifying a text into one among predefined categories; especially, spam mail filtering is the typical instance of text categorization. Text clustering means the process of segmenting a group of texts into subgroups each of which contains content based similar texts. Recently, as well as machine learning algorithms, such as K Nearest Neighbour, Naïve Bayes, and Support Vector Machine, deep learning algorithms are applied to the text classification.

The Special Issue on “Text Mining: Classification, Clustering, and Summarization” aims to improve the performances of each text mining tasks by applying deep learning algorithms, to derive hybrid tasks by combing the text mining tasks with each other, and to apply the text mining tasks to the real problems such as fraud document detection and financial prediction. Authors should submit papers describing significant, original and unpublished work. Possible topics include, but are not limited to:

Machine Learning Algorithms to improve Text Mining Tasks
Application of Deep Learning Algorithms to Text Mining Tasks
Application of Text Mining System to Real Tasks
Hybrid Text Mining Tasks
Web Mining: Web Contents Mining, Web Structure Mining, and Web Usage Mining
Multimedia Mining: Hybrid Mining of Multimedia Data

Prof. Dr. Duke Taeho Jo
Guest Editor

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Information is an international peer-reviewed open access monthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 1600 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

Word Classification
Word Clustering
Automatic Keyword Extraction
Index Optimization
Text Classification
Text Clustering
Text Summarization
Text Segmentation
Machine Learning
Deep Learning

Published Papers (1 paper)

Order results

Result details

Show export options Show export options

Select all

Export citation of selected articles as:

Research

16 pages, 450 KiB

Open AccessArticle

Ontological Semantic Annotation of an English Corpus Through Condition Random Fields

by Guidson Coelho de Andrade, Alcione de Paiva Oliveira and Alexandra Moreira

Information 2019, 10(5), 171; https://doi.org/10.3390/info10050171 - 9 May 2019

Cited by 2 | Viewed by 4255

Abstract

One way to increase the understanding of texts by machines is through adding semantic information to lexical items by including metadata tags, a process also called semantic annotation. There are several semantic aspects that can be added to the words, among them the information about the nature of the concept denoted through the association with a category of an ontology. The application of ontologies in the annotation task can span multiple domains. However, this particular research focused its approach on top-level ontologies due to its generalizing characteristic. Considering that annotation is an arduous task that demands time and specialized personnel to perform it, much is done on ways to implement the semantic annotation automatically. The use of machine learning techniques are the most effective approaches in the annotation process. Another factor of great importance for the success of the training process of the supervised learning algorithms is the use of a sufficiently large corpus and able to condense the linguistic variance of the natural language. In this sense, this article aims to present an automatic approach to enrich documents from the American English corpus through a CRF model for semantic annotation of ontologies from Schema.org top-level. The research uses two approaches of the model obtaining promising results for the development of semantic annotation based on top-level ontologies. Although it is a new line of research, the use of top-level ontologies for automatic semantic enrichment of texts can contribute significantly to the improvement of text interpretation by machines. Full article

(This article belongs to the Special Issue Text Mining: Classification, Clustering, and Summarization)

► Show Figures

Journal Menu

Journal Browser

Text Mining: Classification, Clustering, and Summarization

Share This Special Issue

Special Issue Editor

Special Issue Information

Keywords

Published Papers (1 paper)

Research

Further Information

Guidelines

MDPI Initiatives

Follow MDPI