Modified Aquila Optimizer with Stacked Deep Learning-Based Sentiment Analysis of COVID-19 Tweets

Almasoud, Ahmed S.; Alshahrani, Hala J.; Hassan, Abdulkhaleq Q. A.; Almalki, Nabil Sharaf; Motwakel, Abdelwahed

doi:10.3390/electronics12194125

Open AccessArticle

Modified Aquila Optimizer with Stacked Deep Learning-Based Sentiment Analysis of COVID-19 Tweets

by

Ahmed S. Almasoud

¹

,

Hala J. Alshahrani

²,

Abdulkhaleq Q. A. Hassan

³

,

Nabil Sharaf Almalki

⁴ and

Abdelwahed Motwakel

^5,*

¹

Department of Information Systems, College of Computer and Information Sciences, Prince Sultan University, Riyadh 12435, Saudi Arabia

²

Department of Applied Linguistics, College of Languages, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh 11671, Saudi Arabia

³

Department of English and Applied Linguistics, College of Science and Arts at Mahayil, King Khalid University, Abha 62529, Saudi Arabia

⁴

Department of Special Education, College of Education, King Saud University, Riyadh 12372, Saudi Arabia

⁵

Department of Management Information Systems, College of Business Administration in Hawtat Bani Tamim, Prince Sattam bin Abdulaziz University, Al-Kharj 16278, Saudi Arabia

^*

Author to whom correspondence should be addressed.

Electronics 2023, 12(19), 4125; https://doi.org/10.3390/electronics12194125

Submission received: 17 August 2023 / Revised: 7 September 2023 / Accepted: 15 September 2023 / Published: 3 October 2023

(This article belongs to the Special Issue Trends and Prospects in Hybrid Methods for Natural Language Processing)

Download

Browse Figures

Versions Notes

Abstract

:

In recent times, global cities have been transforming from traditional cities to sustainable smart cities. In text sentiment analysis (SA), many people face critical issues namely urban traffic management, urban living quality, urban information security, urban energy usage, urban safety, etc. Artificial intelligence (AI)-based applications play important roles in dealing with these crucial challenges in text SA. In such scenarios, the classification of COVID-19-related tweets for text SA includes using natural language processing (NLP) and machine learning methodologies to classify tweet datasets based on their content. This assists in disseminating relevant information, understanding public sentiment, and promoting sustainable practices in urban areas during this pandemic. This article introduces a modified aquila optimizer with a stacked deep learning-based COVID-19 tweet Classification (MAOSDL-TC) technique for text SA. The presented MAOSDL-TC technique incorporates FastText, an effective and powerful text representation approach used for the generation of word embeddings. Furthermore, the MAOSDL-TC technique utilizes an attention-based stacked bidirectional long short-term memory (ASBiLSTM) model for the classification of sentiments that exist in tweets. To improve the detection results of the ASBiLSTM model, the MAO algorithm is applied for the hyperparameter tuning process. The presented MAOSDL-TC technique is validated on the benchmark tweets dataset. The experimental outcomes implied the promising results of the MAOSDL-TC technique compared to recent models in terms of different measures. This MAOSDL-TC technique improves accuracy and interpretability of sentiment prediction.

Keywords:

artificial intelligence; COVID-19; emotions; Twitter data; sentiment analysis

1. Introduction

Social media platforms play an important part during extreme crises as individuals use these communications media to share feedback, sentiments, thoughts, and reactions with other people to manage and respond to crises [1]. Thus, this study focuses on explorative collective reactions to events expressed on social platforms [2]. Special consideration will be given to analyzing the public’s responses to worldwide medical-relevant events, particularly the pandemic, described through Twitter’s social network, due to its widespread reputation and ease of access utilizing the application programming interface (API) [3]. Sentiment analysis (SA) is a kind of technique employed to represent, separate, or define personal data like ideas communicated in a given content, depending on natural language processing (NCP) and computational methods [4]. The major goal of SA is to define the author’s feelings as negative, positive, or neutral regarding different subjects [5]. To evaluate the effects of social media information relevant to the COVID-19 pandemic, research associated with people’s opinions on medical information and applications gained major significance [6]. In particular, text analysis of Twitter information has been the emphasis in several reviews, allowing researchers to analyze massive instances of user-defined content to find views, which can inform decision-making and earlier reaction mechanisms [7]. The Twitter platform has been undergoing a large infusion of data relevant to COVID-19 problems [8]. For SA, researchers have been using different kinds of textual documents such as Facebook posts and tweets [9].

Several research works on SA using social media data are available in the literature [10]. Identification of such sentiments from social media can support respondents in comprehending network dynamism, for example, panics, users’ important problems, and emotional impacts on members’ skills [11]. This study aims to examine the application of deep learning (DL) methods and natural language processing (NLP) approaches, namely SA, to support policymakers and communities to avoid the growth of misleading information, incitement of insurrection, and fake news [12]. SA or public view mining can be described as a way of employing machine learning (ML) and NLP for the classification of sentiments and subjective data [13]. SA is defined as the most common research field in the domain of NLP as it provides the ability to study and analyze sentiments that are expressed by various individuals [14].

This article introduces a modified aquila optimizer with stacked deep learning-based tweets classification (MAOSDL-TC) technique for text SA. The presented MAOSDL-TC technique incorporates FastText, an effective powerful text representation approach used for the generation of word embeddings. Furthermore, the MAOSDL-TC technique utilizes an attention-based stacked bidirectional long short-term memory (ASBiLSTM) model for the classification of sentiments that exist on Twitter. To improve the detection results of the ASBiLSTM model, the MAO algorithm is applied for the hyperparameter tuning process. The presented MAOSDL-TC technique is validated against the benchmark of a COVID-19 tweets dataset.

2. Related Works

Qorib et al. [15] downloaded public tweets day-to-day from Twitter using the Tweeter API and pre-processed and labelled them. Vocabulary normalization was based mainly on the stemming and lemmatization processes. The NRCLexicon method was used to transform tweets into 10 different classes. A T-test was deployed to check the statistical point of the relationship between the sentiments. Lastly, neural networks including the bidirectional encoder representations from transformers (BERT), 1-dimensional convolutional neural network (1DCNN), long short-term memory (LSTM), and multilayer perceptron (MLP) were tested and trained. In [16], an approach was introduced that was designed to provide an ensemble module where the advantages of automatic feature extraction and handcrafted features were linked through ML and DL algorithms. Before training ML techniques, unstructured information was attained, pre-processed, and annotated using VADER and TextBlob. Sunagar et al. [17] implemented the tweet classification of COVID-19 datasets via implementing DL approaches. The algorithm was executed using two-word embedding methods such as Global Vector for Word2Vec and Word Representation (GloVe).

In [18], the researchers presented an NLP technique based on the bidirectional LSTM (BiLSTM) method to implement sentiment classification and detect several problems related to public sentiment on COVID-19. BiLSTM is an enhanced version of classical LSTM to generate the outputs from right and left contexts at every time step. This enabled authorized institutions utilizing this model to alleviate the effect of negative messages and to understand the people’s concerns. Tatineni et al. [19] presented a technique to evaluate the emotion of live tweets. The technique comprised a dashboard with different functionalities. The central dashboard had a clickable map of India that illustrated state-wide data visualization and had country-wide data visualization of the emotion drawn from Twitter. Live emotion prediction of tweets can be accomplished using the DL techniques. Tweet fetching is a dynamic to obtain new data automatically. Vaddadi et al. [20] developed a technique that used automated implementation to extract details regarding COVID-19 from the up-to-date tweet data. SA uses LSTM, which is a kind of recurrent neural network (RNN) employed by using Twitter’s COVID-19 hashtags to see people’s reactions to the outbreak. Then, the tweet datasets are categorized and labelled as positive, negative, and neutral and the results visualized.

Chakraborty et al. [21] presented SA on the amount of tweets gathered on COVID-19. In the beginning, they analyzed the trends of public sentiments related to COVID-19 using the n-gram analysis and evolutionary classification. Next, the sentiment rating was calculated on gathered tweets based on the class. Lastly, the LSTM model was trained through two classes of rated tweets to forecast sentiment on the COVID-19 dataset. Tawfik and Makhlouf [22] analyzed public opinions on the program of vaccination against COVID-19. To achieve this, an ensemble mechanism based on DL was established, which fused LSTM and bidirectional gated recurrent unit (BiGRU). The accuracy of the presented algorithm was compared with five different ML techniques, and two DL algorithms using advanced approaches.

Raheja and Asthana [23] implemented an SA of tweets in lockdown utilizing a multinomial LR approach. The presented methodology design followed the pre-processed, polarity and scoring, and extracting features before executing the ML approach. In [24], a novel algorithm was presented for automatic sentiment classification of COVID-19 tweets utilizing ANFIS approaches. Jain et al. [25] purposed to analyze the performance of many classification techniques that demand an input value and identified to which resultant classification they belong. Six ML approaches, two ensemble systems, and four DL methods were utilized for this work. In [26], the R programming language was used to conduct an investigation of Twitter data. During this case, the authors planned a method named Hybrid Heterogeneous SVM (H-SVM) and carried out the sentiment classification and categorized tweets as negative, neutral, and positive.

3. The Proposed Model

This article is concentrated on the improvement of the MAOSDL-TC technique for text SA. The MAOSDL-TC technique mainly concentrates on the recognition and categorization of different kinds of sentiments in COVID-19 tweets. In the presented MAOSDL-TC technique, the following set of processes are involved, namely pre-processing, FastText, ASBiLSTM-based classification, and MAO-based parameter selection. Figure 1 depicts the workflow of the MAOSDL-TC algorithm.

3.1. Data Pre-Processing and Word Embedding

Text preprocessing is the technique used to clean the original text data. A robust text pre-processing technique is crucial for applications of NLP tasks. After preprocessing, the attained text components act as key elements of input that are fed into the processing of textual data. Preprocessing consists of different approaches for translating the original texts using a well-defined method: special characters or symbols, lemmatization, elimination of stopwords, lexical analysis (ignore case sensitivity, word tokenization, and removal of punctuation). Afterwards, the FastText method was employed for the processing of word embedding. FastText is a widely used text representation method that generates word embeddings that are a dense vector representation of words. This embedding captures the semantic meaning of an individual word and its subword information and morphological structure. Particularly, this makes FastText more effective in handling out-of-vocabulary words and capturing the relationship between words with related prefixes or suffixes. FastText works by assuming a word is a mixture of subword units (character n-grams). This technique enables it to create embedding for known and unknown words by leveraging the subword component.

3.2. Tweet Data Classification Using ASBiLSTM Model

Once the tweets are preprocessed, classification takes place using the ASBiLSTM model. In this study, we used the ASBiLSTM model as an essential element of the presented method, which has the benefit of simultaneously extracting temporal features of time series [27]. The BILSTM is an augmentation of the LSTM. The LSTM is a kind of RNN, which overcomes the problems of vanishing gradient from RNN through the inclusion of a gating module. In comparison with RNN, LSTM is composed of memory cells, forget, input, and outputs, in which the cell memory is liable to store the overview of historical input series, and the gate modules control the flow of information between the input and output datasets. LSTM aids efficient learning of long-term temporal dependency relationships by taking their well-developed structure into account.

Consider

c_{t - 1}

as the memory cell state of the prior

t

−

1

time step, an input vector

x_{t}

at

t

time steps, and

h_{t - 1}

indicates the hidden layer of the prior

t

−

1

time step.

f_{t},

i_{t}

, and

0_{t}

show the gate vector that controls how much data is to be forgotten, updated, and output from the memory cell, correspondingly. The operation of LSTM was formulated by the following expression:

f_{t} = σ (W_{x f} x_{t} + W_{h f} h_{t - 1} + b_{f}) i_{t} = σ (W_{x i} x_{t} + W_{h i} h_{t - 1} + b_{i}) o_{t} = σ (W_{x_{0}} x_{t} + W_{h o} h_{t - 1} + b_{o}) c_{t} = f_{t} ⊙ c_{t - 1} + i_{t} ⊙ t a n h (W_{x c} x_{t} + W_{h c} h_{t - 1} + b_{c}) h_{t} = O_{t} ⊙ t a n h (C)

(1)

From the expression,

t h e T a n h

function ensures that the value of HL remains in

[- 1, 1]

the interval.

σ (∙)

indicates the sigmoid function; the symbol

⊙

shows the pointwise multiplication. The learnable parameters

W

and

b

are weight and deviation during the training model, respectively.

BILSTM combines a bidirectional conceptualization into LSTM that exploits forward and backward LSTM for feature extraction and concatenates respective hidden features for extracting patterns or bidirectional features. Accordingly, BILSTM attains context data in the previous observation for the entire input. This bidirectional extraction on the time series simplifies the capture of backwards and forward temporal attributes in wind power-related data considering the variation patterns. With the context feature, BiLSTM allows a hybrid model for wind power-related data to attain feature extraction capabilities and better representation that enables more accurate and efficient prediction of future observation by leveraging past observation.

Particularly, BILSTM trains its parameters in backward and forward paths to realize the context. During the backward layer, the LSTM estimates the derivation of transmission errors in the forward layer. The LSTM updates the parameters from the conventional way in the forward layer. Considering an input of length

T

, the operational procedures are shown below:

\vec{h_{t}} = \vec{L S T M} (h_{t - 1}, x_{t}, c_{t - 1},), t \in [1, T] {\overset{\leftarrow}{h}}_{t} = \overset{⃡}{L S T M} (h_{t + 1}, x_{t}, c_{t + 1}), t \in [T, 1] H_{t} = [{\vec{h}}_{t}, {\overset{\leftarrow}{h}}_{t}]

(2)

where

H_{t}

indicates the hidden layer (HL) of BILSTM at time step

t

,

{\vec{h}}_{t}

and

{\overset{\leftarrow}{h}}_{t}

signifies the HL in the forward and backward layers at

t i m e s t e p t

.

In ASBiLSTM, the attention module was used to optimize the prediction outcomes. Figure 2 signifies the framework of ASBiLSTM. The attention module is a weighting quantity of sequences that allocates great weight to targets with higher correlation. An attention module minimizes the loss of prior datasets and extracts relevant information by highlighting the contribution of the most powerful and useful parts of the input to the outputs. In the DL technique, the attention module allocates weight to the output of BiLSTM by mapping the weights and the learning parameter matrix can be focused on the input that contributes to the outputs.

As shown in Equations (3) to (5), a series of outputs

H_{1}, H_{2}, \dots, H_{t}

through the HL of BILSTM are fed as input to the attention model, and the distribution of attention weights is attained. Equation (5) indicates the accomplishment of the last state of the attention mechanism. Equation (4) shows the computation of attention weight by standardizing the score. Equation (3) defines the computation of similarities or correlations between the input and output features.

e_{i} = V_{e} t a n h (W H_{i} + b)

(3)

α_{i} = \frac{e x p (e_{i})}{\sum_{i} e x p (e_{i})}

(4)

C = \sum_{i} α_{i} H_{i}

(5)

where

V_{e}

and

W

signify the weighted coefficient of the parameter learned in the training model.

e_{i}

indicates the distribution probability at

i t h

time steps.

b

shows the bias.

3.3. Hyperparameter Tuning Using MAO Algorithm

The MAO algorithm can be applied in this work for the hyperparameter tuning of the ASBiLSTM module. The AO mainly depends upon the prey-grabbing nature of the Aquila. AO is a population-based algorithm which exhibits its effectiveness in the field of complex and nonlinear optimization in a short period of time. The classical AO principally focuses on five significant steps namely initialization, expanded exploration, narrowed exploration, expanded exploitation and narrowed exploitation.

An MAO was introduced in this study [28]. By modifying the SCF from IAO, MAO was inspired to make further amendments to the AO. However, the convergence properties of SCF decelerate the accuracy of the epochs in IAO. These properties may be responsible for certain challenges in searching for an optimum result. To overcome these challenges, a modified version of IAO was introduced that integrates a modified search control factor (MSCF) that is particularly adapted to the 2nd and 3rd search processes. The subsequent section provides a detailed description of the MAO technique, which highlights certain modifications that were made and their effects on the optimization technique. The MSCF is used to control the search range, which reduces movement of the Aquila in terms of epochs. Accordingly, compared to the prior SCF, the search space is considerably narrower. Furthermore, the optimum solution is found considerably more quickly than in the prior technique. The modified MSCF is shown as follows:

M - S C F (t) = 2 \times e x p (1 - (\frac{t \times (t \times 0.1)}{T})) \times d i r .

(6)

d i r = \{\begin{array}{l} 1 & i f r < 0.5, \\ - 1 & e l s e . \end{array}

(7)

where

t

denotes the existing iteration and

T

shows the maximal iteration. The

r

parameter shows a random integer ranging from zero to one, where

d i r

indicates the direction control factor. These factors play a major role in controlling the fight direction of the Aquila.

The MSCF function aims to attain fast convergence by restricting the movement of the Aquila. Furthermore, it decreases optimization latency. The modified technique needs less time to recognize the optimum solution set than the original AO. Both optimization approaches were performed with sizes of 250 and 250 epochs.

With the incorporation of the MSCF function, the presented technique includes four different search stages that are discussed in the following:

Step 1: Vertical Dive Attack $(S_{1})$

The Aquila begin its hunting by identifying the target region and selecting the optimum hunting position by swooping high in the air. These attacks are called vertical dive attacks and are expressed as follows:

S_{1} (t + 1) = S_{b e s t} (t) \times (1 - \frac{t}{T}) + (S_{M} (t) - S_{b e s t} (t) \times r)

(8)

In Equation (8),

S_{1} (t + 1)

denotes the solution candidate of

(t + 1)

epochs,

r

shows the random integer in [0, 1] the interval, and

S_{b e s t} (t)

shows the better solution attained to the

i t h

generation.

(1 - T t)

is used for controlling the search region. Now,

S (t)

denotes the mean value of the existing solution to

i t h

epochs.

Step 2: Modified Full Search with a Short Glide Attack $(M S)$

Before attacking the prey, the Aquila comprehensively searches the solution space via different directions and speeds, in what is called a full search with shorter glide attacks that can be shown as follows:

M S_{2} (t + 1) = S_{R} (t) + M - S C F (t) \times (S_{b e s t} (t) - S (t)) \times r \times (y - x)

(9)

In Equation (9),

x

, and

y

correspond to the positions or coordinates of the point making the spiral shape during the search step,

r

indicates the random integer within [0,1], and

M C F (t)

denotes the modified search control factor. Rather than applying the Levy flight (LF) distribution, we integrated MSCF to eliminate the problems of getting stuck in a locally optimal solution.

Step 3: Modified Search Around Prey and Attack $(M S)$

The prey’s region is located accurately after the

M S_{2}

search step. The Aquila thoroughly explores around the target, and with pseudo attacks, recognizes the prey’s reaction in what is called a search around prey and attack.

M S_{3} (i, j) = l b_{j} + r \times (u b_{j} - l b_{j}) + r \times (S_{R} (j) - S_{b e s t} (j)) \times M - S C F (t) \times (1 - \frac{t}{T})

(10)

In Equation (10),

S_{R} (j)

denotes the random set of solutions and

M S_{3} (i, j)

indicates the existing solution for

t

epochs.

Step 4: Walk and Grab Attack (S)

Finally, the Aquila attacks from above based on the prey’s movement for the 4th search approach. This search process can be denoted as “Walk and Grab Prey”,

S_{4} (t + 1) = Q_{F} \times S_{b e s t} (t) - (G_{1} \times S (t) \times r) - G_{2} \times l e v (D),

(11)

Q_{F} = t^{\frac{2 \times r - 1}{(1 - T)^{2}}},

(12)

G_{1} = 2 \times r a n d o m - 1,

(13)

G_{2} = 2 \times (1 - \frac{t}{T}) .

(14)

where

S_{4} (t + 1)

represents the solution attained so far, and

l e v (D)

shows the Levy distribution for the

D

dimensional range. QF indicates the quality function for balancing the search process,

G_{1}

denotes each kind of movement of Aquila during the hunt, and

G_{2}

shows the fight slope of hunting.

The fitness choice is a key component of the MAO method. Encoder performance is applied measure a better solution candidate. Now, the performance value is the foremost condition applied to develop an FF.

F i t n e s s = m a x (P)

(15)

P = \frac{T P}{T P + F P}

(16)

where

T P

and

F P

indicate the true and false positive values.

4. Results and Discussion

The performance validation of the MAOSDL-TC method on the sentiment classification of COVID-19 tweets takes place using the Kaggle dataset [29], which holds 2750 samples with 11 classes, as portrayed in Table 1.

Figure 3 represents the classifier performances of the MAOSDL-TC technique under the test database. Figure 3a,b shows the confusion matrix achieved by the MAOSDL-TC approach at 70:30 of the TR set/TS set. The outcome value determined that the MAOSDL-TC algorithm has classified and detected all 11 classes accurately. Then, Figure 3c depicts the PR examination of the MAOSDL-TC approach. The outcome demonstrated that the MAOSDL-TC algorithm has achieved greater PR outcomes in 11 classes. But, Figure 3d displays the ROC outcome of the MAOSDL-TC algorithm. The simulation value exhibited that the MAOSDL-TC approach led to able performances with higher values of ROC on 11 classes.

4.1. Result Analysis

A brief result of using the MAOSDL-TC technique on COVID-19 tweet classification is illustrated in Table 2 and Figure 4. The obtained results state that the MAOSDL-TC technique properly recognized all classes. On 70% of the TR set, the MAOSDL-TC technique provided an average

a c c u_{y}

of 99.19%,

p r e c_{n}

of 95.63%,

r e c a_{l}

of 95.55%,

F_{s c o r e}

of 95.54%, and JI of 91.49%. In addition, on 30% of the TS set, the MAOSDL-TC approach attained an average

a c c u_{y}

of 99.45%,

p r e c_{n}

of 97.15%,

r e c a_{l}

of 96.89%,

F_{s c o r e}

of 96.99%, and JI of 94.18%.

Figure 5 illustrates the training accuracy

T R_a c c u_{y}

and

V L_a c c u_{y}

of the MAOSDL-TC approach. The

T L_a c c u_{y}

is determined by the evaluation of the MAOSDL-TC technique on the TR dataset whereas the

V L_a c c u_{y}

is computed by evaluating performance on a separate testing dataset. The results exhibit that

T R_a c c u_{y}

and

V L_a c c u_{y}

upsurge with an increase in epochs. As a result, the outcome of the MAOSDL-TC technique increases on the TR and TS datasets with a rise in the number of epochs.

In Figure 6, the

T R_l o s s

and

V R_l o s s

results of the MAOSDL-TC approach without optimization are revealed. The

T R_l o s s

defines errors among the predictive performance and original values on the TR data. The

V R_l o s s

represents a measure of the performance of the MAOSDL-TC system on individual validation data. The outcome signifies that the

T R_l o s s

and

V R_l o s s

tend to decrease with rising epochs. It depicted the enhanced performance of the MAOSDL-TC method and its capability to make accurate classifications. The minimal value of

T R_l o s s

and

V R_l o s s

demonstrates the improved performance of the MAOSDL-TC technique in capturing patterns and relationships.

4.2. Discussion

In Table 3 and Figure 7, we offered an extensive comparative result of the MAOSDL-TC technique [30,31]. The results indicate that the MAOSDL-TC technique exhibits promising results over other models. Based on

a c c u_{y}

, the MAOSDL-TC technique reports an increasing

a c c u_{y}

of 99.45% while the RF, XGBoost, SVM, ensemble, DT, SFODLDSAC, and MPONLP-TSA techniques accomplish decreasing values of 91.20%, 91.40%, 90.95%, 93.19%, 90.82%, 98.50%, and 99.10%, respectively. At the same time, based on

p r e c_{n}

, the MAOSDL-TC approach reports a higher

p r e c_{n}

of 97.15% while the RF, XGBoost, SVM, ensemble, DT, SFODLDSAC, and MPONLP-TSA systems achieve limited values of 92.48%, 91.69%, 90.39%, 94.32%, 90.82%, 96.15%, and 96.74% correspondingly. Finally, based on

F_{s c o r e}

, the MAOSDL-TC technique reports a maximal

F_{s c o r e}

of 96.99% while the RF, XGBoost, SVM, ensemble, DT, SFODLDSAC, and MPONLP-TSA algorithms realize minimal values of 91.49%, 91.20%, 90.25%, 93.96%, 90.46%, 95.15%, and 95.90% correspondingly.

These results confirmed that the MAOSDL-TC technique exhibits enhanced performance over recent models.

5. Conclusions

This article has concentrated on the improvement of the MAOSDL-TC method for classification of text sentiments in COVID-19 tweets. The MAOSDL-TC technique mainly concentrates on the recognition and categorization of different kinds of sentiments in COVID-19-related tweets. In the presented MAOSDL-TC technique, the following set of processes were involved, namely pre-processing, FastText, ASBiLSTM-based classification, and MAO-based parameter selection. In this work, the ASBiLSTM model for the classification of sentiments existing in the tweets. Lastly, the MAO system can be applied for the hyperparameter tuning process, which aids in improving the detection results of the ASBiLSTM model. The presented MAOSDL-TC method is validated on the benchmark tweets dataset. The experimental outcomes, with maximum accuracy of 99.45%, suggested the promising results of the MAOSDL-TC technique compared to recent models. This MAOSDL-TC technique not only improves accuracy but also enhances the better interpretability of sentiment prediction.

Author Contributions

Conceptualization, A.S.A. and H.J.A.; Methodology, A.S.A., H.J.A., A.Q.A.H. and N.S.A.; Software, A.M.; Validation, N.S.A. and A.M.; Investigation, A.S.A.; Data curation, N.S.A.; Writing—original draft, A.S.A., H.J.A., A.Q.A.H. and A.M.; Writing—review & editing, H.J.A., A.Q.A.H., N.S.A. and A.M.; Visualization, N.S.A.; Project administration, A.M.; Funding acquisition, H.J.A. All authors have read and agreed to the published version of the manuscript.

Funding

The authors extend their appreciation to the Deanship of Scientific Research at King Khalid University for funding this work through large group Research Project under grant number (RGP2/185/44). Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2022R281), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. Research Supporting Project number (RSPD2023R521), King Saud University, Riyadh, Saudi Arabia. This study is supported via funding from Prince Sattam bin Abdulaziz University project number (PSAU/2023/R/1444). This study is partially funded by the Future University in Egypt (FUE).

Data Availability Statement

Data sharing does not apply to this article as no datasets were generated during the current study.

Conflicts of Interest

The authors declare that they have no conflict of interest.

References

Anuradha, K.; Parvathy, M. Multi-label Emotion Classification of COVID-19 Tweets with Deep Learning and Topic Modelling. Comput. Syst. Sci. Eng. 2023, 46, 3005–3021. [Google Scholar] [CrossRef]
Deva Priya, M.; Saranya, M.; Sharaha, N.; Tamizharasi, S. Classification of COVID-19 tweets using deep learning classifiers. In Proceedings of International Conference on Recent Trends in Computing: ICRTC 2021; Springer: Singapore, 2022; pp. 213–225. [Google Scholar]
Bangyal, W.H.; Qasim, R.; Rehman, N.U.; Ahmad, Z.; Dar, H.; Rukhsar, L.; Aman, Z.; Ahmad, J. Detection of fake news text classification on COVID-19 using deep learning approaches. Comput. Math. Methods Med. 2021, 2021, 1–14. [Google Scholar] [CrossRef] [PubMed]
Ainapure, B.S.; Pise, R.N.; Reddy, P.; Appasani, B.; Srinivasulu, A.; Khan, M.S.; Bizon, N. Sentiment Analysis of COVID-19 Tweets Using Deep Learning and Lexicon-Based Approaches. Sustainability 2023, 15, 2573. [Google Scholar] [CrossRef]
Fattoh, I.E.; Kamal Alsheref, F.; Ead, W.M.; Youssef, A.M. Semantic sentiment classification for COVID-19 tweets using universal sentence encoder. Comput. Intell. Neurosci. 2022, 2022, 6354543. [Google Scholar] [CrossRef]
Oumaima, S.; Soulaimane, K.; Omar, B. Artificial Intelligence in Predicting the Spread of Coronavirus to Ensure Healthy Living for All Age Groups. In Emerging Trends in ICT for Sustainable Development: The Proceedings of NICE2020 International Conference; Springer International Publishing: Cham, Switzerland, 2021; pp. 11–18. [Google Scholar]
Wang, T.; Deng, X.N. User characteristics, social media use, and fatigue during the coronavirus pandemic: A stressor–strain–outcome framework. Comput. Hum. Behav. Rep. 2022, 7, 100218. [Google Scholar] [CrossRef]
Sak, S.; Yavuzyiğit, B.B. Striving for wellbeing digitally in the city amidst the pandemic: Solidarity through Twitter in Ankara. Habitat Int. 2023, 137, 102846. [Google Scholar] [CrossRef]
Stitini, O.; Twil, A.; Kaloun, S.; Bencharef, O. How can we analyse emotions on twitter during an epidemic situation? A features engineering approach to evaluate people’s emotions during the COVID-19 pandemic. J. Tianjin Univ. Sci. Technol. 2021, 54. [Google Scholar] [CrossRef]
Sitaula, C.; Basnet, A.; Mainali, A.; Shahi, T.B. Deep learning-based methods for sentiment analysis on Nepali COVID-19-related tweets. Comput. Intell. Neurosci. 2021, 2021, 2158184. [Google Scholar] [CrossRef]
Klein, A.Z.; Kunatharaju, S.; O’Connor, K.; Gonzalez-Hernandez, G. Automatically Identifying Self-Reports of COVID-19 Diagnosis on Twitter: An Annotated Data Set, Deep Neural Network Classifiers, and a Large-Scale Cohort. J. Med. Internet Res. 2023, 25, e46484. [Google Scholar] [CrossRef]
Shahi, T.B.; Sitaula, C.; Paudel, N. A hybrid feature extraction method for Nepali COVID-19-related tweets classification. Comput. Intell. Neurosci. 2022, 2022, 5681574. [Google Scholar] [CrossRef]
Joloudari, J.H.; Hussain, S.; Nematollahi, M.A.; Bagheri, R.; Fazl, F.; Alizadehsani, R.; Lashgari, R.; Talukder, A. BERT-deep CNN: State of the art for sentiment analysis of COVID-19 tweets. Soc. Netw. Anal. Min. 2023, 13, 99. [Google Scholar] [CrossRef]
Hussain, S.; Ayoub, M.; Yu, Y.; Wahid, J.A.; Khan, A.; Moller, D.P.; Weiyan, H. Ensemble Deep Learning Framework for Situational Aspects-Based Annotation and Classification of International Student’s Tweets during COVID-19. Comput. Mater. Contin. 2023, 75, 5355–5377. [Google Scholar] [CrossRef]
Qorib, M.; Oladunni, T.; Denis, M.; Ososanya, E.; Cotae, P. COVID-19 Vaccine Hesitancy: A Global Public Health and Risk Modelling Framework Using an Environmental Deep Neural Network, Sentiment Classification with Text Mining and Emotional Reactions from COVID-19 Vaccination Tweets. Int. J. Environ. Res. Public Health 2023, 20, 5803. [Google Scholar] [CrossRef] [PubMed]
Umer, M.; Sadiq, S.; Nappi, M.; Sana, M.U.; Ashraf, I. ETCNN: Extra Tree and Convolutional Neural Network-based Ensemble Model for COVID-19 Tweets Sentiment Classification. Pattern Recognit. Lett. 2022, 164, 224–231. [Google Scholar] [CrossRef] [PubMed]
Sunagar, P.; Kanavalli, A.; Poornima, V.; Hemanth, V.M.; Sreeram, K.; Shivakumar, K.S. Classification of COVID-19 tweets using deep learning techniques. In Inventive Systems and Control: Proceedings of ICISC 2021; Springer: Singapore, 2021; pp. 123–136. [Google Scholar]
Arbane, M.; Benlamri, R.; Brik, Y.; Alahmar, A.D. Social media-based COVID-19 sentiment classification model using Bi-LSTM. Expert Syst. Appl. 2023, 212, 118710. [Google Scholar] [CrossRef] [PubMed]
Tatineni, P.; Babu, B.S.; Kanuri, B.; Rao, G.R.K.; Chitturi, P.; Naresh, C. March. Post COVID-19 Twitter user’s Emotions Classification using Deep Learning Techniques in India. In Proceedings of the 2021 International Conference on Artificial Intelligence and Smart Systems (ICAIS), Coimbatore, India, 25–27 March 2021; pp. 338–343. [Google Scholar]
Vaddadi, V.R.; Das, S.; Anupama, V. 2022, April. Exploration of COVID 19 Tweets Data for the Prediction of Negative Ontologies through Deep Learning Techniques. In Proceedings of the 2022 IEEE International Conference on Distributed Computing and Electrical Circuits and Electronics (ICDCECE), Ballari, India, 23–24 April 2022; pp. 1–6. [Google Scholar]
Chakraborty, A.K.; Das, S.; Kolya, A.K. Sentiment analysis of COVID-19 tweets using evolutionary classification-based LSTM model. In Proceedings of Research and Applications in Artificial Intelligence: RAAI 2020; Springer: Singapore, 2021; pp. 75–86. [Google Scholar]
Said, H.; Tawfik, B.S.; Makhlouf, M.A. A Deep Learning Approach for Sentiment Classification of COVID-19 Vaccination Tweets. Int. J. Adv. Comput. Sci. Appl. 2023, 14, 530–538. [Google Scholar] [CrossRef]
Raheja, S.; Asthana, A. Sentiment Analysis of Tweets During the COVID-19 Pandemic Using Multinomial Logistic Regression. Int. J. Softw. Innov. (IJSI) 2023, 11, 1–16. [Google Scholar] [CrossRef]
Mohammed, S.S.; Menaouer, B.; Zohra, A.F.F.; Nada, M. Sentiment analysis of COVID-19 tweets using adaptive neuro-fuzzy inference system models. Int. J. Softw. Sci. Comput. Intell. (IJSSCI) 2022, 14, 1–20. [Google Scholar] [CrossRef]
Jain, R.; Bawa, S.; Sharma, S. Sentiment analysis of COVID-19 tweets by machine learning and deep learning classifiers. In Advances in Data and Information Sciences: Proceedings of ICDIS 2021; Springer: Singapore, 2022; pp. 329–339. [Google Scholar]
Kaur, H.; Ahsaan, S.U.; Alankar, B.; Chang, V. A proposed sentiment analysis deep learning algorithm for analyzing COVID-19 tweets. Inf. Syst. Front. 2021, 23, 1417–1429. [Google Scholar] [CrossRef]
Ma, Z.; Mei, G. A hybrid attention-based deep learning approach for wind power prediction. Appl. Energy 2022, 323, 119608. [Google Scholar] [CrossRef]
Mumenin, K.M.; Biswas, P.; Khan, M.A.M.; Alammary, A.S.; Nahid, A.A. A Modified Aquila-Based Optimized XGBoost Framework for Detecting Probable Seizure Status in Neonates. Sensors 2023, 23, 7037. [Google Scholar] [CrossRef] [PubMed]
Available online: https://www.kaggle.com/competitions/sentimentanalysisof-covid-19-related-tweets/data?select=validation.csv (accessed on 12 March 2023).
Vaiyapuri, T.; Jagannathan, S.K.; Ahmed, M.A.; Ramya, K.C.; Joshi, G.P.; Lee, S.; Lee, G. Sustainable Artificial Intelligence-Based Twitter Sentiment Analysis on COVID-19 Pandemic. Sustainability 2023, 15, 6404. [Google Scholar] [CrossRef]
Singh, C.; Imam, T.; Wibowo, S.; Grandhi, S. A deep learning approach for sentiment analysis of COVID-19 reviews. Appl. Sci. 2022, 12, 3709. [Google Scholar] [CrossRef]

Figure 1. Workflow of MAOSDL-TC algorithm.

Figure 2. The architecture of the ASBiLSTM technique.

Figure 3. Performance of (a,b) confusion matrices, (c) PR_curve, and (d) ROC.

Figure 4. Average of MAOSDL-TC approach on 70:30 of the TR set/TS set.

Figure 5.

A c c u_{y}

curve of the MAOSDL-TC approach.

Figure 5.

A c c u_{y}

curve of the MAOSDL-TC approach.

Figure 6. Loss curve of the MAOSDL-TC approach.

Figure 7. Comparative outcome of MAOSDL-TC algorithm with recent methods.

Table 1. Description of database.

Class	No. of Instances
Optimistic	250
Thankful	250
Empathetic	250
Pessimistic	250
Anxious	250
Sad	250
Annoyed	250
Denial	250
Surprise	250
Official report	250
Joking	250
Total Number of Instances	2750

Table 2. COVID-19 tweet classification outcome of the MAOSDL-TC method on 70:30 of TR set/TS set.

Class	$A c c u_{y}$	$P r e c_{n}$	$R e c a_{l}$	$F_{S c o r e}$	Jaccard Index
TR set (70%)
Optimistic	99.17	96.51	94.32	95.40	91.21
Thankful	98.60	92.47	92.97	92.72	86.43
Empathetic	99.32	94.32	98.22	96.23	92.74
Pessimistic	99.22	97.11	94.38	95.73	91.80
Anxious	99.12	92.67	98.33	95.42	91.24
Sad	99.22	94.67	96.39	95.52	91.43
Annoyed	99.69	100.00	96.84	98.40	96.84
Denial	98.96	92.35	96.57	94.41	89.42
Surprise	99.43	100.00	93.45	96.62	93.45
Official report	99.12	92.44	97.55	94.93	90.34
Joking	99.22	99.38	92.00	95.55	91.48
Average	99.19	95.63	95.55	95.54	91.49
TS set (30%)
Optimistic	99.52	97.30	97.30	97.30	94.74
Thankful	99.39	98.39	93.85	96.06	92.42
Empathetic	99.03	92.94	97.53	95.18	90.80
Pessimistic	99.27	98.53	93.06	95.71	91.78
Anxious	99.76	100.00	97.14	98.55	97.14
Sad	99.39	96.47	97.62	97.04	94.25
Annoyed	99.64	96.72	98.33	97.52	95.16
Denial	99.39	96.05	97.33	96.69	93.59
Surprise	99.76	98.78	98.78	98.78	97.59
Official report	99.15	93.48	98.85	96.09	92.47
Joking	99.64	100.00	96.00	97.96	96.00
Average	99.45	97.15	96.89	96.99	94.18

Table 3. Comparative outcome of MAOSDL-TC algorithm with recent methodologies.

Methods	$A c c u_{y}$	$P r e c_{n}$	$R e c a_{l}$	$F_{S c o r e}$
RF	91.20	92.48	91.26	91.49
XGboost	91.40	91.69	91.82	91.20
SVM	90.95	90.39	90.32	90.25
Ensemble	93.19	94.32	93.15	93.96
DT	90.82	90.82	90.92	90.46
SFODLDSAC	98.50	96.15	95.13	95.15
MPONLP-TSA	99.10	96.74	95.88	95.90
MAOSDL-TC	99.45	97.15	96.89	96.99

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Almasoud, A.S.; Alshahrani, H.J.; Hassan, A.Q.A.; Almalki, N.S.; Motwakel, A. Modified Aquila Optimizer with Stacked Deep Learning-Based Sentiment Analysis of COVID-19 Tweets. Electronics 2023, 12, 4125. https://doi.org/10.3390/electronics12194125

AMA Style

Almasoud AS, Alshahrani HJ, Hassan AQA, Almalki NS, Motwakel A. Modified Aquila Optimizer with Stacked Deep Learning-Based Sentiment Analysis of COVID-19 Tweets. Electronics. 2023; 12(19):4125. https://doi.org/10.3390/electronics12194125

Chicago/Turabian Style

Almasoud, Ahmed S., Hala J. Alshahrani, Abdulkhaleq Q. A. Hassan, Nabil Sharaf Almalki, and Abdelwahed Motwakel. 2023. "Modified Aquila Optimizer with Stacked Deep Learning-Based Sentiment Analysis of COVID-19 Tweets" Electronics 12, no. 19: 4125. https://doi.org/10.3390/electronics12194125

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Modified Aquila Optimizer with Stacked Deep Learning-Based Sentiment Analysis of COVID-19 Tweets

Abstract

1. Introduction

2. Related Works

3. The Proposed Model

3.1. Data Pre-Processing and Word Embedding

3.2. Tweet Data Classification Using ASBiLSTM Model

3.3. Hyperparameter Tuning Using MAO Algorithm

4. Results and Discussion

4.1. Result Analysis

4.2. Discussion

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI