Multiple-Molecule Drug Repositioning for Disrupting Progression of SARS-CoV-2 Infection by Utilizing the Systems Biology Method through Host-Pathogen-Interactive Time Profile Data and DNN-Based DTI Model with Drug Design Specifications

Wang, Cheng-Gang; Chen, Bor-Sen

doi:10.3390/stresses2040029

Open AccessArticle

Multiple-Molecule Drug Repositioning for Disrupting Progression of SARS-CoV-2 Infection by Utilizing the Systems Biology Method through Host-Pathogen-Interactive Time Profile Data and DNN-Based DTI Model with Drug Design Specifications

by

Cheng-Gang Wang

and

Bor-Sen Chen

^*

Laboratory of Automatic Control, Signal Processing and Systems Biology, Department of Electrical Engineering, National Tsing Hua University, Hsinchu 30013, Taiwan

^*

Author to whom correspondence should be addressed.

Stresses 2022, 2(4), 405-436; https://doi.org/10.3390/stresses2040029

Submission received: 2 September 2022 / Revised: 19 October 2022 / Accepted: 20 October 2022 / Published: 3 November 2022

(This article belongs to the Special Issue SARS-CoV-2 and Stresses)

Download

Browse Figures

Versions Notes

Abstract

:

The coronavirus disease 2019 (COVID-19) pandemic has claimed many lives since it was first reported in late December 2019. However, there is still no drug proven to be effective against the virus. In this study, a candidate host–pathogen–interactive (HPI) genome-wide genetic and epigenetic network (HPI-GWGEN) was constructed via big data mining. The reverse engineering method was applied to investigate the pathogenesis of SARS-CoV-2 infection by pruning the false positives in candidate HPI-GWGEN through the HPI RNA-seq time profile data. Subsequently, using the principal network projection (PNP) method and the annotations of the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway, we identified the significant biomarkers usable as drug targets for destroying favorable environments for the replication of SARS-CoV-2 or enhancing the defense of host cells against it. To discover multiple-molecule drugs that target the significant biomarkers (as drug targets), a deep neural network (DNN)-based drug–target interaction (DTI) model was trained by DTI databases to predict candidate molecular drugs for these drug targets. Using the DNN-based DTI model, we predicted the candidate drugs targeting the significant biomarkers (drug targets). After screening candidate drugs with drug design specifications, we finally proposed the combination of bosutinib, erlotinib, and 17-beta-estradiol as a multiple-molecule drug for the treatment of the amplification stage of SARS-CoV-2 infection and the combination of erlotinib, 17-beta-estradiol, and sertraline as a multiple-molecule drug for the treatment of saturation stage of mild-to-moderate SARS-CoV-2 infection.

Keywords:

SARS-CoV-2; COVID-19; systems biology; reverse engineering; principal network projection (PNP); drug–target interaction (DTI) model; multiple-molecule drug; drug repositioning; deep learning

1. Introduction

In late December 2019, China reported cases of pneumonia to the World Health Organization (WHO). In January 2020, the WHO named this disease coronavirus disease 2019 (COVID-19). As of 21 June 2022, there have been more than 500 million confirmed cases of COVID-19 and more than 6 million deaths [1]. COVID-19 is caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). The high transmissibility and clinical severity of COVID-19 rapidly caused a global health crisis. In most cases, the symptoms of COVID-19 are mild. The most commonly reported symptoms are headache (34–70%), myalgia (36–63%), fatigue (63%), cough (50.3–63.2%), and fever (43–45%) [2,3]. However, some cases result in an acute course and complications. So far, the pathogenesis of COVID-19 has not been completely clarified, and more than two years after the beginning of the COVID-19 outbreak, there is still no drug proven to be effective against it.

SARS-CoV-2 is a positive-sense single-stranded RNA virus and encodes approximately 29 proteins, including four structural proteins [4,5,6]. Some proteins of SARS-COV-2 have been confirmed to interact with host proteins and affect host gene expression [7,8]. In addition to genetic regulation, epigenetic regulation is also important to the viral infection [9,10,11]. MicroRNA (miRNA) can cause RNA silencing and post-transcriptional repression of gene expression [12,13]. They can regulate various cellular activities, including apoptosis, cell growth, and differentiation [14]. Long noncoding RNA (lncRNA), a class of non-coding RNA, can be an inhibitory regulator of miRNA [15,16] or involved in transcriptional regulation. They are involved in different regulatory mechanisms during viral infection [17].

For new drug discovery, the global pharmaceutical industry faces high attrition rates [18], high costs increasing with time, and changing regulatory requirements. These risks cause investors to be less willing to invest in the pharmaceutical industry. Therefore, drug repositioning (also called drug repurposing) is proposed [19,20]. Drug repositioning refers to the identification of new usages for investigational or approved drugs which are outside the scope of the original medical indication. However, the experimental methods to verify the drug–target interaction (DTI) are extremely expensive and time-consuming. It is necessary that the systems biology method be employed to investigate pathogenic mechanisms to identify significant drug targets, and this should be followed by computational methods based on the DTI model to predict the drugs for significant drug targets on a large scale to reduce the high cost and development time, especially during the COVID-19 outbreak. Moreover, combination therapy for multiple proteins (drug targets) using more than one medication or modality has been used for many diseases, including various cancers [21] and infections [22]. It has been proposed for the treatment of COVID-19 in some research [23,24]. Veklury (Remdesivir) is an FDA-approved drug for mild-to-moderate COVID-19. However, the efficiency of Remdesivir is still debatable [25,26,27]. Olumiant (Baricitinib), the other FDA-approved drug, is used in the treatment of COVID-19 in hospitalized adults who require supplemental oxygen, invasive or non-invasive mechanical ventilation, or extracorporeal membrane oxygenation (ECMO—that is, Baricitinib is not used for mild-to-moderate COVID-19).

In this study, we first employed the systems biology method via host–pathogen–interactive (HPI) RNA-seq time profile data to investigate the pathogenesis of COVID-19 in order to identify significant biomarkers as drug targets and treat the SARS-CoV-2 infection at the amplification and saturation stages. Recently, a deep neural network (DNN)-based DTI model that targets proteins to significantly improve the prediction of DTI compared to conventional DTI models has been introduced by a deep learning algorithm through the feature vectors of molecular drugs [28,29]. Therefore, a DNN-based DTI model was trained by DTI databases [30,31,32,33,34] to efficiently predict candidate molecular drugs for each significant biomarker (drug target) of the amplification and saturation stages of SARS-CoV-2 infection. Then, based on drug design specifications, in order to prune some candidate drugs, a set of candidate molecular drugs combined as a multiple-molecule drug was proposed as a potential combination therapy to target these significant biomarkers at the amplification and saturation stages to disrupt the progression of SARS-CoV-2 infection. Therefore, the systematic procedure of repositioning a multiple-molecule drug for disrupting the progression of SARS-CoV-2 infection by utilizing the systems biology method through HPI RNA-seq data and DNN-based DTI model with drug design specifications are described as follows: (1) constructing the candidate HPI genome-wide genetic and epigenetic network (HPI-GWGEN) using big data mining from databases [35,36,37,38,39,40,41,42,43,44,45,46]; (2) system identification and system order selection to eliminate the false positives in the candidate HPI-GWGEN to obtain the real HPI-GWGEN using the system models and HPI RNA-seq time profile data; (3) applying the principal network projection (PNP) method to construct the core HPI-GWGEN and obtain the core HPI signaling pathways and their abnormal downstream cellular functions of SARS-CoV-2 infection using annotations of the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathways [47,48]; (4) investigating and comparing the pathogenic mechanisms between the amplification and saturation stages of SARS-CoV-2 infection to identify significant biomarkers as drug targets against SARS-CoV-2 infection at the amplification and saturation stages; and (5) predicting the candidate drugs for significant biomarkers using a DNN-based DTI model and selecting potential multiple-molecule drugs for the therapeutic treatment of the amplification and saturation stages of SARS-CoV-2 infection with three drug design specifications, i.e., regulation ability, toxicity, and sensitivity. We finally proposed the combination of bosutinib, erlotinib, and 17-beta-estradiol as a multiple-molecule drug for the treatment of the amplification stage of SARS-CoV-2 infection and the combination of erlotinib, 17-beta-estradiol, and sertraline as a multiple-molecule drug for the treatment of the saturation stage of mild-to-moderate SARS-CoV-2 infection.

2. Results

2.1. Core HPI Signaling Pathways during Amplification and Saturation Stage of SARS-CoV-2 Infection by the Systems Biology Method

To find the molecular drugs for disrupting SARS-CoV-2 infection, systematic methods including reverse engineering (i.e., reversing the infectious process of SARS-CoV-2) were first used to construct the real HPI-GWGEN by the genome-wide HPI RNA-seq time-profile data, and the PNP method was applied to extract the core HPI-GWGEN for analysis. With the annotation of KEGG pathways, the core HPI-GWGEN was annotated as the core signal pathways to identify significant biomarkers as drug targets for the pathogenesis of SARS-CoV-2 infection at the amplification and saturation stages (note that the amplification and saturation stages are defined according to the RNA-seq time profile data in Section 4.2.1). After training the DNN-based DTI model, we predicted the potential multi-molecule drugs that target the significant biomarkers (drug targets) against SARS-CoV-2 infection at the amplification and saturation stages based on the DNN-based DTI model and three drug design specifications, respectively. The procedure of the systems biology method and drug discovery is shown in Figure 1.

The number of nodes and edges in candidate HPI-GWGEN and real HPI-GWGENs of amplification and saturation infectious stages are shown in Table 1. The significant pruning of edges between candidate and real HPI-GWGEN implies that the false-positive edges in candidate HPI-GWGEN are eliminated by the system order selection. The real HPI-GWGENs of amplification and saturation infectious stages are respectively shown in Figure A1, which are represented with the aid of Cytoscape [49].

In addition to the significant deletion of false-positive edges, the PNP method was applied to evaluate the significance of each node to the network matrix according to its projection value. The core HPI-GWGENs in the amplification and saturation stages were built by extracting the top-6000-significance nodes from the corresponding real HPI-GWGENs of the amplification and saturation infectious stages. The core HPI-GWGENs of amplification and saturation infectious stages are shown in Figure 2 and Appendix A, which were represented with the aid of Cytoscape [49]. The top-6000-significance nodes can be uploaded to the database for annotation, visualization, and integrated discovery (DAVID) functional annotation tool [47,48] to help investigate the pathogenic mechanism of the amplification and saturation stages of SARS-CoV-2 infection. The enrichment analysis of KEGG pathways for amplification and saturation infectious stages are shown in Table 2 and Table 3, respectively. The enrichment analysis of KEGG may imply what pathways are involved in each infection stage. We try to find the pathogenetic molecular mechanism for the viral amplification in early infection and the pathogenetic molecular mechanism for the viral saturation in late infection.

With the help of the enrichment analysis of KEGG and the exiting literature, the core host–pathogen interactive signaling pathways and their downstream abnormal cellular functions of SARS-CoV-2 infection at the amplification and saturation stages are shown in Figure 3. The core signaling pathways with red background in the middle of Figure 3 are common between the amplification and saturation stages of SARS-CoV-2 infection; the specific signaling pathways on the left-hand-side are the amplification stage of SARS-CoV-2 infection; and the specific signaling pathways on the right-hand-side are the saturation stage of SARS-CoV-2 infection.

2.2. Investigation of Specific Core HPI Signaling Pathways and Their Downstream Abnormal Cellular Functions during SARS-CoV-2 Infection

2.2.1. Investigation of Specific Core HPI Signaling Pathways in Amplification Infectious Stage

According to the core HPI signaling pathways of SARS-CoV-2 infection, several downstream abnormal cellular functions were identified in Figure 3. We try to investigate the progression of SARS-CoV-2 infection. In specific core signaling pathways of the amplification infectious stage in the left column of Figure 3, AP2M1 and AAK1 were identified to be associated with endocytosis. AAK1 can phosphorylate AP2M1, which promotes receptor-mediated endocytosis to help with viral entry [50].

The receptor EGFR was identified to activate MAPK1 by signaling through SOS1, RRAS2, BRAF, and MEK1. The activation of receptor EGFR may be caused by the ligand HB-EGF. For coronavirus infection, the upregulation of ligand HB-EGF, which can bind to the receptor EGFR and ERBB4, has been observed [51]. MAPK1 is the pivotal component to activate the downstream cellular function of receptor EGFR. MAPK1 suppresses the TF FOXO3 by phosphorylation and activates MNK1 and TF FOS. The target gene of TF FOXO3, BCL2L11, encodes the Bcl-2-like protein 11 (BIM) and acts as an apoptotic activator [52]. Lots of pathogens may interfere in the host’s cellular functions to establish the proper environment for replication. Because viruses, including SARS-CoV-2, need to use the host cell for replication, the apoptosis of the host cell in early infection may be not desirable for it [53]. Thus, the suppression of BCL2L11 causes the survival of the host cell, which is necessary for the replication of SARS-CoV-2. MNK1 activates the translation initiation factor EIF4E. EIF4E can direct the ribosomes to bind to the 5′ cap of mRNA. It has been reported that the inhibition of the interaction between viral mRNA and EIF4E can suppress the replication of SARS-CoV-2 [54], which implies that the activation of EIF4E is also a desirable event for the replication of SARS-CoV-2. The activation of TF FOS induces the target genes CCL2 and IL12. CCL2 belongs to cytokine genes and is involved in inflammation and immunoregulatory processes. It exhibits a chemotactic activity for basophils and monocytes, which can help the host cell defend the SARS-CoV-2 virus. In addition to activating MAPK1, receptor EGFR also interacts with PIK3C2A and NUMB. Recently, the activation of PIK3C2A has been proven to act both in the control of receptor endocytosis and resensitization [55], and the activation of receptor EGFR can increase the activity of PIK3C2A [56]. The activation of endocytic adaptor protein NUMB has been reported to promote the recycling of EGFR [57]. The recycling and resensitization may enhance the activation of downstream genes of receptor EGFR.

The receptor ERBB4 is also activated by the binding of HB-EGF. After interacting with HB-EGF, receptor ERBB4 activates PIK3CB and then activates the AKT1. AKT1 can suppress TF FOXO3 and activate MTOR, CHUK, and TF CREB1. The phosphorylation of MTOR can activate the S6K1. The activation of S6K1 induces protein synthesis, including the viral protein at the ribosome [58]. The activation of TF CREB1 by AKT1 induces the target gene BCL2, which can encode an outer mitochondrial membrane protein to block the apoptosis of cells. Therefore, the expression of BCL2 can promote the survival of the infected cells. As mentioned above, the survival of infected cells may help in the replication of SARS-CoV-2. The TF CREB1 is also activated by lncRNA MALAT1. The high expression of MALAT1 has been reported to be differentially expressed in severe COVID-19 [59]. A previous study has shown that the lncRNA MALAT1 can maintain the phosphorylation of TF CREB1 for continuous CREB1 signaling activation [60]. The lncRNA MALAT1 has also been found to regulate the miRNA MIR106B. The miRNA MIR106B is silenced by MALAT1 and induces the target gene of MIR106B [61]. Here, the identified target gene of miRNA MIR106B is CDKN1A [62], which can block cell cycle progression by inhibiting the cyclin-dependent kinase (CDK). Lots of viruses can induce the host cell cycle arrest to produce resources and construct a proper environment for viral replication to increase replication efficiency [63].

In summary, the activated EGFR is involved in the endocytosis which may help the entry of virus and establishes a favorable environment, including the suppression of pro-apoptotic protein and the activation of eukaryotic translation initiation factor. Furthermore, receptor EGFR has been shown to act as a cofactor for the internalization of several viruses [64,65]. Endocytosis is also required for the entry of SARS-CoV-2 into the host cell. To disrupt the proper environment for viral replication, we choose EGFR and AKT1, as significant biomarkers, as drug targets for the specific pathogenesis of the amplification stage to destroy the favorable environment.

2.2.2. Investigation of Common Core HPI Signaling Pathways of Amplification and Saturation Infectious Stages

Some signaling pathways were identified in both the amplification and saturation stages of SARS-CoV-2 infection. The common signaling pathways of amplification and saturation infectious stages are shown in the middle column of Figure 3 with a red background. The receptor TLR2 has been shown to detect the SARS-CoV-2 envelope (E) protein as its ligand [66]. It is a common pathogen recognition receptor that activates the host’s innate immunity. After detecting the SARS-CoV-2 E protein, TF NFKB1 is activated by signaling through MYD88, IRAK4, TRAF6, TAK1, and CHUK. The activation of TF NFκ-B induces the target genes CCL2, IL1A, TNF, and IL12A. All of them are involved in the inflammation and innate immune responses [67,68,69] to defend against SARS-CoV-2 infection. TNF may also be implicated in apoptosis [70]. It is the ligand of receptor TNFR1 as well. In addition to the activation of TF NFκ-B as a defense response of the host cell, melanoma differentiation-associated protein 5 (MDA5) can detect replicative intermediates of both positive- and negative-strand RNA viruses [71]. After detecting the virus, TF IRF3 is phosphorylated through signaling proteins MDA5, MAVS, TRAF3, and TBK1, which can induce the target gene IFNB1. The IFNB1 can be the ligand of the receptor IFNAR1. The infected host cells will release interferons (IFN) and let nearby cells enhance their anti-viral defenses. Like other viruses [72], SARS-CoV-2 also interrupts the host cell’s immune system. The SARS-CoV-2 membrane (M) protein was identified to inhibit the production of IFN. A recent study has shown that SARS-CoV-2 M protein can prevent nuclear translocation of TF IRF3 and inhibit the phosphorylation of IRF3 [73].

The host cell cycle is also interfered with by the SARS-CoV-2 protein. The receptor TGFBR1 was identified to activate TF SMAD3, which may be caused by the ligand TGF-β. The upregulation of ligand TGF-β has been observed in SARS-CoV-2 infection [74,75]. The phosphorylation of SMAD3 induces the target genes CDKN1A, CDKN2B, and SERPINE1. Both CDKN1A and CDKN2B are CDK inhibitors to disrupt the cell cycle’s progression. As mentioned above, cell cycle arrest is a desirable event for viral replication. SARS-CoV-2 nucleocapsid (N) protein was identified to interact with TF SMAD3 to enhance the downstream target genes, which was consistent with previous research [76]. In addition to causing cell cycle arrest, TF SMAD3 is also involved in coagulopathy by inducing the target gene SERPINE1. The overexpression of SERPINE1 has been reported to play an important role in COVID-19-associated coagulopathy, leading to acute respiratory distress syndrome (ARDS) [77]. Coagulopathy may be fatal, so TF SMAD3 should be suppressed.

Briefly, the abnormal suppression of IRF3, which induces IFNB1, by the SARS-CoV-2 M protein and the overactivation of TF SMAD3 establishes the proper environment for the replication of SARS-CoV-2. Therefore, we choose IFNB1 and SMAD3 as significant biomarkers (drug targets) for the overlapping pathogenesis of the amplification and saturation stages.

2.2.3. Investigation of Specific Core HPI Signaling Pathways at Saturation Infectious Stage

In the saturation infectious stage, the mRNA level of the virus is decreased. Based on signaling pathways in the right column of Figure 3, we investigate the host defense mechanism against SARS-CoV-2 infection. The inducted proteins of target genes TNF and IFNB1 may be the ligands. While receiving the ligand TNF-α, the downstream proteins of receptor TNFR1 are activated. The activation of receptor TNFR1 can activate TRAF2 and FADD through signaling protein TRADD. After activation of TRAF2, TF JUN is phosphorylated via signaling proteins MEKK1/SEK1/JNK3, which induce the target genes TNF, IL12A, CXCL10, and TP53. The target gene CXCL10 is a pro-inflammatory cytokine that is involved in lots of processes such as apoptosis, chemotaxis, and the activation of peripheral immune cells [78,79,80]. The induction of TP53 can induce the target gene BAX. BAX is a pro-apoptosis protein that has been shown to be involved in P53-mediated apoptosis [81]. The other apoptotic pathway activated by the receptor TNFR1 is through FADD, which activates the BH3 interacting-domain (BID) protein through signaling protein CASP10 [82]. BID is also a pro-apoptosis protein. As mentioned above, the apoptosis of the infected host cells can reduce the replication of virus because the virus must employ the living host cell for replication.

Another specific signaling pathway of the saturation infectious stage is the production of ISG. With the binding of ligand IFN-β, receptor IFNAR1 interacts with TYK2. TYK2 can stimulate the phosphorylation of STAT1 to interact with TF IRF1 and TF IRF9. TF IRF1 induces the target gene IFNB1 and TF IRF9 targets genes ISG15 and MX1. Interferon-stimulated gene 15 (ISG15) is important to host cells against virus infection. It is involved in several cellular functions, including the limit of the newly synthesized virus proteins, induction of natural killer cell proliferation, and the enhancement of lytic capabilities of lymphokine-activated killer-like cells [83]. The target gene MX1 can antagonize the replication process of several different RNA and DNA viruses. Recent research has shown that MX1 is upregulated after SARS-CoV-2 infection, that MX1 has a direct effect on the viral ribonucleoprotein complex, and that its GTPase activity is essential for its antiviral function [84].

In summary, for the specific core HPI signaling pathways in the saturation infectious stage, the virus mRNA level begins to reduce. The activation of TNF and IFN signaling pathways may contribute to the host cell defense against SARS-CoV-2 infection. Furthermore, the enhancement of IFNB1 in common core HPI signaling pathways potentially enhances the IFN signaling pathway. Thus, the TF JUN is picked as the drug target for the specific pathogenesis in the saturation infectious stage to enhance the defense of SARS-CoV-2 infection.

2.3. Multiple-Molecule Drug Discovery and Design by DNN-Based DTI Model with Drug Design Specifications

After investigating the core HPI signaling pathways and their downstream abnormal cellular functions for SARS-CoV-2 infection at the amplification and saturation stages in the above subsection, we finally chose EGFR, AKT1, IFNB1, and SMAD3 as the drug targets for the amplification stage of SARS-CoV-2 infection and picked IFNB1, SMAD3, and JUN as the drug targets for saturation stage of SARS-CoV-2 infection. The purpose of drug targets for the amplification infectious stage is mostly to reduce the interferences of the host cell by SARS-CoV-2. The drug target JUN for therapy in the saturation infectious stage is to shorten the course of the SARS-CoV-2 infection by reducing the virus’s replication. Subsequently, the DNN-based DTI model is constructed and trained by DTI databases to predict the candidate repurposing drugs that target these significant biomarkers (drug targets). Finally, the potential multiple-molecule drug is proposed for SARS-CoV-2 infection by screening the candidate repurposing drugs with three drug design specifications. The architecture of the DNN-based DTI model for DTI prediction is shown in Figure 4.

2.3.1. Prediction Performance of DNN-Based DTI Model

The model was trained by Keras with a batch size = 64, epoch = 200 (with Early Stopping), and Adam optimizer (default arguments). A 10-fold cross-validation was applied to estimate the prediction performance of the DNN-based DTI model, which is shown in Table 4. The learning process (loss and accuracy of training and validation) is visualized in Figure 5 (the early stop at epoch = 117 to avoid overfitting). Finally, the receiver operating characteristic (ROC) curve is plotted in Figure 6, and the area under the ROC curve (AUC) is 0.991, which implies that the model has good ability to distinguish between positive and negative interactions.

2.3.2. Multiple-Molecule Drug Repositioning for Disrupting the Progression of SARS-CoV-2 Infection

After training the DNN-based DTI model, we can predict the candidate drugs which target the significant drug targets found by the systems biology method. The regulation ability, toxicity, and sensitivity of the drug are considered as drug design specifications for screening the candidate drugs. The regulation ability indicates the upregulation (>0) or downregulation (<0) of the drug target interaction. Therefore, if a drug target is upregulated during infection process, we need to select a drug to downregulate it, and vice versa. A small value of sensitivity indicates a lower sensitivity to chemical perturbation for the cell, and a higher LC50 implies a lower toxicity for the body. Thus, to upregulate a target gene, we tend to find a molecular drug with a larger regulation ability, a small value of sensitivity, and a larger LC50 value. Based on these drug design specifications, some candidate drugs for the significant drug targets are shown in Table 5. Finally, the potential multiple-molecule drug shown in Table 6 is proposed for the treatment of the amplification stage of SARS-CoV-2 infection, and the potential multiple-molecule drug shown in Table 7 is proposed for the saturation stage of mild-to-moderate SARS-CoV-2 infection.

3. Discussion

After two years of the COVID-19 pandemic, many drugs have been used to treat COVID-19. Nevertheless, there is still no drug proven effective against COVID-19 and no clear mechanism for how SARS-CoV-2 affects the host cell to replicate efficiently. We first constructed the dynamic models to describe the candidate HPI-GWGEN and used HPI RNA-seq time-profile data to identify the core signaling pathways of the HPI-GWGEN during SARS-CoV-2 infection. Then, we explored the possible systematic molecular mechanism of why SARS-CoV-2 can replicate rapidly by the reverse engineering method. Although the treatment of cytokine storm (the main cause of mortality in COVID-19) is a crucial issue, the suppression of viral load is also important. For viral infection, the drugs can be designed to target the host or virus. The drugs targeting the virus are designed to target viral proteins such as viral proteases to inhibit their biological function. The rapid mutation of viruses may cause the failure of the drugs that target the virus, especially RNA viruses. The variant of viruses may not only increase the transmissibility but also cause the failure of vaccines and the drugs targeting the viral protein. Although the design of drugs targeting the virus is more popular, antiviral drugs targeting host proteins have also been used for other viral infections. For instance, the commercial drug interferon alpha has been shown to be effective against the viral infection of hepatitis B and C virus, papillomavirus (Kaposi’s sarcoma) virus, and human herpesvirus 8 [85]. The drug Nitazoxanide shows antiviral activity for some viruses by targeting the host protein and affecting the host cellular functions [85,86]. Thus, we try to find the drugs that target the host proteins to reduce the risk of the variant of SARS-CoV-2. With the temporal information of HPI RNA-seq time-solved data, we could investigate the core HPI signaling pathways of SARS-CoV-2 infection, as shown in Figure 3, to identify the significant drug targets for the amplification and saturation infectious stages. Then, we utilized the DNN-based DTI model to predict the candidate drugs which can induce or inhibit the significant biomarkers (drug targets). After screening drug design specifications, we suggested the multiple-molecule drug consisting of bosutinib, erlotinib, and 17-beta-estradiol for the treatment of the amplification stage of SARS-CoV-2 infection and the multiple-molecule drug consisting of erlotinib, 17-beta-estradiol, and sertraline for the treatment of the saturation stage of mild-to-moderate SARS-CoV-2 infection.

In the multiple-molecule drug for the amplification stage of SARS-CoV-2 infection, Bosutinib, a synthetic quinolone derivative and tyrosine kinase inhibitor, is commonly used for the treatment of chronic myeloid leukemia. It has been shown that Bosutinib can inhibit the activation of EGFR and induce the apoptosis of cells [87]. In addition to the inhibition of EGFR activation, it can inhibit the activation of Akt as well [87]. Erlotinib, a quinazoline derivative, is a common EGFR inhibitor that is used to treat cancer, especially for EGFR mutation-positive, non-small-cell lung cancer [88]. In addition to the inhibition of EGFR, a previous study has also shown that Erlotinib can inhibit the activation of Smad2/3 [89]. Furthermore, Erlotinib was reported to prevent fibrosis development in in vivo models [51], and it was proposed as a therapeutic agent in the treatment of COVID-19 [90]. For the drug targeting IFNB1, we predicted 17-beta-estradiol. 17-beta-estradiol is a synthetic form of estradiol, a steroid sex hormone, which may be involved in inflammation and the immune [91]. A previous study shows that 17-beta-estradiol can induce IFN, especially IFN-β, via the activation of IRF1 [92]. 17-beta-estradiol is also reported to suppress the phosphorylation of Smad2 and Smad3 and reduce their gene reporter activity in response to TGF-beta [93]. Interestingly, many observations have shown that the mortality rate of men is higher than women [94,95,96]. Estradiol may be a protective role against COVID-19 [97]. Hence, it was also proposed as a therapy of COVID-19 [98]. Here, we think that it can induce IFNB1 and suppress the phosphorylation of SMAD3 to achieve protection against COVID-19. In the multiple-molecule drug for the saturation stage of SARS-CoV-2 infection, Sertraline, a selective serotonin reuptake inhibitor used in the treatment of depression, can upregulate JUN and induce apoptosis [99,100,101]. The apoptosis of infected cells is important to clean the viruses in infected cells. In addition to the upregulation of JUN, the anticoagulant property of Sertraline has been reported as well [102].

In summary, we applied the systems biology method to clearly understand the molecular mechanism of rapid replication of SARS-CoV-2. Subsequently, we identified the significant biomarkers as drug targets to destroy the proper microenvironment for the replication of SARS-CoV-2 or enhance the defense of host cells against SARS-CoV-2. EGFR, AKT1, IFNB1, and SMAD3 are chosen as the drug targets for the amplification stage of SARS-CoV-2 infection, and IFNB1, SMAD3, and JUN are picked out as the drug targets for the saturation stage of SARS-CoV-2 infection. After training the DNN-based DTI model with DTI databases, we could predict the potential molecular drugs with three design specifications for the treatment of SARS-CoV-2 infection. Finally, we proposed the combination of bosutinib, erlotinib, and 17-beta-estradiol as the multiple-molecule drug for the treatment of the amplification stage of SARS-CoV-2 infection and the combination of erlotinib, 17-beta-estradiol, and sertraline as the multiple-molecule drug for the treatment of the saturation stage of mild-to-moderate SARS-CoV-2 infection.

4. Materials and Methods

4.1. Construction of the Candidate HPI-GWGEN Using Big Data Mining

The HPI-GWGEN contains two networks: the HPI protein–protein interaction network (HPI-PPIN) and the HPI gene regulation network (HPI-GRN). Both of them can be further classified into the host intraspecies network, host–pathogen interspecies network, and pathogen intraspecies network. In the candidate HPI-GWGEN, we are only concerned with whether proteins, genes, miRNAs, or lncRNAs in the candidate HPI-GWGEN have existing interactions or regulations. This can be expressed by a Boolean matrix.

The host intraspecies of candidate HPI-PPINs were constructed using the data from some databases, including the Database of Interacting Proteins (DIP) [46], the Biological General Repository for Interaction Datasets database (BioGRID) [45], the Biomolecular Interaction Network Database (BIND) [44], the IntAct Molecular Interaction Database (IntAct) [43], and the Molecular INTeraction Database (MINT) [41]. The pathogen intraspecies and host–pathogen interspecies of candidate PPIN were constructed from BioGRID [45], IntAct [43], and UniProt [42].

The databases for the construction of the host intraspecies of candidate HPI-GRNs included TargetScan [40], CircuitsDB [39], and starBase v2.0 [38] for epigenetic regulation (miRNA and lncRNA) and the Human Transcriptional Regulation Interactions database (HTRIdb) [35], the Transcription Factor database (TRANSFAC) [37], and Integrated Transcription Factor Platform database (ITFP) [36] for other regulations (transcription factors). There are currently not enough regulations between humans and SARS-Cov-2 to construct an HPI-GRN. We first supposed that the regulations of the host on the virus genes are not negative. We then used systematic methods to eliminate the false positives.

4.2. System Identification of HPI-GWGEN Using HPI RNA-Seq Time-Profile Data

4.2.1. HPI RNA-Seq Time-Profile Data

To find the crosstalk between humans and SARS-CoV-2, the dynamic models for the candidate HPI-GWGEN were constructed, and HPI RNA-seq time-profile data were utilized to present the expressions of HPI-GWGEN during the amplification and saturation infectious stages for these HPI-GWGEN models. The dynamic models can describe the candidate HPI-GWGEN through the reverse engineering method [103] using HPI RNA-seq time-profile data to reflect the system behavior of HPI-GWGEN during SARS-CoV-2 infection.

The HPI RNA-seq data could be downloaded from the National Center for Biotechnology Information (NCBI) (GEO number: GSE163547) [104]. We found the average for two samples infected with the MOI of 0.25 at 0, 4, 24, 48, 72, and 96 h post-infection (hpi). The genome-wide HPI RNA-seq time-profile data were employed to identify the system parameters of candidate HPI-GWGENs using the system identification method [103]. Since the mRNA level of SARS-CoV-2 majorly increased with time, reached the peak at 48 hpi, and slowly decreased (saturation) after 48 hpi, we defined the period from 0 to 48 hpi as the (viral) amplification stage and the period from 24 to 96 hpi as the (viral) saturation stage. With the Gencode v35/v27 annotation, the nodes were sorted into six types to be adopted for the dynamic models: host proteins, virus proteins, host genes, host miRNAs, host lncRNAs, and virus genes.

4.2.2. Dynamic Models for HPI-GWGEN

For the candidate HPI-PPIN in candidate HPI-GWGEN, the expression levels of host proteins and interactive pathogen proteins can be modeled as the following dynamic equations [103]:

p_{i}^{H} (t + 1) = p_{i}^{H} (t) + \sum_{h = 1}^{H_{i}} C_{h i}^{p H H} p_{i}^{H} (t) p_{h}^{H} (t) + \sum_{v = 1}^{V_{i}} C_{v i}^{p P H} p_{i}^{H} (t) p_{v}^{P} (t) + α_{H i} g_{i}^{H} (t) - γ_{H i} p_{i}^{H} (t) + β_{H i} + n_{H i} (t) α_{H i} \geq 0 a n d - γ_{H i} \leq 0, for i = 1, 2, \dots, I

(1)

where

p_{i}^{H} (t)

,

p_{h}^{H} (t)

,

p_{v}^{P} (t)

, and

g_{i}^{H} (t)

represent the expression level of the

i

th host protein, the

h

th host protein, the

v

th pathogen protein, and the

i

th host gene at time t, respectively;

H_{i}

and

V_{i}

are the number of the host proteins and the pathogen proteins that interact with the

i

th host protein, respectively;

C_{h i}^{p H H}

and

C_{v i}^{p H P}

specify the interactive ability between the

h

th host protein and the

i

th host protein and between the

v

th pathogen protein and the

i

th host protein, respectively;

α_{H i}

,

- γ_{H i}

, and

β_{H i}

indicate the translation rate from the corresponding mRNA, the degradation rate, and the basal activity level of the

i

th host protein, respectively; the basal level denotes the unknown or unavailable interaction such as phosphorylation;

n_{H i} (t)

is the stochastic noise of the

i

th host protein at time t;

I

is the total number of host proteins in candidate HPI-PPIN.

The dynamic interaction models of pathogen proteins of HPI-PPIN in the candidate HPI-GWGEN can be described as the following discrete-time dynamic equations [103]:

p_{k}^{P} (t + 1) = p_{k}^{P} (t) + \sum_{h = 1}^{H_{k}} C_{h k}^{p H P} p_{k}^{P} (t) p_{h}^{H} (t) + \sum_{v = 1}^{V_{k}} C_{v k}^{p P P} p_{k}^{P} (t) p_{v}^{P} (t) + α_{P k} g_{k}^{P} (t) - γ_{P k} p_{k}^{P} (t) + β_{P k} + n_{P k} (t) α_{P k} \geq 0 a n d - γ_{P k} \leq 0, for k = 1, 2, \dots, K

(2)

where

p_{k}^{P} (t)

,

p_{h}^{H} (t)

,

p_{v}^{P} (t)

, and

g_{k}^{P} (t)

denote the expression level of the

k

th pathogen protein, the

h

th host protein, the

v

th pathogen protein, and the

k

th pathogen gene at time t, respectively;

H_{k}

and

V_{k}

are the number of the host proteins and the pathogen proteins that interact with the

k

th pathogen protein in a candidate HPI-PPIN, respectively;

C_{h k}^{p H P}

and

C_{v k}^{p P P}

specify the interactive ability between the

h

th host protein and the

k

th pathogen protein and between the

v

th pathogen protein and the

k

th pathogen protein, respectively;

α_{P k}

,

- γ_{P k}

, and

β_{P k}

indicate the translation rate from the corresponding mRNA, the degradation rate, and the basal activity level of the

k

th pathogen protein, respectively;

n_{P k} (t)

represents the stochastic noise of the

k

th pathogen protein at time t;

K

is the total number of pathogen proteins in a candidate HPI-PPIN.

The dynamic regulatory models of host genes in the HPI-GRN of a candidate HPI-GWGEN can be depicted as the following discrete-time dynamic equations [103]:

g_{o}^{H} (t + 1) = g_{o}^{H} (t) + \sum_{h = 1}^{H_{o}} C_{h o}^{T G} p_{h}^{H} (t) + \sum_{μ = 1}^{U_{o}} C_{μ o}^{M G} g_{o}^{H} (t) m_{μ} (t) + \sum_{λ = 1}^{L_{o}} C_{λ o}^{L G} l_{λ} (t) - γ_{G o} g_{o}^{H} (t) + β_{G o} + n_{G o} (t), C_{μ o}^{M G} \leq 0 a n d - γ_{G o} \leq 0 for o = 1, 2, \dots, O

(3)

where

g_{o}^{H} (t)

,

p_{h}^{H} (t)

,

m_{μ} (t)

, and

l_{λ} (t)

denote the expression level of the

o

th host gene, the

h

th host TF, the

μ

th host miRNA, and the

λ

th host lncRNA at time t, respectively;

H_{o}

,

U_{o}

, and

L_{o}

represent the number of host TFs, host miRNAs, and host lncRNAs that have regulation on the

o

th host gene, respectively;

C_{h o}^{T G}

,

C_{μ o}^{M G}

, and

C_{λ o}^{L G}

specify the regulation ability of the

h

th host TF, the

μ

th host miRNA, and the

λ

th host lncRNA on the

o

th host gene, respectively;

- γ_{G o}

and

β_{G o}

indicate the degradation rate and the basal activity level of the

o

th host gene, respectively;

n_{G o} (t)

is the stochastic noise of the

o

th host gene at time t;

O

is the total number of host genes in a candidate HPI-GRN.

The dynamic regulatory models of the host miRNAs in the HPI-GRN of a candidate HPI-GWGEN can be modeled as the following discrete-time dynamic equations [103]:

m_{q} (t + 1) = m_{q} (t) + \sum_{h = 1}^{H_{q}} C_{h q}^{T M} p_{h}^{H} (t) + \sum_{μ = 1}^{U_{q}} C_{μ q}^{M M} m_{q} (t) m_{μ} (t) + \sum_{λ = 1}^{L_{q}} C_{λ q}^{L M} l_{λ} (t) - γ_{M q} m_{q} (t) + β_{M q} + n_{M q} (t), C_{μ q}^{M M} \leq 0 a n d - γ_{M q} \leq 0 for q = 1, 2, \dots, Q

(4)

where

m_{q} (t)

,

p_{h}^{H} (t)

,

m_{μ} (t)

, and

l_{λ} (t)

denote the expression level of the

q

th host miRNA, the

h

th host TF, the

μ

th host miRNA, and the

λ

th host lncRNA at time t, respectively;

H_{q}

,

U_{q}

, and

L_{q}

represent the number of host TFs, host miRNAs, and host lncRNAs that have regulation on the

q

th host miRNA, respectively;

C_{h q}^{T M}

,

C_{μ q}^{M M}

, and

C_{λ q}^{L M}

specify the regulation ability of the

h

th host TF, the

μ

th host miRNA, and the

λ

th host lncRNA on the

q

th host miRNA, respectively;

- γ_{M q}

and

β_{M q}

represent the degradation rate and the basal activity level of the

q

th host miRNA, respectively;

n_{M q} (t)

is the stochastic noise of the

q

th host miRNA at time t;

Q

is the total number of host miRNAs in candidate HPI-GRN.

The dynamic regulatory models of the host lncRNAs in the HPI-GRN of a candidate HPI-GWGEN can be depicted as the following discrete-time dynamic equations [103]:

l_{s} (t + 1) = l_{s} (t) + \sum_{h = 1}^{H_{s}} C_{h s}^{T L} p_{h}^{H} (t) + \sum_{μ = 1}^{U_{s}} C_{μ s}^{M L} l_{s} (t) m_{μ} (t) + \sum_{λ = 1}^{L_{s}} C_{λ s}^{L L} l_{λ} (t) - γ_{L s} l_{s} (t) + β_{L s} + n_{L s} (t), C_{μ s}^{M L} \leq 0 a n d - γ_{L s} \leq 0 for s = 1, 2, \dots, S

(5)

where

l_{s} (t)

,

p_{h}^{H} (t)

,

m_{μ} (t)

, and

l_{λ} (t)

denote the expression level of the

s

th host lncRNA, the

h

th host TF, the

μ

th host miRNA, and the

λ

th host lncRNA at time t, respectively;

H_{s}

,

U_{s}

, and

L_{s}

represent the number of host TFs, host miRNAs, and host lncRNAs that have regulation on the

s

th host lncRNA, respectively;

C_{h s}^{T L}

,

C_{μ s}^{M L}

, and

C_{λ s}^{L L}

specify the regulation ability of the

h

th host TF, the

μ

th host miRNA, and the

λ

th host lncRNA on the

s

th host lncRNA, respectively;

- γ_{L s}

and

β_{L s}

indicate the degradation rate and the basal activity level of the

s

th host lncRNA, respectively;

n_{L s} (t)

is the stochastic noise of the

s

th host lncRNA at time t;

S

is the total number of host lncRNAs in a candidate HPI-GRN.

The dynamic regulatory models of the pathogen genes in the HPI-GRN of a candidate HPI-GWGEN can be depicted as the following discrete-time dynamic equations [103]:

g_{w}^{P} (t + 1) = g_{w}^{P} (t) + \sum_{h = 1}^{H_{w}} C_{h w}^{T V} p_{h}^{H} (t) + \sum_{μ = 1}^{U_{w}} C_{μ w}^{M V} g_{w}^{P} (t) m_{μ} (t) + \sum_{λ = 1}^{L_{w}} C_{λ w}^{L V} l_{λ} (t) + \sum_{y = 1}^{Y_{w}} C_{y w}^{P V} p_{y}^{P} (t) - γ_{V w} g_{w}^{P} (t) + β_{V w} + n_{V w} (t), C_{μ w}^{M V} \leq 0 a n d - γ_{V w} \leq 0 for w = 1, 2, \dots, W

(6)

where

g_{w}^{P} (t)

,

p_{h}^{H} (t)

,

m_{μ} (t)

,

l_{λ} (t)

, and

p_{y}^{P} (t)

denote the expression level of the

w

th pathogen gene, the

h

th host TF, the

μ

th host miRNA, the

λ

th host lncRNA and

y

th pathogen protein at time t, respectively;

H_{w}

,

U_{w}

,

L_{w}

, and

Y_{w}

represent the number of host TFs, host miRNAs, host lncRNAs, and pathogen TFs that have regulation on the

w

th pathogen gene, respectively;

C_{h w}^{T V}

,

C_{μ w}^{M V}

,

C_{λ w}^{L V}

, and

C_{y w}^{P V}

specify the regulation ability of the

h

th host TF, the

μ

th host miRNA, the

λ

th host lncRNA, and the

y

th pathogen protein on the

w

th pathogen gene, respectively;

- γ_{V w}

and

β_{V w}

indicate the degradation rate and the basal activity level of the

w

th pathogen gene, respectively;

n_{V w} (t)

is the stochastic noise of the

w

th pathogen gene at time t;

W

is the total number of pathogen genes in a candidate HPI-GRN.

4.2.3. System Identification and System Order Selection for HPI-GWGEN

With the discrete-time dynamic models for the candidate HPI-GWGEN, we can perform system identification using HPI RNA-seq time-profile data. Nevertheless, the number of parameters may be larger than the number of samples, which may cause over-fitting in the least square parameter estimation. Furthermore, the candidate HPI-GWGEN contains many false positives. Thus, we used the cubic spline interpolation to solve the over-fitting problem in the system identification process [103]. Then, we applied a system order detection method, the Akaike information criterion (AIC) [103], to detect the system order of the dynamic equations of protein interaction and gene regulation after the system identification to prune the false positives out of the system order in the candidate HPI-GWGENs to obtain the real HPI-GWGEN.

To estimate the parameters in Equations (1)–(6), we arranged each equation as the linear regression form with regressor (expression level from RNA-seq data)

ω_{i} (t)

and the parameter vector

C_{i}

as follows:

p_{i}^{H} (t + 1) = [p_{i}^{H} (t) p_{1}^{H} (t) \dots p_{i}^{H} (t) p_{H_{i}}^{H} (t) p_{i}^{H} (t) p_{1}^{P} (t) \dots p_{i}^{H} (t) p_{V_{i}}^{P} (t) g_{i}^{H} (t) p_{i}^{H} (t) 1] [\begin{matrix} C_{1 i}^{p H H} \\ ⋮ \\ C_{H_{i} i}^{p H H} \\ C_{1 i}^{p P H} \\ ⋮ \\ C_{V_{i} i}^{p P H} \\ α_{H i} \\ 1 - γ_{H i} \\ β_{H i} \end{matrix}] + n_{H i} (t) = ω_{H i} (t) C_{H i} + n_{H i} (t), for i = 1, 2, \dots, I

(7)

p_{k}^{P} (t + 1) = [p_{k}^{P} (t) p_{1}^{H} (t) \dots p_{k}^{P} (t) p_{H_{k}}^{H} (t) p_{k}^{P} (t) p_{1}^{P} (t) \dots p_{k}^{P} (t) p_{V_{k}}^{P} (t) g_{k}^{P} (t) p_{k}^{P} (t) 1] [\begin{matrix} C_{1 k}^{p H P} \\ ⋮ \\ C_{H_{k} k}^{p H P} \\ C_{1 k}^{p P P} \\ ⋮ \\ C_{V_{k} k}^{p P P} \\ α_{P k} \\ 1 - γ_{P k} \\ β_{P k} \end{matrix}] + n_{P k} (t) = ω_{P k} (t) C_{P k} + n_{P k} (t), for k = 1, 2, \dots, K

(8)

g_{o}^{H} (t + 1) = [p_{1}^{H} (t) \dots p_{H_{o}}^{H} (t) g_{o}^{H} (t) m_{1} (t) \dots g_{o}^{H} (t) m_{U_{o}} (t) l_{1} (t) \dots l_{L_{o}} (t) g_{o}^{H} (t) 1] [\begin{matrix} C_{1 o}^{T G} \\ ⋮ \\ C_{H_{o} o}^{T G} \\ C_{1 o}^{M G} \\ ⋮ \\ C_{U_{o} o}^{M G} \\ C_{1 o}^{L G} \\ ⋮ \\ C_{L_{o} o}^{L G} \\ 1 - γ_{G o} \\ β_{G o} \end{matrix}] + n_{G o} (t) = ω_{G o} (t) C_{G o} + n_{G o} (t), for o = 1, 2, \dots, O

(9)

m_{q} (t + 1) = [p_{1}^{H} \dots p_{H_{q}}^{H} (t) m_{q} (t) m_{1} (t) \dots m_{q} (t) m_{U_{q}} (t) l_{1} (t) \dots l_{L_{q}} (t) m_{q} (t) 1] [\begin{matrix} C_{1 q}^{T M} \\ ⋮ \\ C_{H_{q} q}^{T M} \\ C_{1 q}^{M M} \\ ⋮ \\ C_{U_{q} q}^{M M} \\ C_{1 q}^{L M} \\ ⋮ \\ C_{L_{q} q}^{L M} \\ 1 - γ_{M q} \\ β_{M q} \end{matrix}] + n_{M q} (t) = ω_{M q} (t) C_{M q} + n_{M q} (t), for q = 1, 2, \dots, Q

(10)

l_{s} (t + 1) = [p_{1}^{H} \dots p_{H_{s}}^{H} (t) l_{s} (t) m_{1} (t) \dots l_{s} (t) m_{U_{s}} (t) l_{1} (t) \dots l_{L_{s}} (t) l_{s} (t) 1] [\begin{matrix} C_{1 s}^{T L} \\ ⋮ \\ C_{H_{s} s}^{T L} \\ C_{1 s}^{M L} \\ ⋮ \\ C_{U_{s} s}^{M L} \\ C_{1 s}^{L L} \\ ⋮ \\ C_{L_{s} s}^{L L} \\ 1 - γ_{L s} \\ β_{L s} \end{matrix}] + n_{L s} (t) = ω_{L s} (t) C_{L s} + n_{L s} (t), for s = 1, 2, \dots, S

(11)

g_{w}^{P} (t + 1) = [p_{1}^{H} \dots p_{H_{w}}^{H} (t) g_{w}^{P} m_{1} (t) \dots g_{w}^{P} m_{U_{w}} (t) l_{1} (t) \dots l_{L_{w}} (t) p_{1}^{P} (t) \dots p_{Y_{w}}^{P} (t) g_{w}^{P} (t) 1] [\begin{matrix} C_{1 w}^{T V} \\ \dots \\ C_{H_{w} w}^{T V} \\ C_{1 w}^{M V} \\ \dots \\ C_{U_{w} w}^{M V} \\ C_{1 w}^{L V} \\ \dots \\ C_{L_{w} w}^{L V} \\ C_{1 w}^{P V} \\ \dots \\ C_{Y_{w} w}^{P V} \\ 1 - γ_{V w} \\ β_{V w} \end{matrix}] + n_{V w} (t) = ω_{V w} (t) C_{V w} + n_{V w} (t), for w = 1, 2, \dots, W

(12)

Then, we used the time points from

t_{2}

to

t_{T}

(T represents the number of time points of each infectious stage after interpolation) as the vector of observations. We could form the regressor matrices

π_{i}

and the vectors of observations

Π_{i}

with Equations (7)–(12) as the following augmented regression equations, respectively:

[\begin{matrix} p_{i}^{H} (t_{2}) \\ p_{i}^{H} (t_{3}) \\ ⋮ \\ p_{i}^{H} (t_{T}) \end{matrix}] = [\begin{matrix} ω_{H i} (t_{1}) \\ ω_{H i} (t_{2}) \\ ⋮ \\ ω_{H i} (t_{T - 1}) \end{matrix}] C_{H i} + [\begin{matrix} n_{H i} (t_{1}) \\ n_{H i} (t_{2}) \\ ⋮ \\ n_{H i} (t_{T - 1}) \end{matrix}] \Rightarrow Π_{H i} = π_{H i} C_{H i} + N_{H i}, for i = 1, 2, \dots, I

(13)

[\begin{matrix} p_{k}^{P} (t_{2}) \\ p_{k}^{P} (t_{3}) \\ ⋮ \\ p_{k}^{P} (t_{T}) \end{matrix}] = [\begin{matrix} ω_{P k} (t_{1}) \\ ω_{P k} (t_{2}) \\ ⋮ \\ ω_{P k} (t_{T - 1}) \end{matrix}] C_{P k} + [\begin{matrix} n_{P k} (t_{1}) \\ n_{P k} (t_{2}) \\ ⋮ \\ n_{P k} (t_{T - 1}) \end{matrix}] \Rightarrow Π_{P k} = π_{P k} C_{P k} + N_{P k}, for k = 1, 2, \dots, K

(14)

[\begin{matrix} g_{o}^{H} (t_{2}) \\ g_{o}^{H} (t_{3}) \\ ⋮ \\ g_{o}^{H} (t_{T}) \end{matrix}] = [\begin{matrix} ω_{G o} (t_{1}) \\ ω_{G o} (t_{2}) \\ ⋮ \\ ω_{G o} (t_{T - 1}) \end{matrix}] C_{G o} + [\begin{matrix} n_{G o} (t_{1}) \\ n_{G o} (t_{2}) \\ ⋮ \\ n_{G o} (t_{T - 1}) \end{matrix}] \Rightarrow Π_{G o} = π_{G o} C_{G o} + N_{G o}, for o = 1, 2, \dots, O

(15)

[\begin{matrix} m_{q} (t_{2}) \\ m_{q} (t_{3}) \\ ⋮ \\ m_{q} (t_{T}) \end{matrix}] = [\begin{matrix} ω_{M q} (t_{1}) \\ ω_{M q} (t_{2}) \\ ⋮ \\ ω_{M q} (t_{T - 1}) \end{matrix}] C_{M q} + [\begin{matrix} n_{M q} (t_{1}) \\ n_{M q} (t_{2}) \\ ⋮ \\ n_{M q} (t_{T - 1}) \end{matrix}] \Rightarrow Π_{M q} = π_{M q} C_{M q} + N_{M q}, for q = 1, 2, \dots, Q

(16)

[\begin{matrix} l_{s} (t_{2}) \\ l_{s} (t_{3}) \\ ⋮ \\ l_{s} (t_{T}) \end{matrix}] = [\begin{matrix} ω_{L s} (t_{1}) \\ ω_{L s} (t_{2}) \\ ⋮ \\ ω_{L s} (t_{T - 1}) \end{matrix}] C_{L s} + [\begin{matrix} n_{L s} (t_{1}) \\ n_{L s} (t_{2}) \\ ⋮ \\ n_{L s} (t_{T - 1}) \end{matrix}] \Rightarrow Π_{L s} = π_{L s} C_{L s} + N_{L s}, for s = 1, 2, \dots, S

(17)

[\begin{matrix} g_{w}^{P} (t_{2}) \\ g_{w}^{P} (t_{3}) \\ ⋮ \\ g_{w}^{P} (t_{T}) \end{matrix}] = [\begin{matrix} ω_{V w} (t_{1}) \\ ω_{V w} (t_{2}) \\ ⋮ \\ ω_{V w} (t_{T - 1}) \end{matrix}] C_{V w} + [\begin{matrix} n_{V w} (t_{1}) \\ n_{V w} (t_{2}) \\ ⋮ \\ n_{V w} (t_{T - 1}) \end{matrix}] \Rightarrow Π_{V w} = π_{V w} C_{V w} + N_{V w}, for w = 1, 2, \dots, W

(18)

With regressor matrices

π_{i}

and the regression vectors of observations

Π_{i}

, we could obtain the estimated parameter vectors

{\hat{C}}_{i}

by solving the following constrained optimization problems with some biological upper and lower bounds (i.e., the translation rate

α_{i}

\geq 0

, the degradation rate

- γ_{i}

\leq 0

, and the regulation ability of miRNAs

\leq 0

) on Equations (13)–(18) [103]:

\begin{matrix} {\hat{C}}_{H i} = \underset{C_{H i}}{a r g m i n} & ‖ π_{H i} C_{H i} - Π_{H i} ‖_{2}^{2}, \\ subject to [\underset{H_{i}}{\underset{⏟}{\begin{matrix} 0 & \dots & 0 \\ 0 & \dots & 0 \end{matrix}}} | \underset{V_{i}}{\underset{⏟}{\begin{matrix} 0 & \dots & 0 \\ 0 & \dots & 0 \end{matrix}}} | \begin{matrix} - 1 & 0 & 0 \\ 0 & 1 & 0 \end{matrix}] {\hat{C}}_{H i} \leq [\begin{matrix} 0 \\ 1 \end{matrix}] \end{matrix}

(19)

\begin{matrix} {\hat{C}}_{P k} = \underset{C_{P k}}{a r g m i n} & ‖ π_{P k} C_{P k} - Π_{P k} ‖_{2}^{2}, \\ subject to [\underset{H_{k}}{\underset{⏟}{\begin{matrix} 0 & \dots & 0 \\ 0 & \dots & 0 \end{matrix}}} | \underset{V_{k}}{\underset{⏟}{\begin{matrix} 0 & \dots & 0 \\ 0 & \dots & 0 \end{matrix}}} | \begin{matrix} - 1 & 0 & 0 \\ 0 & 1 & 0 \end{matrix}] {\hat{C}}_{P k} \leq [\begin{matrix} 0 \\ 1 \end{matrix}] \end{matrix}

(20)

\begin{matrix} {\hat{C}}_{G o} = \underset{C_{G o}}{a r g m i n} & ‖ π_{G o} C_{G o} - Π_{G o} ‖_{2}^{2}, \\ subject to [\underset{H_{o}}{\underset{⏟}{\begin{matrix} 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \end{matrix}}} | \underset{U_{o}}{\underset{⏟}{\begin{matrix} 1 & 0 & \dots & 0 \\ 0 & 1 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & 1 \\ 0 & 0 & \dots & 0 \end{matrix}}} | \underset{L_{o}}{\underset{⏟}{\begin{matrix} 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \end{matrix}}} | \begin{matrix} 0 & 0 \\ 0 & 0 \\ ⋮ & ⋮ \\ 0 & 0 \\ 1 & 0 \end{matrix}] {\hat{C}}_{G o} \leq [\begin{matrix} 0 \\ 0 \\ ⋮ \\ 0 \\ 1 \end{matrix}] \end{matrix}

(21)

\begin{matrix} {\hat{C}}_{M q} = \underset{C_{M q}}{a r g m i n} & ‖ π_{M q} C_{M q} - Π_{M q} ‖_{2}^{2}, \\ subject to [\underset{H_{q}}{\underset{⏟}{\begin{matrix} 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \end{matrix}}} | \underset{U_{q}}{\underset{⏟}{\begin{matrix} 1 & 0 & \dots & 0 \\ 0 & 1 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & 1 \\ 0 & 0 & \dots & 0 \end{matrix}}} | \underset{L_{q}}{\underset{⏟}{\begin{matrix} 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \end{matrix}}} | \begin{matrix} 0 & 0 \\ 0 & 0 \\ ⋮ & ⋮ \\ 0 & 0 \\ 1 & 0 \end{matrix}] {\hat{C}}_{M q} \leq [\begin{matrix} 0 \\ 0 \\ ⋮ \\ 0 \\ 1 \end{matrix}] \end{matrix}

(22)

\begin{matrix} {\hat{C}}_{L s} = \underset{C_{L s}}{a r g m i n} & ‖ π_{L s} C_{L s} - Π_{L s} ‖_{2}^{2}, \\ subject to [\underset{H_{s}}{\underset{⏟}{\begin{matrix} 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \end{matrix}}} | \underset{U_{s}}{\underset{⏟}{\begin{matrix} 1 & 0 & \dots & 0 \\ 0 & 1 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & 1 \\ 0 & 0 & \dots & 0 \end{matrix}}} | \underset{L_{s}}{\underset{⏟}{\begin{matrix} 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \end{matrix}}} | \begin{matrix} 0 & 0 \\ 0 & 0 \\ ⋮ & ⋮ \\ 0 & 0 \\ 1 & 0 \end{matrix}] {\hat{C}}_{L s} \leq [\begin{matrix} 0 \\ 0 \\ ⋮ \\ 0 \\ 1 \end{matrix}] \end{matrix}

(23)

\begin{matrix} {\hat{C}}_{V w} & = \underset{C_{V w}}{a r g m i n} ‖ π_{V w} C_{V w} - Π_{V w} ‖_{2}^{2}, \\ subject to [\underset{H_{w}}{\underset{⏟}{\begin{matrix} 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \end{matrix}}} | \underset{U_{w}}{\underset{⏟}{\begin{matrix} 1 & 0 & \dots & 0 \\ 0 & 1 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & 1 \\ 0 & 0 & \dots & 0 \end{matrix}}} | \underset{L_{w}}{\underset{⏟}{\begin{matrix} 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \end{matrix}}} | \underset{Y_{w}}{\underset{⏟}{\begin{matrix} 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \\ ⋮ & ⋮ & ⋱ & ⋮ \\ 0 & 0 & \dots & 0 \\ 0 & 0 & \dots & 0 \end{matrix}}} | \begin{matrix} 0 & 0 \\ 0 & 0 \\ ⋮ & ⋮ \\ 0 & 0 \\ 1 & 0 \end{matrix}] {\hat{C}}_{V w} \leq [\begin{matrix} 0 \\ 0 \\ ⋮ \\ 0 \\ 1 \end{matrix}] \end{matrix}

(24)

We wanted to prune the false positives in the candidate HPI-GWGEN by detecting the edges out of the system order (i.e., the number of interactions or regulations for each node). The AIC considers both the estimated variance and model complexity to obtain the fittest number of edges (i.e., the system order). The AIC value for each model of a node with estimated parameters is given as follows (the

\dim

() in equations represents the dimension of vector) [103]:

A I C_{H i} (H_{i}, V_{i}) = \log (\frac{‖ π_{H i} {\hat{C}}_{H i} - Π_{H i} ‖_{2}^{2}}{T - 1}) + \frac{2 \dim ({\hat{C}}_{H i})}{T - 1}

(25)

A I C_{P k} (H_{k}, V_{k}) = \log (\frac{‖ π_{P k} {\hat{C}}_{P k} - Π_{P k} ‖_{2}^{2}}{T - 1}) + \frac{2 \dim ({\hat{C}}_{P k})}{T - 1}

(26)

A I C_{G o} (H_{o}, U_{o}, L_{o}) = \log (\frac{‖ π_{G o} {\hat{C}}_{G o} - Π_{G o} ‖_{2}^{2}}{T - 1}) + \frac{2 \dim ({\hat{C}}_{G o})}{T - 1}

(27)

A I C_{M q} (H_{q}, U_{q}, L_{q}) = \log (\frac{‖ π_{M q} {\hat{C}}_{M q} - Π_{M q} ‖_{2}^{2}}{T - 1}) + \frac{2 \dim ({\hat{C}}_{M q})}{T - 1}

(28)

A I C_{L s} (H_{s}, U_{s}, L_{s}) = \log (\frac{‖ π_{L s} {\hat{C}}_{L s} - Π_{L s} ‖_{2}^{2}}{T - 1}) + \frac{2 \dim ({\hat{C}}_{L s})}{T - 1}

(29)

A I C_{V w} (H_{o}, U_{o}, L_{o}, Y_{w}) = \log (\frac{‖ π_{V w} {\hat{C}}_{V w} - Π_{V w} ‖_{2}^{2}}{T - 1}) + \frac{2 \dim ({\hat{C}}_{V w})}{T - 1}

(30)

In the AICs in Equations (25)–(30), increasing the number of interactions or regulations would decrease the system identification error in the first term but increase the second term in the right-hand-side of Equations (25)–(30), and vice versa. The right number (system order of each node) of regulations and interactions would lead to the minimum value of AIC. In other words, when the false positives are considered in AIC with a larger model complexity, a larger AIC value is obtained because the false positives cannot reduce the estimated error variance in the first term and increase the model complexity in the second term. Hence, we can delete some false-positive interactions or regulations in each node of the candidate HPI-GWGEN to achieve the correct (real) HPI-GWGEN via the AIC method. After solving optimization problems in Equations (19)–(24) with the aid of the MATLAB function lsqlin and considering the system order, we could obtain the real HPI-GWGEN. To represent the real HPI-GWGEN as a network matrix, we needed to integrate the PPIN and GRN. When we described the dynamic models in Equations (1)–(6), we did not consider the zero term. To represent different models (especially, different dimension of network) with a network matrix, we had to fill the terms which lack interaction or regulation (i.e., without interaction or regulation in candidate HPI-GWGENs or the false positives pruned off after system order detection) with zeros. For convenience, we still used the same superscript for estimated coefficients of interaction or regulation to denote the relation between two nodes, i.e.,

\hat{C_{12}^{p H H}}

still represents the estimated interactive ability between the 1st host protein and the 2nd host protein; it will be zero if these proteins do not interact in the candidate HPI-GWGEN, or it will be a false positive pruned off after system order selection. Then, the network matrix

M \in R^{(I + K + O + Q + S + W) \times (I + K + M + L)}

of the real HPI-GWGEN was integrated as the following network matrix:

M = [\begin{matrix} P_{H P \leftrightarrow H P} & P_{P P \leftrightarrow H P} & 0_{I \times Q} & 0_{I \times S} \\ P_{H P \leftrightarrow P P} & P_{P P \leftrightarrow P P} & 0_{K \times Q} & 0_{K \times S} \\ G_{H P \to H G} & 0_{O \times K} & G_{H M \to H G} & G_{H L \to H G} \\ G_{H P \to H M} & 0_{Q \times K} & G_{H M \to H M} & G_{H L \to H M} \\ G_{H P \to H L} & 0_{S \times K} & G_{H M \to H L} & G_{H L \to H L} \\ G_{H P \to P G} & G_{P P \to P G} & G_{H M \to P G} & G_{H L \to P G} \end{matrix}] = [\begin{matrix} \hat{C_{11}^{p H H}} & \hat{C_{21}^{p H H}} & \dots & \hat{C_{I 1}^{p H H}} & \hat{C_{11}^{p P H}} & \dots & \hat{C_{K 1}^{p P H}} \\ \hat{C_{12}^{p H H}} & \hat{C_{22}^{p H H}} & \dots & \hat{C_{I 2}^{p H H}} & \hat{C_{12}^{p P H}} & \dots & \hat{C_{K 2}^{p P H}} & 0_{I \times Q} & 0_{I \times S} \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋱ & ⋮ \\ \hat{C_{1 I}^{p H H}} & \hat{C_{2 I}^{p H H}} & \dots & \hat{C_{I I}^{p H H}} & \hat{C_{1 I}^{p P H}} & \dots & \hat{C_{K I}^{p P H}} \\ \hat{C_{11}^{p H P}} & \hat{C_{21}^{p H P}} & \dots & \hat{C_{I 1}^{p H P}} & \hat{C_{11}^{p P P}} & \dots & \hat{C_{K 1}^{p P P}} \\ \hat{C_{12}^{p H P}} & \hat{C_{22}^{p H P}} & \dots & \hat{C_{I 2}^{p H P}} & \hat{C_{12}^{p P P}} & \dots & \hat{C_{K 2}^{p P P}} & 0_{K \times Q} & 0_{K \times Q} \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋱ & ⋮ \\ \hat{C_{1 K}^{p H P}} & \hat{C_{2 K}^{p H P}} & \dots & \hat{C_{I K}^{p H P}} & \hat{C_{1 K}^{p P P}} & \dots & \hat{C_{K K}^{p P P}} \\ \hat{C_{11}^{T G}} & \hat{C_{21}^{T G}} & \dots & \hat{C_{I 1}^{T G}} & \hat{C_{11}^{M G}} & \dots & \hat{C_{Q 1}^{M G}} & \hat{C_{11}^{L G}} & \dots & \hat{C_{S 1}^{L G}} \\ ⋮ & ⋮ & ⋱ & ⋮ & 0_{O \times K} & ⋮ & ⋱ & ⋮ & ⋮ & ⋱ & ⋮ \\ \hat{C_{1 O}^{T G}} & \hat{C_{2 O}^{T G}} & \dots & \hat{C_{I O}^{T G}} & \hat{C_{1 O}^{M G}} & \dots & \hat{C_{Q O}^{M G}} & \hat{C_{1 O}^{L G}} & \dots & \hat{C_{S O}^{L G}} \\ \hat{C_{11}^{T M}} & \hat{C_{21}^{T M}} & \dots & \hat{C_{I 1}^{T M}} & \hat{C_{11}^{M M}} & \dots & \hat{C_{Q 1}^{M M}} & \hat{C_{11}^{L M}} & \dots & \hat{C_{S 1}^{L M}} \\ ⋮ & ⋮ & ⋱ & ⋮ & 0_{Q \times K} & ⋮ & ⋱ & ⋮ & ⋮ & ⋱ & ⋮ \\ \hat{C_{1 Q}^{T M}} & \hat{C_{2 Q}^{T M}} & \dots & \hat{C_{I Q}^{T M}} & \hat{C_{1 Q}^{M M}} & \dots & \hat{C_{Q Q}^{M M}} & \hat{C_{1 Q}^{L M}} & \dots & \hat{C_{S Q}^{L M}} \\ \hat{C_{11}^{T L}} & \hat{C_{21}^{T L}} & \dots & \hat{C_{I 1}^{T L}} & \hat{C_{11}^{M L}} & \dots & \hat{C_{Q 1}^{M L}} & \hat{C_{11}^{L L}} & \dots & \hat{C_{S 1}^{L L}} \\ ⋮ & ⋮ & ⋱ & ⋮ & 0_{S \times K} & ⋮ & ⋱ & ⋮ & ⋮ & ⋱ & ⋮ \\ \hat{C_{1 S}^{T L}} & \hat{C_{2 S}^{T L}} & \dots & \hat{C_{I S}^{T L}} & \hat{C_{1 S}^{M L}} & \dots & \hat{C_{Q S}^{M L}} & \hat{C_{1 S}^{L L}} & \dots & \hat{C_{S S}^{L L}} \\ \hat{C_{11}^{T V}} & \hat{C_{21}^{L L}} & \dots & \hat{C_{I 1}^{T V}} & \hat{C_{11}^{P V}} & \dots & \hat{C_{K 1}^{P V}} & \hat{C_{11}^{M V}} & \dots & \hat{C_{Q 1}^{M V}} & \hat{C_{11}^{L V}} & \dots & \hat{C_{S 1}^{L V}} \\ ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋱ & ⋮ & ⋮ & ⋱ & ⋮ \\ \hat{C_{1 W}^{T V}} & \hat{C_{2 W}^{L V}} & \dots & \hat{C_{I W}^{T V}} & \hat{C_{1 W}^{P V}} & \dots & \hat{C_{K W}^{P V}} & \hat{C_{1 W}^{M V}} & \dots & \hat{C_{Q W}^{M V}} & \hat{C_{1 W}^{L V}} & \dots & \hat{C_{S W}^{L V}} \end{matrix}]

(31)

where P and G represent the PPIN and GRN, respectively; HP, PP, HG, HM, HL, and PG specify the host protein, the pathogen protein, the host gene, the host miRNA, the host lncRNA, and the pathogen gene, respectively.

We wanted to investigate the significant pathogenesis for the amplification and saturation stages based on their real HPI-GWGEN. However, the real HPI-GWGEN was too large to be analyzed by the annotation of KEGG pathways. Thus, we applied the PNP method to extract the core HPI-GWGEN from the real HPI-GWGEN, which is more easily annotated by KEGG pathways [47,48].

4.3. PNP Method to Extract the Core HPI-GWGEN from Network Matrix of Real HPI-GWGEN

The is a method that projects each row (node) of network matrix M in Equation (31) (real HPI-GWGEN) onto the significant 85% structure of the overall network so that we can know the importance of each node to the overall network according to the projection value. We first performed the singular value decomposition (SVD) for the network matrix using the following equation:

M = L E R^{T}, L \in R^{(R o w) \times (R o w)}, R^{T} \in R^{(C o l) \times (C o l)} and E \in R^{(R o w) \times (C o l)}

(32)

where the columns of L and R denote the left singular vectors and right singular vectors of M, respectively; E only contains the singular values of M on the entries of

e_{i i}

and

e_{11} \geq e_{22} \geq e_{33} \geq \dots \geq 0

;

R o w = I + K + O + Q + S + W

and

C o l = I + K + M + L

are the row and column dimensions of network matrix M, respectively.

Here, we adopted the truncated SVD. That is, we chose the top t singular values that consisted of more than 85% of the energy of network. In other words, the minimum t is satisfied with the following equation:

\frac{\sum_{k = 1}^{t} e_{k k}^{2}}{\sum_{i = 1}^{C o l} e_{i i}^{2}} \geq 85 %

(33)

Then, we define the projection value for the

a

th row of the network matrix on the top t singular vectors of the network matrix M using the following equation:

P r o j_{a} = \sqrt{\sum_{k = 1}^{t} {(M_{a} R_{k}^{T})}^{2}}, f o r a = 1, \dots, R o w

(34)

where

M_{a}

and

R_{k}^{T}

represent the

a

th row of M in Equation (31) and the

k

th column of

R

(the columns of R are the right singular vectors of M), respectively.

Subsequently, the projection values of each node were ranked from large to small values (a larger value implies more significance to the network). We used the top-6000-significance nodes to construct the core HPI-GWGEN, which is acceptable for the annotation of KEGG pathways in DAVID [47,48]. Thus, we could extract the core HPI-GWGEN of the amplification and saturation stages from the real HPI-GWGEN of the amplification and saturation stages, correspondingly. With the aid of the annotation of KEGG pathways, we investigated the pathogenic mechanism. Finally, considering the core HPI signaling pathways and their downstream abnormal cellular functions, we selected the significant biomarkers as drug targets against the amplification and saturation stages of SARS-CoV-2 infection.

4.4. Systematic Discovery and Design of Multiple-Molecule Drug by UtilizingDNN-Based DTI Model with Drug Design Specifications

4.4.1. Preprocess of Targets and Drugs Data

To train the DNN-based DTI model in Figure 4, we first collected the DTI data from databases, including ChEMBL [30], BindingDB [31], Pubchem [32], UniProt [33], and DrugBank [34]. To calculate chemical descriptors for drugs and properties of proteins, we used PyBioMed [105], a python package, to transform the drugs and targets into features. Subsequently, the drug and target features were merged into a feature vector

F_{D T}

as in the following equation:

F_{D T} = [F_{D}, F_{T}] = [f_{d_{1}}, f_{d_{2}}, \dots, f_{d_{a}}, \dots, f_{d_{A}}, f_{t_{1}}, f_{t_{2}}, \dots, f_{d_{b}}, \dots, f_{t_{B}}]

(35)

where

F_{D}

and

F_{T}

are the features of the drug and target (protein), respectively;

f_{d_{a}}

and

f_{t_{b}}

are the

a

th drug feature and the

b

th target feature, respectively; A and B are the total number of features of the drug and the total number of features of the target, respectively. The input of the DNN-based DTI model should be in the feature vector form.

The collected drug–target interaction data contain 80,291 proven (positive) interactions and 100,024 unproven (negative) interactions, which implies that the collected data suffer from data imbalance. Data imbalance can cause the model to predict the majority, leading to prediction bias. Hence, the negative interactions were randomly deleted to match the number of positive interactions. Then, we divided all the data into training data (four-fifths) and testing data (one-fifth). To improve the convergence of gradient descent, we first performed feature scaling for the training dataset. Because there were some outliers for some features, we used standardization for each feature of training data. Then, PCA was applied to reduce the dimensions of the feature vector to 900 for the convenience of computing the DNN-based DTI model with 900 neurons in the input layer, as shown in Figure 4.

4.4.2. Architecture of DNN-Based DTI Model

Recent studies [28,29] have shown that the DNN-based DTI model can improve the prediction of interaction probability. We employed the DNN to predict the DTI for the pre-candidate molecular drugs with the significant biomarkers (drug targets). The architecture of neural network is shown in Figure 4. The neural network contains four hidden layers. Each hidden layer contains 512, 256, 128, and 64 neurons, respectively. Each neuron in the hidden layer has a bias, ReLU as the activation function to learn the nonlinearity, and a dropout (

=

0.45) to avoid overfitting [106]. The output layer contains a neuron, a bias, and sigmoid as the activation function to represent the output between 0 and 1 as the drug–target interaction probability.

Since the DTI prediction was the binary classification, we used the binary cross-entropy as the loss function. The adaptive moment estimation (Adam) [107] was adopted as the optimization algorithm to update the parameters of the DNN-based DTI model. The DNN-based DTI model was trained by Keras with batch size = 64, epoch = 200 (with Early Stopping), and Adam optimizer (default arguments). The 10-fold cross-validation was first applied to examine the prediction performance of the DNN-based DTI model. Finally, the AUC was used to judge the ability of DNN-based DTI model to distinguish positive and negative interaction.

4.4.3. Drug Design Specifications

In addition to the drug–target interaction, the regulation ability, toxicity, and sensitivity of the drug are considered when we chose the drugs to make sure the quality of drugs. The Library of Integrated Network-Based Cellular Signatures (LINCS) L1000 dataset [108,109] is used for drug regulation ability, i.e., the indicates the upregulation (>0) or downregulation (<0) of the drug target interaction. The drug sensitivity is also considered. The PRISM Repurposing dataset [110] contains chemical perturbations of compounds for homo sapiens cells. The closer zero value of sensitivity indicates the less sensitivity chemical perturbation for the cell. The other drug design specification is the toxicity (LC50), which is obtained by the tool ADMETlab 2.0 [111]. The higher value of LC50 implies the lower toxicity for the body.

Author Contributions

Conceptualization, C.-G.W. and B.-S.C.; methodology, C.-G.W. and B.-S.C.; software, C.-G.W., and B.-S.C.; validation, C.-G.W. and B.-S.C.; formal analysis, C.-G.W. and B.-S.C.; investigation, C.-G.W. and B.-S.C.; data curation, C.-G.W.; writing—original draft preparation, C.-G.W.; writing—review and editing, C.-G.W. and B.-S.C.; visualization, C.-G.W.; supervision, B.-S.C.; funding acquisition, B.-S.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Ministry of Science and Technology, grant number MOST 111-2221-E-007-130-MY2.

Data Availability Statement

The RNA-seq data were downloaded from NCBI (accession number: GSE163547) (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE163547) on 1 September 2021.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Figure A1. Visualization of real HPI-GWGEN of amplification and saturation infectious stages. The blue and green edges (lines) represent the protein–protein interactions and gene regulations, respectively. The numbers of each type of node (protein, receptor, transcription factor, miRNA, lncRNA, and virus) are shown in the figure. (A,B) are the real HPI-GWGENs at the amplification and saturation infectious stages, respectively.

References

WHO. Available online: https://covid19.who.int/ (accessed on 22 June 2022).
Lechien, J.R.; Chiesa-Estomba, C.M.; Place, S.; Van Laethem, Y.; Cabaraux, P.; Mat, Q.; Huet, K.; Plzak, J.; Horoi, M.; Hans, S.; et al. Clinical and epidemiological characteristics of 1420 European patients with mild-to-moderate coronavirus disease 2019. J. Intern. Med. 2020, 288, 335–344. [Google Scholar] [CrossRef] [PubMed]
Stokes, E.K.; Zambrano, L.D.; Anderson, K.N.; Marder, E.P.; Raz, K.M.; El Burai Felix, S.; Tie, Y.; Fullerton, K.E. Coronavirus Disease 2019 Case Surveillance—United States, 22 January–30 May 2020. MMWR Morb. Mortal. Wkly. Rep. 2020, 69, 759–765. [Google Scholar] [CrossRef] [PubMed]
Gordon, D.E.; Jang, G.M.; Bouhaddou, M.; Xu, J.; Obernier, K.; White, K.M.; O’Meara, M.J.; Rezelj, V.V.; Guo, J.Z.; Swaney, D.L.; et al. A SARS-CoV-2 protein interaction map reveals targets for drug repurposing. Nature 2020, 583, 459–468. [Google Scholar] [CrossRef]
Kim, D.; Lee, J.Y.; Yang, J.S.; Kim, J.W.; Kim, V.N.; Chang, H. The Architecture of SARS-CoV-2 Transcriptome. Cell 2020, 181, 914–921.e10. [Google Scholar] [CrossRef] [PubMed]
Wu, F.; Zhao, S.; Yu, B.; Chen, Y.M.; Wang, W.; Song, Z.G.; Hu, Y.; Tao, Z.W.; Tian, J.H.; Pei, Y.Y.; et al. A new coronavirus associated with human respiratory disease in China. Nature 2020, 579, 265–269. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Flynn, R.A.; Belk, J.A.; Qi, Y.; Yasumoto, Y.; Wei, J.; Alfajaro, M.M.; Shi, Q.; Mumbach, M.R.; Limaye, A.; DeWeirdt, P.C.; et al. Discovery and functional interrogation of SARS-CoV-2 RNA-host protein interactions. Cell 2021, 184, 2394–2411.e16. [Google Scholar] [CrossRef] [PubMed]
Stukalov, A.; Girault, V.; Grass, V.; Karayel, O.; Bergant, V.; Urban, C.; Haas, D.A.; Huang, Y.; Oubraham, L.; Wang, A.; et al. Multilevel proteomics reveals host perturbations by SARS-CoV-2 and SARS-CoV. Nature 2021, 594, 246–252. [Google Scholar] [CrossRef] [PubMed]
Balakrishnan, L.; Milavetz, B. Epigenetic Regulation of Viral Biological Processes. Viruses 2017, 9, 346. [Google Scholar] [CrossRef] [Green Version]
Zhang, Q.; Cao, X. Epigenetic regulation of the innate immune response to infection. Nat. Rev. Immunol. 2019, 19, 417–432. [Google Scholar] [CrossRef]
Leong, M.M.L.; Lung, M.L. The Impact of Epstein-Barr Virus Infection on Epigenetic Regulation of Host Cell Gene Expression in Epithelial and Lymphocytic Malignancies. Front. Oncol. 2021, 11, 629780. [Google Scholar] [CrossRef]
Bartel, D.P. Metazoan MicroRNAs. Cell 2018, 173, 20–51. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Qureshi, A.; Thakur, N.; Monga, I.; Thakur, A.; Kumar, M. VIRmiRNA: A comprehensive resource for experimentally validated viral miRNAs and their targets. Database 2014, 2014, bau103. [Google Scholar] [CrossRef] [PubMed]
Saliminejad, K.; Khorram Khorshid, H.R.; Soleymani Fard, S.; Ghaffari, S.H. An overview of microRNAs: Biology, functions, therapeutics, and analysis methods. J. Cell. Physiol. 2019, 234, 5451–5465. [Google Scholar] [CrossRef] [PubMed]
Dykes, I.M.; Emanueli, C. Transcriptional and Post-transcriptional Gene Regulation by Long Non-coding RNA. Genom. Proteom. Bioinform. 2017, 15, 177–186. [Google Scholar] [CrossRef]
Ma, L.; Cao, J.; Liu, L.; Du, Q.; Li, Z.; Zou, D.; Bajic, V.B.; Zhang, Z. LncBook: A curated knowledgebase of human long non-coding RNAs. Nucleic Acids Res. 2019, 47, D128–D134. [Google Scholar] [CrossRef] [Green Version]
Chen, L.; Zhou, Y.; Li, H. LncRNA, miRNA and lncRNA-miRNA interaction in viral infection. Virus Res. 2018, 257, 25–32. [Google Scholar] [CrossRef]
Waring, M.J.; Arrowsmith, J.; Leach, A.R.; Leeson, P.D.; Mandrell, S.; Owen, R.M.; Pairaudeau, G.; Pennie, W.D.; Pickett, S.D.; Wang, J.; et al. An analysis of the attrition of drug candidates from four major pharmaceutical companies. Nat. Rev. Drug Discov. 2015, 14, 475–486. [Google Scholar] [CrossRef]
Ashburn, T.T.; Thor, K.B. Drug repositioning: Identifying and developing new uses for existing drugs. Nat. Rev. Drug Discov. 2004, 3, 673–683. [Google Scholar] [CrossRef]
Pushpakom, S.; Iorio, F.; Eyers, P.A.; Escott, K.J.; Hopper, S.; Wells, A.; Doig, A.; Guilliams, T.; Latimer, J.; McNamee, C.; et al. Drug repurposing: Progress, challenges and recommendations. Nat. Rev. Drug Discov. 2019, 18, 41–58. [Google Scholar] [CrossRef]
Bayat Mokhtari, R.; Homayouni, T.S.; Baluch, N.; Morgatskaya, E.; Kumar, S.; Das, B.; Yeger, H. Combination therapy in combating cancer. Oncotarget 2017, 8, 38022–38043. [Google Scholar] [CrossRef]
Maenza, J.; Flexner, C. Combination antiretroviral therapy for HIV infection. Am. Fam. Physician 1998, 57, 2789–2798. [Google Scholar] [PubMed]
Fang, J.; Li, H.; Du, W.; Yu, P.; Guan, Y.-Y.; Ma, S.-Y.; Liu, D.; Chen, W.; Shi, G.-C.; Bian, X.-L. Efficacy of Early Combination Therapy With Lianhuaqingwen and Arbidol in Moderate and Severe COVID-19 Patients: A Retrospective Cohort Study. Front. Pharmacol. 2020, 11, 560209. [Google Scholar] [CrossRef] [PubMed]
Deng, J.; Zhou, F.; Hou, W.; Heybati, K.; Ali, S.; Chang, O.; Silver, Z.; Dhivagaran, T.; Ramaraju, H.B.; Wong, C.Y.; et al. Efficacy of lopinavir–ritonavir combination therapy for the treatment of hospitalized COVID-19 patients: A meta-analysis. Future Virol. 2022, 17, 169–189. [Google Scholar] [CrossRef]
Roshanshad, A.; Kamalipour, A.; Ashraf, M.A.; Roshanshad, R.; Jafari, S.; Nazemi, P.; Akbari, M. The efficacy of remdesivir in coronavirus disease 2019 (COVID-19): A systematic review. Iran. J. Microbiol. 2020, 12, 376–387. [Google Scholar] [CrossRef] [PubMed]
Ansems, K.; Grundeis, F.; Dahms, K.; Mikolajewska, A.; Thieme, V.; Piechotta, V.; Metzendorf, M.I.; Stegemann, M.; Benstoem, C.; Fichtner, F. Remdesivir for the treatment of COVID-19. Cochrane Database Syst. Rev. 2021, 8, Cd014962. [Google Scholar]
Gil-Sierra, M.D.; Briceño-Casado, M.P.; Alegre-Del Rey, E.J.; Sánchez-Hidalgo, M. Efficacy of early use of remdesivir: A systematic review of subgroup analysis. Rev. Esp. Quimioter. 2022, 35, 249–259. [Google Scholar] [CrossRef]
Chang, S.; Wang, L.H.; Chen, B.-S. Investigating Core Signaling Pathways of Hepatitis B Virus Pathogenesis for Biomarkers Identification and Drug Discovery via Systems Biology and Deep Learning Method. Biomedicines 2020, 8, 320. [Google Scholar] [CrossRef]
Chang, S.; Chen, J.-Y.; Chuang, Y.-J.; Chen, B.-S. Systems Approach to Pathogenic Mechanism of Type 2 Diabetes and Drug Discovery Design Based on Deep Learning and Drug Design Specifications. Int. J. Mol. Sci. 2021, 22, 166. [Google Scholar] [CrossRef]
Gaulton, A.; Bellis, L.J.; Bento, A.P.; Chambers, J.; Davies, M.; Hersey, A.; Light, Y.; McGlinchey, S.; Michalovich, D.; Al-Lazikani, B.; et al. ChEMBL: A large-scale bioactivity database for drug discovery. Nucleic Acids Res. 2012, 40, D1100–D1107. [Google Scholar] [CrossRef] [Green Version]
Liu, T.; Lin, Y.; Wen, X.; Jorissen, R.N.; Gilson, M.K. BindingDB: A web-accessible database of experimentally determined protein–ligand binding affinities. Nucleic Acids Res. 2007, 35, D198–D201. [Google Scholar] [CrossRef] [Green Version]
Kim, S.; Thiessen, P.A.; Bolton, E.E.; Chen, J.; Fu, G.; Gindulyte, A.; Han, L.; He, J.; He, S.; Shoemaker, B.A.; et al. PubChem Substance and Compound databases. Nucleic Acids Res. 2016, 44, D1202–D1213. [Google Scholar] [CrossRef] [PubMed]
UniProt Consortium. UniProt: A worldwide hub of protein knowledge. Nucleic Acids Res. 2019, 47, D506–D515. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Knox, C.; Law, V.; Jewison, T.; Liu, P.; Ly, S.; Frolkis, A.; Pon, A.; Banco, K.; Mak, C.; Neveu, V.; et al. DrugBank 3.0: A comprehensive resource for “omics” research on drugs. Nucleic Acids Res. 2011, 39, D1035–D1041. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Bovolenta, L.; Acencio, M.; Lemke, N. HTRIdb: An open-access database for experimentally verified human transcriptional regulation interactions. BMC Genom. 2012, 13, 405. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Zheng, G.; Tu, K.; Yang, Q.; Xiong, Y.; Wei, C.; Xie, L.; Zhu, Y.; Li, Y. ITFP: An integrated platform of mammalian transcription factors. Bioinformatics 2008, 24, 2416–2417. [Google Scholar] [CrossRef] [Green Version]
Wingender, E.; Chen, X.; Hehl, R.; Karas, H.; Liebich, I.; Matys, V.; Meinhardt, T.; Prüss, M.; Reuter, I.; Schacherer, F. TRANSFAC: An integrated system for gene expression regulation. Nucleic Acids Res. 2000, 28, 316–319. [Google Scholar] [CrossRef] [Green Version]
Li, J.-H.; Liu, S.; Zhou, H.; Qu, L.-H.; Yang, J.-H. starBase v2.0: Decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data. Nucleic Acids Res. 2013, 42, D92–D97. [Google Scholar] [CrossRef] [Green Version]
Friard, O.; Re, A.; Taverna, D.; De Bortoli, M.; Corá, D. CircuitsDB: A database of mixed microRNA/transcription factor feed-forward regulatory circuits in human and mouse. BMC Bioinform. 2010, 11, 435. [Google Scholar] [CrossRef] [Green Version]
Agarwal, V.; Bell, G.W.; Nam, J.W.; Bartel, D.P. Predicting effective microRNA target sites in mammalian mRNAs. Elife 2015, 4, e05005. [Google Scholar] [CrossRef]
Licata, L.; Briganti, L.; Peluso, D.; Perfetto, L.; Iannuccelli, M.; Galeota, E.; Sacco, F.; Palma, A.; Nardozza, A.P.; Santonico, E.; et al. MINT, the molecular interaction database: 2012 update. Nucleic Acids Res. 2012, 40, D857–D861. [Google Scholar] [CrossRef]
UniProt Consortium. UniProt: The universal protein knowledgebase in 2021. Nucleic Acids Res. 2020, 49, D480–D489. [Google Scholar]
Orchard, S.; Ammari, M.; Aranda, B.; Breuza, L.; Briganti, L.; Broackes-Carter, F.; Campbell, N.H.; Chavali, G.; Chen, C.; del-Toro, N.; et al. The MIntAct project—IntAct as a common curation platform for 11 molecular interaction databases. Nucleic Acids Res. 2013, 42, D358–D363. [Google Scholar] [CrossRef] [PubMed]
Bader, G.D.; Betel, D.; Hogue, C.W. BIND: The Biomolecular Interaction Network Database. Nucleic Acids Res. 2003, 31, 248–250. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Stark, C.; Breitkreutz, B.J.; Reguly, T.; Boucher, L.; Breitkreutz, A.; Tyers, M. BioGRID: A general repository for interaction datasets. Nucleic Acids Res. 2006, 34, D535–D539. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Salwinski, L.; Miller, C.S.; Smith, A.J.; Pettit, F.K.; Bowie, J.U.; Eisenberg, D. The Database of Interacting Proteins: 2004 update. Nucleic Acids Res. 2004, 32, D449–D451. [Google Scholar] [CrossRef] [Green Version]
Huang, D.W.; Sherman, B.T.; Lempicki, R.A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat. Protoc. 2009, 4, 44–57. [Google Scholar] [CrossRef]
Huang, D.W.; Sherman, B.T.; Lempicki, R.A. Bioinformatics enrichment tools: Paths toward the comprehensive functional analysis of large gene lists. Nucleic Acids Res. 2009, 37, 1–13. [Google Scholar] [CrossRef] [Green Version]
Shannon, P.; Markiel, A.; Ozier, O.; Baliga, N.S.; Wang, J.T.; Ramage, D.; Amin, N.; Schwikowski, B.; Ideker, T. Cytoscape: A software environment for integrated models of biomolecular interaction networks. Genome Res. 2003, 13, 2498–2504. [Google Scholar] [CrossRef]
Agajanian, M.J.; Walker, M.P.; Axtman, A.D.; Ruela-de-Sousa, R.R.; Serafin, D.S.; Rabinowitz, A.D.; Graham, D.M.; Ryan, M.B.; Tamir, T.; Nakamichi, Y.; et al. WNT Activates the AAK1 Kinase to Promote Clathrin-Mediated Endocytosis of LRP6 and Establish a Negative Feedback Loop. Cell Rep. 2019, 26, 79–93.e8. [Google Scholar] [CrossRef] [Green Version]
Venkataraman, T.; Coleman, C.M.; Frieman, M.B. Overactive Epidermal Growth Factor Receptor Signaling Leads to Increased Fibrosis after Severe Acute Respiratory Syndrome Coronavirus Infection. J. Virol. 2017, 91, e00182-17. [Google Scholar] [CrossRef] [Green Version]
Luo, S.; Rubinsztein, D.C. BCL2L11/BIM. Autophagy 2013, 9, 104–105. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Häcker, G. Apoptosis in infection. Microbes Infect. 2018, 20, 552–559. [Google Scholar] [CrossRef] [PubMed]
Kumar, R.; Afsar, M.; Khandelwal, N.; Chander, Y.; Riyesh, T.; Dedar, R.K.; Gulati, B.R.; Pal, Y.; Barua, S.; Tripathi, B.N.; et al. Emetine suppresses SARS-CoV-2 replication by inhibiting interaction of viral mRNA with eIF4E. Antivir. Res. 2021, 189, 105056. [Google Scholar] [CrossRef] [PubMed]
Campa, C.C.; Franco, I.; Hirsch, E. PI3K-C2α: One enzyme for two products coupling vesicle trafficking and signal transduction. FEBS Lett. 2015, 589, 1552–1558. [Google Scholar] [CrossRef] [Green Version]
Arcaro, A.; Zvelebil Marketa, J.; Wallasch, C.; Ullrich, A.; Waterfield Michael, D.; Domin, J. Class II Phosphoinositide 3-Kinases Are Downstream Targets of Activated Polypeptide Growth Factor Receptors. Mol. Cell. Biol. 2000, 20, 3817–3830. [Google Scholar] [CrossRef] [Green Version]
Abdi, K.; Neves, G.; Pyun, J.; Kiziltug, E.; Ahrens, A.; Kuo, C.T. EGFR Signaling Termination via Numb Trafficking in Ependymal Progenitors Controls Postnatal Neurogenic Niche Differentiation. Cell Rep. 2019, 28, 2012–2022.e4. [Google Scholar] [CrossRef] [Green Version]
Ramaiah, M.J. mTOR inhibition and p53 activation, microRNAs: The possible therapy against pandemic COVID-19. Gene Rep. 2020, 20, 100765. [Google Scholar] [CrossRef]
Huang, K.; Wang, C.; Vagts, C.; Raguveer, V.; Finn, P.W.; Perkins, D.L. Long non-coding RNAs (lncRNAs) NEAT1 and MALAT1 are differentially expressed in severe COVID-19 patients: An integrated single cell analysis. medRxiv 2021. [Google Scholar] [CrossRef]
Yao, J.; Wang, X.Q.; Li, Y.J.; Shan, K.; Yang, H.; Wang, Y.N.; Yao, M.D.; Liu, C.; Li, X.M.; Shen, Y.; et al. Long non-coding RNA MALAT1 regulates retinal neurodegeneration through CREB signaling. EMBO Mol. Med. 2016, 8, 346–362. [Google Scholar] [CrossRef]
Zhuang, M.; Zhao, S.; Jiang, Z.; Wang, S.; Sun, P.; Quan, J.; Yan, D.; Wang, X. MALAT1 sponges miR-106b-5p to promote the invasion and metastasis of colorectal cancer via SLAIN2 enhanced microtubules mobility. EBioMedicine 2019, 41, 286–298. [Google Scholar] [CrossRef] [Green Version]
Ivanovska, I.; Ball, A.S.; Diaz, R.L.; Magnus, J.F.; Kibukawa, M.; Schelter, J.M.; Kobayashi, S.V.; Lim, L.; Burchard, J.; Jackson, A.L.; et al. MicroRNAs in the miR-106b family regulate p21/CDKN1A and promote cell cycle progression. Mol. Cell. Biol. 2008, 28, 2167–2174. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Su, M.; Chen, Y.; Qi, S.; Shi, D.; Feng, L.; Sun, D. A Mini-Review on Cell Cycle Regulation of Coronavirus Infection. Front. Vet. Sci. 2020, 7, 586826. [Google Scholar] [CrossRef] [PubMed]
Iwamoto, M.; Saso, W.; Sugiyama, R.; Ishii, K.; Ohki, M.; Nagamori, S.; Suzuki, R.; Aizaki, H.; Ryo, A.; Yun, J.-H.; et al. Epidermal growth factor receptor is a host-entry cofactor triggering hepatitis B virus internalization. Proc. Natl. Acad. Sci. USA 2019, 116, 8487–8492. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Hu, W.; Zhang, S.; Shen, Y.; Yang, Q. Epidermal growth factor receptor is a co-factor for transmissible gastroenteritis virus entry. Virology 2018, 521, 33–43. [Google Scholar] [CrossRef]
Zheng, M.; Karki, R.; Williams, E.P.; Yang, D.; Fitzpatrick, E.; Vogel, P.; Jonsson, C.B.; Kanneganti, T.-D. TLR2 senses the SARS-CoV-2 envelope protein to produce inflammatory cytokines. Nat. Immunol. 2021, 22, 829–838. [Google Scholar] [CrossRef]
DePaolo, R.W.; Lathan, R.; Rollins, B.J.; Karpus, W.J. The Chemokine CCL2 Is Required for Control of Murine Gastric Salmonella enterica Infection. Infect. Immun. 2005, 73, 6514–6522. [Google Scholar] [CrossRef] [Green Version]
Gschwandtner, M.; Derler, R.; Midwood, K.S. More Than Just Attractive: How CCL2 Influences Myeloid Cell Behavior Beyond Chemotaxis. Front. Immunol. 2019, 10, 2759. [Google Scholar] [CrossRef] [Green Version]
Dinarello, C.A. Overview of the IL-1 family in innate inflammation and acquired immunity. Immunol. Rev. 2018, 281, 8–27. [Google Scholar] [CrossRef] [Green Version]
Rath, P.C.; Aggarwal, B.B. TNF-induced signaling in apoptosis. J. Clin. Immunol. 1999, 19, 350–364. [Google Scholar] [CrossRef]
Wu, B.; Peisley, A.; Richards, C.; Yao, H.; Zeng, X.; Lin, C.; Chu, F.; Walz, T.; Hur, S. Structural basis for dsRNA recognition, filament formation, and antiviral signal activation by MDA5. Cell 2013, 152, 276–289. [Google Scholar] [CrossRef] [Green Version]
Alcami, A.; Koszinowski, U.H. Viral mechanisms of immune evasion. Trends Microbiol. 2000, 8, 410–418. [Google Scholar] [CrossRef]
Sui, L.; Zhao, Y.; Wang, W.; Wu, P.; Wang, Z.; Yu, Y.; Hou, Z.; Tan, G.; Liu, Q. SARS-CoV-2 Membrane Protein Inhibits Type I Interferon Production Through Ubiquitin-Mediated Degradation of TBK1. Front. Immunol. 2021, 12, 662989. [Google Scholar] [CrossRef] [PubMed]
Vaz de Paula, C.B.; Nagashima, S.; Liberalesso, V.; Collete, M.; da Silva, F.P.G.; Oricil, A.G.G.; Barbosa, G.S.; da Silva, G.V.C.; Wiedmer, D.B.; da Silva Dezidério, F.; et al. COVID-19: Immunohistochemical Analysis of TGF-β Signaling Pathways in Pulmonary Fibrosis. Int. J. Mol. Sci. 2021, 23, 168. [Google Scholar] [CrossRef]
Ferner, R.E.; Aronson, J.K. Remdesivir in COVID-19. BMJ 2020, 369, m1610. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Wang, W.; Chen, J.; Hu, D.; Pan, P.; Liang, L.; Wu, W.; Tang, Y.; Huang, X.R.; Yu, X.; Wu, J.; et al. SARS-CoV-2 N Protein Induces Acute Kidney Injury via Smad3-Dependent G1 Cell Cycle Arrest Mechanism. Adv. Sci. 2022, 9, e2103248. [Google Scholar] [CrossRef]
Kellici, T.F.; Pilka, E.S.; Bodkin, M.J. Therapeutic Potential of Targeting Plasminogen Activator Inhibitor-1 in COVID-19. Trends Pharmacol. Sci. 2021, 42, 431–433. [Google Scholar] [CrossRef]
Angiolillo, A.L.; Sgadari, C.; Taub, D.D.; Liao, F.; Farber, J.M.; Maheshwari, S.; Kleinman, H.K.; Reaman, G.H.; Tosato, G. Human interferon-inducible protein 10 is a potent inhibitor of angiogenesis in vivo. J. Exp. Med. 1995, 182, 155–162. [Google Scholar] [CrossRef] [Green Version]
Romagnani, P.; Annunziato, F.; Lazzeri, E.; Cosmi, L.; Beltrame, C.; Lasagni, L.; Galli, G.; Francalanci, M.; Manetti, R.; Marra, F.; et al. Interferon-inducible protein 10, monokine induced by interferon gamma, and interferon-inducible T-cell alpha chemoattractant are produced by thymic epithelial cells and attract T-cell receptor (TCR) alphabeta+ CD8+ single-positive T cells, TCRgammadelta+ T cells, and natural killer-type cells in human thymus. Blood 2001, 97, 601–607. [Google Scholar]
Sidahmed, A.M.; León, A.J.; Bosinger, S.E.; Banner, D.; Danesh, A.; Cameron, M.J.; Kelvin, D.J. CXCL10 contributes to p38-mediated apoptosis in primary T lymphocytes in vitro. Cytokine 2012, 59, 433–441. [Google Scholar] [CrossRef] [Green Version]
BAX BCL2 Associated X, Apoptosis Regulator [Homo sapiens (Human)]. Available online: https://www.ncbi.nlm.nih.gov/gene/581 (accessed on 18 July 2022).
Milhas, D.; Cuvillier, O.; Therville, N.; Clavé, P.; Thomsen, M.; Levade, T.; Benoist, H.; Ségui, B. Caspase-10 Triggers Bid Cleavage and Caspase Cascade Activation in FasL-induced Apoptosis. J. Biol. Chem. 2005, 280, 19836–19842. [Google Scholar] [CrossRef] [Green Version]
Dzimianski, J.V.; Scholte, F.E.M.; Bergeron, É.; Pegan, S.D. ISG15: It’s Complicated. J. Mol. Biol. 2019, 431, 4203–4216. [Google Scholar] [CrossRef] [PubMed]
Bizzotto, J.; Sanchis, P.; Abbate, M.; Lage-Vickers, S.; Lavignolle, R.; Toro, A.; Olszevicki, S.; Sabater, A.; Cascardo, F.; Vazquez, E.; et al. SARS-CoV-2 Infection Boosts MX1 Antiviral Effector in COVID-19 Patients. iScience 2020, 23, 101585. [Google Scholar] [CrossRef] [PubMed]
Kausar, S.; Said Khan, F.; Ishaq Mujeeb Ur Rehman, M.; Akram, M.; Riaz, M.; Rasool, G.; Hamid Khan, A.; Saleem, I.; Shamim, S.; Malik, A. A review: Mechanism of action of antiviral drugs. Int. J. Immunopathol. Pharmacol. 2021, 35, 20587384211002621. [Google Scholar] [CrossRef] [PubMed]
Piacentini, S.; La Frazia, S.; Riccio, A.; Pedersen, J.Z.; Topai, A.; Nicolotti, O.; Rossignol, J.F.; Santoro, M.G. Nitazoxanide inhibits paramyxovirus replication by targeting the Fusion protein folding: Role of glycoprotein-specific thiol oxidoreductase ERp57. Sci. Rep. 2018, 8, 10425. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Segrelles, C.; Contreras, D.; Navarro, E.M.; Gutiérrez-Muñoz, C.; García-Escudero, R.; Paramio, J.M.; Lorz, C. Bosutinib Inhibits EGFR Activation in Head and Neck Cancer. Int. J. Mol. Sci. 2018, 19, 1824. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Wang, Y.; Schmid-Bindert, G.; Zhou, C. Erlotinib in the treatment of advanced non-small cell lung cancer: An update for clinicians. Ther. Adv. Med. Oncol. 2012, 4, 19–29. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Jeong, J.; Kim, J. Combination Effect of Cilengitide with Erlotinib on TGF-β1-Induced Epithelial-to-Mesenchymal Transition in Human Non-Small Cell Lung Cancer Cells. Int. J. Mol. Sci. 2022, 23, 3423. [Google Scholar] [CrossRef]
Naik, R.R.; Shakya, A.K.; Aladwan, S.M.; El-Tanani, M. Kinase Inhibitors as Potential Therapeutic Agents in the Treatment of COVID-19. Front. Pharmacol. 2022, 13, 806568. [Google Scholar] [CrossRef]
National Center for Biotechnology Information. PubChem Compound Summary for CID 5757, Estradiol. Available online: https://pubchem.ncbi.nlm.nih.gov/compound/5757 (accessed on 25 July 2022).
Kröger, A.; Dallügge, A.; Kirchhoff, S.; Hauser, H. IRF-1 reverts the transformed phenotype of oncogenically transformed cells in vitro and in vivo. Oncogene 2003, 22, 1045–1056. [Google Scholar] [CrossRef] [Green Version]
Malek, D.; Gust, R.; Kleuser, B. 17-Beta-estradiol inhibits transforming-growth-factor-beta-induced MCF-7 cell migration by Smad3-repression. Eur. J. Pharmacol. 2006, 534, 39–47. [Google Scholar] [CrossRef]
Penna, C.; Mercurio, V.; Tocchetti, C.G.; Pagliaro, P. Sex-related differences in COVID-19 lethality. Br. J. Pharmacol. 2020, 177, 4375–4385. [Google Scholar] [CrossRef] [PubMed]
Bhopal, S.S.; Bhopal, R. Sex differential in COVID-19 mortality varies markedly by age. Lancet 2020, 396, 532–533. [Google Scholar] [CrossRef]
Doerre, A.; Doblhammer, G. The influence of gender on COVID-19 infections and mortality in Germany: Insights from age-and gender-specific modeling of contact rates, infections, and deaths in the early phase of the pandemic. PLoS ONE 2022, 17, e0268119. [Google Scholar] [CrossRef] [PubMed]
Zafari Zangeneh, F.; Shoushtari, M.S. Estradiol and COVID-19: Does 17-Estradiol Have an Immune-Protective Function in Women Against Coronavirus? J. Fam. Reprod. Health 2021, 15, 150–159. [Google Scholar] [CrossRef] [PubMed]
Suba, Z. Prevention and therapy of COVID-19 via exogenous estrogen treatment for both male and female patients: Prevention and therapy of COVID-19. J. Pharm. Pharm. Sci. 2020, 23, 75–85. [Google Scholar] [CrossRef] [Green Version]
Gil-Ad, I.; Zolokov, A.; Lomnitski, L.; Taler, M.; Bar, M.; Luria, D.; Ram, E.; Weizman, A. Evaluation of the potential anti-cancer activity of the antidepressant sertraline in human colon cancer cell lines and in colorectal cancer-xenografted mice. Int. J. Oncol. 2008, 33, 277–286. [Google Scholar] [CrossRef] [Green Version]
Chen, S.; Xuan, J.; Wan, L.; Lin, H.; Couch, L.; Mei, N.; Dobrovolsky, V.N.; Guo, L. Sertraline, an Antidepressant, Induces Apoptosis in Hepatic Cells Through the Mitogen-Activated Protein Kinase Pathway. Toxicol. Sci. 2014, 137, 404–415. [Google Scholar] [CrossRef]
Xia, D.; Zhang, Y.T.; Xu, G.P.; Yan, W.W.; Pan, X.R.; Tong, J.H. Sertraline exerts its antitumor functions through both apoptosis and autophagy pathways in acute myeloid leukemia cells. Leuk. Lymphoma 2017, 58, 2208–2217. [Google Scholar] [CrossRef]
Halperin, D.; Reber, G. Influence of antidepressants on hemostasis. Dialogues Clin. Neurosci. 2007, 9, 47–59. [Google Scholar] [CrossRef]
Chen, B.S.; Wu, C.C. Systems Biology: An Integrated Platform for Bioinformatics, Systems Synthetic Biology and Systems Metabolic Engineering; Nova Publishers: Hauppauge, NY, USA, 2014. [Google Scholar]
Puray-Chavez, M.; LaPak, K.M.; Schrank, T.P.; Elliott, J.L.; Bhatt, D.P.; Agajanian, M.J.; Jasuja, R.; Lawson, D.Q.; Davis, K.; Rothlauf, P.W.; et al. Systematic analysis of SARS-CoV-2 infection of an ACE2-negative human airway cell. Cell Rep. 2021, 36, 109364. [Google Scholar] [CrossRef]
Dong, J.; Yao, Z.-J.; Zhang, L.; Luo, F.; Lin, Q.; Lu, A.-P.; Chen, A.F.; Cao, D.-S. PyBioMed: A python library for various molecular representations of chemicals, proteins and DNAs and their interactions. J. Cheminformatics 2018, 10, 16. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. Dropout: A simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 2014, 15, 1929–1958. [Google Scholar]
Kingma, D.P.; Ba, J. Adam: A method for stochastic optimization. arXiv 2014, arXiv:1412.6980. [Google Scholar]
Subramanian, A.; Narayan, R.; Corsello, S.M.; Peck, D.D.; Natoli, T.E.; Lu, X.; Gould, J.; Davis, J.F.; Tubelli, A.A.; Asiedu, J.K. A next generation connectivity map: L1000 platform and the first 1,000,000 profiles. Cell 2017, 171, 1437–1452.e17. [Google Scholar] [CrossRef] [PubMed]
Seçilmiş, D.; Hillerton, T.; Morgan, D.; Tjärnberg, A.; Nelander, S.; Nordling, T.E.M.; Sonnhammer, E.L.L. Uncovering cancer gene regulation by accurate regulatory network inference from uninformative data. NPJ Syst. Biol. Appl. 2020, 6, 37. [Google Scholar] [CrossRef]
Corsello, S.M.; Nagari, R.T.; Spangler, R.D.; Rossen, J.; Kocak, M.; Bryan, J.G.; Humeidi, R.; Peck, D.; Wu, X.; Tang, A.A.; et al. Discovering the anticancer potential of non-oncology drugs by systematic viability profiling. Nat. Cancer 2020, 1, 235–248. [Google Scholar] [CrossRef] [Green Version]
Xiong, G.; Wu, Z.; Yi, J.; Fu, L.; Yang, Z.; Hsieh, C.; Yin, M.; Zeng, X.; Wu, C.; Lu, A.; et al. ADMETlab 2.0: An integrated online platform for accurate and comprehensive predictions of ADMET properties. Nucleic Acids Res. 2021, 49, W5–W14. [Google Scholar] [CrossRef]

Figure 1. The flowchart of the systems biology method of investigating pathogenesis of SARS-CoV-2 infection to identify significant biomarkers as drug targets and systems drug discovery via a DNN-based DTI model to predict candidate molecular drugs for drug targets through a deep learning scheme and then to screen multiple-molecule drugs by drug design specifications for disrupting the SARS-CoV-2 infection at the amplification and saturation stages.

Figure 2. Visualization of core HPI-GWGEN of amplification and saturation infectious stages. The blue and green edges (lines) represent the protein-protein interactions and gene regulations, respectively. The numbers of each type of node (protein, receptor, transcription factor, miRNA, lncRNA, and virus) are shown in the figure. (A,B) are the core HPI-GWGENs of amplification and saturation infectious stages, respectively.

Figure 3. The core HPI signaling pathways and their downstream abnormal cellular functions of SARS-CoV-2 infection in (i) amplification stage (left-hand side), (ii) common between amplification and saturation infectious stages (red background in the middle of the figure), and (iii) saturation infectious stage (right-hand side). The green nodes represent the high expression of protein/gene. The red nodes represent the low expression of protein/gene.

Figure 4. The flowchart of multiple-molecule drug design of the amplification and saturation stages of SARS-CoV-2 infection based on DNN-based DTI model and drug design specifications. The DNN-based DTI model was first trained by DTI data at the right column. Then, the well-trained DNN-based DTI model was used for the prediction of candidate drugs and the candidate drugs were finally screened by drug design specifications as the potential drugs to combine a multiple-molecule drug in the left column.

Figure 5. The training and validation loss is represented in (A), and the training and validation accuracy is represented in (B). The early stop is adopted to avoid overfitting to stop at epoch = 117.

Figure 6. The ROC curve of DNN-based DTI model. The higher AUC value indicates the higher ability to distinguish positive and negative interaction. The worst case (AUC = 0.5) is shown with the dotted line, which implies the model predicts the positive and negative randomly.

Table 1. The number of nodes and edges in candidate and real HPI-GWGENs at amplification and saturation stages. After system identification and system order detection, the false positives in each node of candidate HPI-GWGEN can be pruned through each infectious stage of HPI RNA-seq data and then form the real HPI-GWGEN of each infectious stage.

Node	Candidate HPI-GWGEN	Amplification Stage Real HPI-GWGEN	Saturation Stage Real HPI-GWGEN
Receptor	2294	2294	2294
Transcription factor	1452	1449	1449
Protein coding	13,989	13,980	13,981
miRNA	154	153	152
lncRNA	8827	1023	664
Virus	11	11	11
Total nodes	26,727	18,910	18,551
Edge	Candidate HPI-GWGEN	Amplification Stage Real HPI-GWGEN	Saturation Stage Real HPI-GWGEN
PPIs	4,722,699	953,464	1,053,818
TF -> Receptor	14,633	9493	8697
TF -> TF	11,846	7168	6590
TF -> Protein	84,183	55,326	51,376
TF -> miRNA	178	102	88
TF -> lncRNA	301	290	291
TF -> Virus	15,972	131	59
miRNA -> Receptor	88,424	10,871	10,267
miRNA -> TF	71,046	9228	8256
miRNA -> Protein	570,830	72,563	67,652
miRNA -> lncRNA	5502	581	478
miRNA -> Virus	1694	23	12
lncRNA -> Receptor	436	313	306
lncRNA -> TF	472	270	287
lncRNA -> Protein	4274	2288	2337
lncRNA -> miRNA	7	5	4
lncRNA -> lncRNA	4	4	3
lncRNA -> Virus	97,097	753	244
Virus -> Virus	121	22	4
Total edges	5,689,719	1,122,895	1,210,769

Table 2. The enrichment analysis of KEGG pathways for core HPI-GWGEN of the amplification infectious stage.

KEGG Pathway	Count	p-Value
Cell cycle	86	6.32 × 10⁻¹⁷
FoxO signaling pathway	80	7.83 × 10⁻¹²
Pathways in cancer	236	1.37 × 10⁻¹⁰
Hepatitis B	90	4.19 × 10⁻¹²
Hepatitis C	84	1.81 × 10⁻⁸
ErbB signaling pathway	52	5.17 × 10⁻⁸
Tight junction	87	1.00 × 10⁻⁷
MAPK signaling pathway	133	6.95 × 10⁻⁷
Endocytosis	113	6.90 × 10⁻⁶

Table 3. The enrichment analysis of KEGG pathways for core HPI-GWGEN of the saturation infectious stage.

KEGG Pathway	Count	p-Value
Pathways in cancer	227	6.39 × 10⁻⁹
Th17 cell differentiation	59	8.13 × 10⁻⁷
Cell cycle	66	1.21 × 10⁻⁶
Osteoclast differentiation	66	2.46 × 10⁻⁶
T cell receptor signaling pathway	56	2.93 × 10⁻⁶
Human T-cell leukemia virus 1 infection	102	3.81 × 10⁻⁶
Apoptosis	68	6.66 × 10⁻⁶
Hepatitis B	77	1.59 × 10⁻⁵
Hepatitis C	75	1.7 × 10⁻⁵

Table 4. The prediction performance of DNN-based DTI model by 10-fold cross-validation.

	Validation Loss	Validation Accuracy	Test Loss	Test Accuracy
1	0.1656409	0.95065	0.2180897	0.9521126
2	0.1929858	0.9438001	0.1789017	0.9493726
3	0.1807019	0.9504943	0.1856036	0.9519569
4	0.1761861	0.9507278	0.2022759	0.951521
5	0.1868679	0.9527516	0.216308	0.9517078
6	0.1671205	0.9526701	0.1956254	0.9513031
7	0.1850127	0.9536821	0.1776747	0.9516767
8	0.1905898	0.9474545	0.1865388	0.9505246
9	0.1813395	0.9499455	0.1792253	0.951988
10	0.1789479	0.9497898	0.1836274	0.9529221
Average	0.1805393	0.9501966	0.192387	0.9515085
Standard Deviation	0.0086033	0.0027209	0.0143861	0.0009184

Table 5. According to drug design specifications, the candidate drugs for each significant drug target are list below.

Candidate Drugs	Regulation Ability (L1000)	Sensitivity (PRISM)	Toxicity (LC50, mol/kg)
Downregulation of EGFR
Fursultiamine	−0.932	−0.035	2.928
fasudil	−0.791	0.367	3.083
* Bosutinib	−0.585	−0.017	6.273
cefaclor	−0.383	−0.099	3.666
* Erlotinib	−0.229	−0.332	5.73
Downregulation of AKT1
Iproniazid	−0.802	−0.337	2.82
gabexate	−0.733	−0.134	4.487
diazoxide	−0.544	0.393	3.058
* Bosutinib	−0.434	−0.017	6.273
Apoptosis-activator-II	−0.302	0.037	5.695
Upregulation of IFNB1
topiramate	0.848	0.161	2.289
* 17-beta-estradiol	0.72	−0.27	5.215
nitrofural	0.691	−0.404	3.88
raclopride	0.514	0.078	3.851
Acyclovir	0.363	0.3078	2.452
Downregulation of SMAD3
niridazole	−0.772	0.264	2.746
* Erlotinib	−0.537	−0.332	5.730
* 17-beta-estradiol	−0.503	−0.27	5.215
Azacitidine	−0.412	−0.393	2.049
Nobiletin	−0.312	−0.448	5.214
Upregulation of JUN
oleoylethanolamide	0.878	−0.15	3.54
carmoxirole	0.776	−0.006	4.477
zibotentan	0.611	0.209	3.013
* Sertraline	0.557	0.097	7.434
Limonin	0.367	−0.36	6.726

The drugs with a star (*) are the potential drugs for multiple-molecule drugs in Table 6 and Table 7.

Table 6. The proposed potential multiple-molecule drug with the corresponding targets for disrupting the progression of the amplification stage of SARS-CoV-2 infection.

	EGFR	AKT1	IFNB1	SMAD3
Drug	EGFR	AKT1	IFNB1	SMAD3
Bosutinib	V	V
Erlotinib	V			V
17-beta-estradiol			V	V
Structure of multiple-molecule drug
Bosutinib	Erlotinib		17-beta-estradiol

V indicates the drug can induce or inhibit the corresponding target.

Table 7. The proposed potential multiple-molecule drug with the corresponding targets for shortening the course of the saturation stage of SARS-CoV-2 infection.

	SMAD3	IFNB1	JUN
Drug	SMAD3	IFNB1	JUN
Erlotinib	V
17-beta-estradiol	V	V
Sertraline			V
Structure of multiple-molecule drug
Erlotinib		17-beta-estradiol	Sertraline

V indicates the drug can induce or inhibit the corresponding target.

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, C.-G.; Chen, B.-S. Multiple-Molecule Drug Repositioning for Disrupting Progression of SARS-CoV-2 Infection by Utilizing the Systems Biology Method through Host-Pathogen-Interactive Time Profile Data and DNN-Based DTI Model with Drug Design Specifications. Stresses 2022, 2, 405-436. https://doi.org/10.3390/stresses2040029

AMA Style

Wang C-G, Chen B-S. Multiple-Molecule Drug Repositioning for Disrupting Progression of SARS-CoV-2 Infection by Utilizing the Systems Biology Method through Host-Pathogen-Interactive Time Profile Data and DNN-Based DTI Model with Drug Design Specifications. Stresses. 2022; 2(4):405-436. https://doi.org/10.3390/stresses2040029

Chicago/Turabian Style

Wang, Cheng-Gang, and Bor-Sen Chen. 2022. "Multiple-Molecule Drug Repositioning for Disrupting Progression of SARS-CoV-2 Infection by Utilizing the Systems Biology Method through Host-Pathogen-Interactive Time Profile Data and DNN-Based DTI Model with Drug Design Specifications" Stresses 2, no. 4: 405-436. https://doi.org/10.3390/stresses2040029

Article Menu

Multiple-Molecule Drug Repositioning for Disrupting Progression of SARS-CoV-2 Infection by Utilizing the Systems Biology Method through Host-Pathogen-Interactive Time Profile Data and DNN-Based DTI Model with Drug Design Specifications

Abstract

1. Introduction

2. Results

2.1. Core HPI Signaling Pathways during Amplification and Saturation Stage of SARS-CoV-2 Infection by the Systems Biology Method

2.2. Investigation of Specific Core HPI Signaling Pathways and Their Downstream Abnormal Cellular Functions during SARS-CoV-2 Infection

2.2.1. Investigation of Specific Core HPI Signaling Pathways in Amplification Infectious Stage

2.2.2. Investigation of Common Core HPI Signaling Pathways of Amplification and Saturation Infectious Stages

2.2.3. Investigation of Specific Core HPI Signaling Pathways at Saturation Infectious Stage

2.3. Multiple-Molecule Drug Discovery and Design by DNN-Based DTI Model with Drug Design Specifications

2.3.1. Prediction Performance of DNN-Based DTI Model

2.3.2. Multiple-Molecule Drug Repositioning for Disrupting the Progression of SARS-CoV-2 Infection

3. Discussion

4. Materials and Methods

4.1. Construction of the Candidate HPI-GWGEN Using Big Data Mining

4.2. System Identification of HPI-GWGEN Using HPI RNA-Seq Time-Profile Data

4.2.1. HPI RNA-Seq Time-Profile Data

4.2.2. Dynamic Models for HPI-GWGEN

4.2.3. System Identification and System Order Selection for HPI-GWGEN

4.3. PNP Method to Extract the Core HPI-GWGEN from Network Matrix of Real HPI-GWGEN

4.4. Systematic Discovery and Design of Multiple-Molecule Drug by UtilizingDNN-Based DTI Model with Drug Design Specifications

4.4.1. Preprocess of Targets and Drugs Data

4.4.2. Architecture of DNN-Based DTI Model

4.4.3. Drug Design Specifications

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI