Next Article in Journal
Proteomics Analysis Reveals Diverse Molecular Characteristics between Endocardial and Aortic-Valvular Endothelium
Next Article in Special Issue
NGS Analysis of Liquid Biopsy (LB) and Formalin-Fixed Paraffin-Embedded (FFPE) Melanoma Samples Using Oncomine™ Pan-Cancer Cell-Free Assay
Previous Article in Journal
Transcriptomic Analysis of Drug-Resistance Acinetobacter baumannii under the Stress Condition Caused by Litsea cubeba L. Essential Oil via RNA Sequencing
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

VECTOR: An Integrated Correlation Network Database for the Identification of CeRNA Axes in Uveal Melanoma

1
Department of Biomedical and Biotechnological Sciences—Section of Biology and Genetics, University of Catania, 95123 Catania, Italy
2
Department of Clinical and Experimental Medicine, University of Catania, c/o Dipartimento di Matematica e Informatica, Viale A. Doria 6, 95125 Catania, Italy
3
Department of Experimental Oncology, Mediterranean Institute of Oncology (IOM), 95029 Catania, Italy
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Genes 2021, 12(7), 1004; https://doi.org/10.3390/genes12071004
Submission received: 18 May 2021 / Revised: 28 June 2021 / Accepted: 29 June 2021 / Published: 29 June 2021
(This article belongs to the Special Issue Genetics and Genomics of Melanoma)

Abstract

:
Uveal melanoma (UM) is the most common primary intraocular malignant tumor in adults and, although its genetic background has been extensively studied, little is known about the contribution of non-coding RNAs (ncRNAs) to its pathogenesis. Indeed, its competitive endogenous RNA (ceRNA) regulatory network comprising microRNAs (miRNAs), long non-coding RNAs (lncRNAs) and mRNAs has been insufficiently explored. Thanks to UM findings from The Cancer Genome Atlas (TCGA), it is now possible to statistically elaborate these data to identify the expression relationships among RNAs and correlative interaction data. In the present work, we propose the VECTOR (uVeal mElanoma Correlation NeTwORk) database, an interactive tool that identifies and visualizes the relationships among RNA molecules, based on the ceRNA model. The VECTOR database contains: (i) the TCGA-derived expression correlation values of miRNA-mRNA, miRNA-lncRNA and lncRNA-mRNA pairs combined with predicted or validated RNA-RNA interactions; (ii) data of sense-antisense sequence overlapping; (iii) correlation values of Transcription Factor (TF)-miRNA, TF-lncRNA, and TF-mRNA pairs associated with ChiPseq data; (iv) expression data of miRNAs, lncRNAs and mRNAs both in UM and physiological tissues. The VECTOR web interface can be queried, by inputting the gene name, to retrieve all the information about RNA signaling and visualize this as a graph. Finally, VECTOR provides a very detailed picture of ceRNA networks in UM and could be a very useful tool for researchers studying RNA signaling in UM. The web version of Vector is freely available at the URL reported at the end of the Introduction.

Graphical Abstract

1. Introduction

Uveal melanoma (UM) is the second most common type of human melanoma and the most frequent primary tumor of the eye in adults, with an annual incidence of 6–7 cases per million per year [1]. It mainly affects the choroid and its early metastasis, mostly to the liver, leads to 50% of the death rate in patients [2]. Several molecular alterations have been associated with the development of UM; however, its etiology remains unclear. Monosomy of chromosome 3 and gain of 8q are often found in UM patients [3]. Similarly, UM progression is frequently linked to oncogenic mutations of some genes, such as G protein subunit α q (GNAQ), G protein subunit α 11 (GNA11) and BRCA associated protein 1 (BAP1), related to transcriptional, post-transcriptional and post-translational dysregulations impairing cell cycle and apoptosis [4]. Understanding UM tumorigenesis solely by investigating genetic mechanisms is too limiting. Epigenetic alterations may be considered an important hallmark of cancer because of their critical role in the initiation of tumorigenesis [5]; however, these mechanisms are not well characterized in the onset and progression of UM. Cancer-related epigenetic phenomena include altered methylation of oncogenes and tumor suppressor genes, irregular histone modification pattern and aberrant expression of non-coding RNAs (ncRNAs) [6,7]. This last issue has been extensively explored in the last two decades in the most common types of cancer. NcRNAs mainly include microRNAs (miRNAs) and long non-coding RNAs (lncRNAs). More specifically, miRNAs are 18–25 nucleotide RNA molecules inducing mRNA degradation or translation inhibition through binding to the miRNA response element (MRE) at 3′ untranslated regions (3′-UTRs) of mRNAs. LncRNAs are the most functionally heterogeneous class of ncRNAs, with lengths ranging from 200 nt to 100,000 nt. They also contain MREs and, accordingly, can sequester specific miRNAs preventing them from binding mRNAs. On the other hand, miRNA binding to lncRNAs could promote their decay, under specific stoichiometric conditions [8,9]. In other words, some lncRNAs and mRNAs can share the same MRE and compete for binding to the same miRNAs, creating, in this way, a competitive endogenous RNA (ceRNA) regulatory network. Through such a competitive mechanism of RNA-RNA interactions, increased levels of lncRNAs decrease the quantity of available miRNAs and prevent mRNA degradation or translation block. Conversely, downregulation of a specific lncRNA induces miRNA release and mRNA degradation or reduced translation [9]. This “miRNA sponge” function operated by lncRNAs has been described in many cancer models [10,11,12,13], including UM [14,15]. It has been reported that perturbations of the ceRNA network influence all the cellular signaling underlying cancer phenotypes; however, this topic is currently still poorly investigated in UM. With the advances in RNA sequencing technology and the emergence of public cancer genomics projects, such as The Cancer Genome Atlas (TCGA), a huge amount of transcriptomic data from tumor specimens has been generated and made publicly available for anyone in the research community to use. These data can be computationally elaborated and combined with other data of a different nature, such as RNA-RNA binding and regulation by transcription factors, to build ncRNA networks allowing researchers to in silico explore cancer-related RNA signaling. For this purpose, we calculated the expression correlation of miRNAs, lncRNAs and mRNAs from the UM TCGA dataset [16], consisting of 80 UM patients. This information was combined with data (where available) of RNA-RNA interactions, regulation by transcription factors, and genomic positions. All these data have been collected and stored in a novel database called VECTOR (uVeal mElanoma Correlation neTwORk). VECTOR, through specific RNA-species queries, can identify new RNA circuits and their regulation signaling in UM. VECTOR makes UM RNA networks available to cancer researchers who intend to computationally explore the potential cross-talking between ncRNAs and mRNAs before performing in vitro or in vivo experiments on ocular melanoma models.

2. Materials and Methods

2.1. Data Collection

All the normalized expression data about TCGA UM (80 tumor tissue samples) were extracted from the UCSC Xena Server (https://xena.ucsc.edu, accessed on 28 December 2020). Normalization was performed by the TCGA Consortium. MiRNA expression data of physiological tissues were retrieved from Human miRNA tissue atlas (https://ccb-web.cs.uni-saarland.de/tissueatlas/, accessed on 28 December 2020) [17], while mRNAs and lncRNAs expression data were from Expression Atlas (https://www.ebi.ac.uk/gxa/home, accessed on 28 December 2020) (accession E-MTAB-2836) [18,19,20]. Validated or predicted interactions between miRNAs and mRNAs were retrieved by DIANA-TarBase v8 (https://carolina.imis.athena-innovation.gr/diana_tools/web/index.php?r=tarbasev8%2Findex, accessed on 28 December 2020) [21] and miRTarBase (http://miRTarBase.cuhk.edu.cn, accessed on 28 December 2020) [22]; while, lncRNA:miRNA interaction data were from miRcode (www.mircode.org, accessed on 28 December 2020), Encori (http://starbase.sysu.edu.cn, accessed on 28 December 2020) [23], DIANA-LncBase v3 (https://diana.e-ce.uth.gr/lncbasev3, accessed on 28 December 2020) [24]. Information on binding of transcription factors to promoters of miRNAs, lncRNAs, mRNAs were extracted as ChiPseq data elaborated by TransmiR v2.0 (http://www.cuilab.cn/transmir, accessed on 28 December 2020) [25], ENCODE (https://www.encodeproject.org, accessed on 28 December 2020) [26] and ChEA (http://amp.pharm.mssm.edu/Enrichr, accessed on 28 December 2020) [27,28]. Genomic positions were retrieved by using the UCSC genome browser (https://genome.ucsc.edu, accessed on 28 December 2020) [29].

2.2. Data Elaboration

Expression data retrieved from the TCGA dataset were filtered. To avoid any statistical confounding effect, null expression values were excluded: we arbitrarily chose to maintain for successive analyses only RNAs showing expression values greater than 0 in at least 60 out of 80 samples (75% of samples). Then, expression correlation matrices based on the Pearson calculation were computed between (a) miRNAs and mRNAs, (b) miRNAs and lncRNAs, and (c) lncRNAs and mRNAs. Based on the “miRNA sponge model”, we considered those RNA axes characterized by the following mathematical correlation as consistent: miRNAx:lncRNAy (negative Pearson) + miRNAx:mRNAz (negative Pearson) + lncRNAy:mRNAz (positive Pearson). Separately, we extracted from the mRNA dataset the expression of transcription factors (TFs) according to the list deposited on The Human Transcription Factors website (humantfs.ccbr.utoronto.ca). We then computed the expression correlation matrix between TFs and all the other RNA classes, for each correlation coefficient, a p-value was calculated.

2.3. VECTOR Data and Architecture

The VECTOR database has been built on top of the Laravel model-view-controller framework. All the data have been collected into a Neo4j database. Data processing has been carried out in R, Python and PHP. All the components of the Web Interface have been implemented in React native to ensure high modularity and dynamicity (Figure 1).
VECTOR stores the following information:
  • The correlation values (Pearson coefficients) of miRNA-mRNA, miRNA-lncRNA and lncRNA-mRNA pairs in UM samples. These correlations are used to create correlation networks, which show feedback loops involving the three classes of RNAs. The above-mentioned pairs of molecules are associated with data coming from miRBase, miRTarBase, LncBase, miRcode and Encori databases, also storing information about the predicted or validated RNA-RNA interactions. All correlation values can be downloaded by users at the “Download” section.
  • Overlapping of genomic positions between mRNAs and lncRNAs, in order to find couples of sense-antisense transcripts.
  • Correlation coefficients of TF-miRNA, TF-lncRNA, and TF-mRNA pairs in UM samples. These TF:RNA couples were associated with ChiPseq data of TF binding from TransmiR, ENCODE, and ChEA, in order to corroborate the potential TF regulation on miRNAs, lncRNAs and mRNAs.
  • The expression values of miRNAs, expressed as log2(RPM + 1), mRNAs and lncRNAs, expressed as log2(x + 1) normalized count, were retrieved from the TCGA dataset. Assitionally, VECTOR includes expression data of miRNAs, mRNAs and lncRNAs in several physiological tissues, reported as quantile normalized expression (miRNAs) and TPM (Transcripts Per Kilobase Million) (mRNAs and lncRNAs). Clinicopathological parameters of UM patients included in the UM TCGA dataset were collected and stored in VECTOR.

2.4. VECTOR Web Interface

The GUI consists of two sections: the Menu section (Figure 2) and the Results section (Figure 3).
The Menu section (Figure 2) enables a user to provide the searching parameters through the following query types:
  • The “Circuits” menu allows users to look for the molecular axes generated by lncRNA-mRNA-miRNA correlations. Users have to provide the name of at least one element that is part of the circuit (official gene symbol for mRNAs and lncRNAs, miRBase ID for miRNAs) (Figure 2A, red rectangle), and the minimum correlation coefficient of the miRNA-mRNA, lncRNA-mRNA and miRNA-lncRNA pairs. Alternatively, users can filter the output by p-value. (Figure 2A, yellow rectangle). The last parameter, named “Top n” (Figure 2A, green rectangle), limits the number of returned “triangular RNA circuits” in order to ensure a better readability of the plotted results, as well as a shorter processing time.
  • The “Antisense” menu enables users to look for the sense-antisense sequence overlapping between mRNAs and lncRNAs. In this case, the user has to provide the mRNA and/or lncRNA name (Figure 2B).
  • The “TF search” menu enables users to extract from our database the TF-mRNA, TF-miRNA and TF-lncRNA pairs in terms of correlation data and ChiPseq information about a given transcription factor. Therefore, the user has to provide the TF name (official gene symbol) and/or the name of a lncRNA, miRNA, and/or mRNA (Figure 2C) before submitting the search form.
  • The “Expression” menu allows users to evaluate the expression levels of a selected mRNA, lncRNA or miRNA in the UM TCGA dataset and in several physiological tissues. The users have to choose the RNA molecule for which expression values in both UM and physiological tissues will be shown as histograms. To infer the potential association between the expression of a specific RNA molecule in UM and the clinicopathological parameters of UM patients, the users can select the intended parameter and VECTOR will return a new histogram graph, where UM samples are shown grouped for the selected parameter.
  • The Results section (Figure 3) plots the obtained results as a network or a table.
  • Once the “Circuits” or “Antisense” searching query is submitted, results will be shown through an interactive network comprising nodes and edges (Figure 3 and Figure 4). The nodes represent the RNA species: the mRNAs are shown with blue circles, the miRNAs with red triangles, and the lncRNAs with orange squares. These can be inspected (by clicking on them) to get a table listing all the TFs they interact with. The edges represent the relationships between two RNA molecules (i.e., expression correlation and potential physical interaction). Different styles and colors discriminate the kind of relationship between RNAs: red edges imply a positive expression correlation between two RNA elements, while green edges show the anti-correlation of an expression. In addition to the color, each edge is also marked with the correlation or anti-correlation numeric value, while the p-value is shown in a small pop-up window which appears by clicking on the edge. Moreover, in the “Circuits” section, when the expression relationship is confirmed by at least one of the databases (miRBase, miRTarBase, LncBase, miRcode, and Encori), the edge is depicted as a solid line; otherwise it is a dotted line. The database confirming the expression relationship is shown in a small pop-up window which appears by clicking on the edge.
Together with each circuit, VECTOR generates a heatmap showing the expression of each member of each circuit in the UM TCGA dataset. Expression values will be represented as a color-coded scale ranging from the minimum (green) to the median (black) to the maximum (red) expression value for each RNA molecule. The heatmaps are then shown below the network image.
  • When a “TF searching” query is submitted, the obtained records are shown in a tabular format (Figure 5). Such tables contain the TF-mRNA, TF-lncRNA, or TF-miRNA expression correlations and potential physical interactions between TFs and gene promoters. TF binding to the promoter is linked to ChIPseq data from ENCODE, ChEA, and TransmiR, reported as a binary table (i.e., 0: no ChIPseq data; 1: ChIPseq data demonstrating the TF interaction on the gene promoter).
  • Expression of the chosen RNA molecule in both the UM TCGA dataset and physiological tissues is shown as a histogram (Figure 6). For the UM expression data, samples are shown grouped according to a specific clinicopathological parameter, which can be selected by the user. Selecting a different parameter, samples will be reorganized in order to group all samples sharing that clinical feature. For numerical parameters, samples are shown in ascending order for the parameter. This function will allow users to observe potential expression trends for a chosen RNA molecule in association with a specific clinical feature.

3. Results

3.1. Global Identification of lncRNA–miRNA–mRNA Axes in UM

Following our filtering approach, we built correlation matrices made up of 14,500 mRNAs, 733 lncRNAs, and 612 miRNAs. We obtained from three different matrices (i.e., mRNAs:miRNAs, mRNAs:lncRNAs, lncRNAs:miRNAs) 8,568,000, 10,628,500 and 448,596 correlation coefficients, respectively. All possible Mrna–miRNA pairs from correlation matrices were matched with mRNA–miRNA interaction data from TarBase and miRTarBase, while all lncRNA–miRNA couples were matched with lncRNA–miRNA interaction data from miRcode, lncBase and Encori. By this approach, we obtained multiple sets of triangle-shaped network motifs composed as follows: (1) miRNA–lncRNA axis (negatively correlated and interacting with each other), (2) miRNA–mRNA axis (negatively correlated and interacting with each other), (3) lncRNA–mRNA axis (positively correlated RNAs). According to the correlation thresholds applied to all pairs of the network motifs (from |0.2| to |0.7| for both positive and negative correlations), we generated different numbers of network motifs, as shown in Figure 7. By using a low-moderate threshold (correlation coefficient >0.4 or <−0.4), we obtained 1,806,064 correlation-based lncRNA-miRNA-mRNA network motifs, 5412 of which were characterized by a physical interaction according to at least one database in the miRNA–lncRNA axis and one in the miRNA–mRNA axis. The high-moderate threshold (correlation coefficient >0.5 or <−0.5) provided 199,594 correlation-based network motifs, 467 of which featuring physical interactions. The most stringent threshold (correlation coefficient >0.6 or <−0.6) featuring physical interactions between RNA molecules provided the 11 RNA network motifs reported in Table 1. The threshold correlation coefficient >0.7 resulted in 81 lncRNA–miRNA–mRNA network motifs that did not show predicted or validated RNA–RNA interactions.

3.2. Relationship Between Genomic Overlapping and Expression of Sense-Antisense Transcript Pairs

The overlapping of genomic positions of lncRNAs and mRNAs and its combination with expression correlation data allowed us to explore the possibility of UM expression regulation mediated by sequence complementarity between genes that partially share the same locus on opposite DNA strands. Genomic positions of mRNAs were superimposed on those of lncRNAs, obtaining 198 matches. Specifically, 143 lncRNA:mRNA pairs (72.2%) showed partial overlapping, while 55 (27.8%) included one shorter locus that totally overlapped a longer one. Considering all 198 pairs, 60 pairs (30.3%) overlapped in the 3′ regions (convergent pairs), 83 (41.9%) overlapped in the 5′ regions (divergent pairs), 40 (20.2%) included lncRNA loci totally overlapped the mRNA loci, and 15 (7.6%) mRNA loci totally overlapping lncRNA loci (Figure 8A). We also divided each overlapped the class into subgroups according to the trend of expression correlation (i.e., positive and negative correlations), and observed that lncRNA and mRNA pairs mostly showed a positive rather than a negative expression correlation, particularly in divergent pairs (Figure 8B).
To investigate whether the percentage of sequence overlapping affects the direction of expression correlation between lncRNA and mRNA, we calculated the Pearson coefficient between the expression correlation values and the number of overlapping bases. For pairs including multiple splicing variants for lncRNA and/or mRNA, the length of overlapping regions was calculated as mean, minimum and maximum; in these cases, three correlation coefficients were computed. We observed a negative correlation between the expression correlation and all length values of overlapping regions: mean: −0.23; minimum: −0.19; maximum: −0.23 (Figure 8C). These results would suggest that the lower the overlapping length for two sense-antisense sequences, the stronger their expression correlation.

3.3. The Genome-Wide Identification of TFs Regulating mRNAs, lncRNAs and miRNAs in UM

We built three different correlation matrices made up of (1) 1278 TFs and 14,500 mRNAs; (2) 1278 TFs and 612 miRNAs; and (3) 1278 TFs and 733 lncRNAs. By this approach, we obtained 1,853,1000 correlation coefficients for the TF:mRNA matrix, 782,136 for TF–miRNA matrix, and 936,774 for TF–lncRNA. All TF–xRNA couples were screened for binding of TFs on promoters of xRNA genes (xRNA = any type of RNA), according to ChiPseq data from ENCODE, ChEA, and TransmiR. Finally, we obtained several sets of positively or negatively correlated TF–xRNA pairs, whose TF potentially binds the promoter of xRNA genes (Figure 9). According to the most stringent negative and positive correlation coefficient thresholds and the presence of ChiPseq hits, we retrieved potential UM TF–target pairs. More specifically, we obtained 631 TF–mRNA, 18 TF–miRNA, 42 TF–lncRNA pairs. From these data, we extrapolated the 23 most frequent TFs regulating mRNAs, miRNAs and lncRNAs in UM (Table 2). Unsurprisingly, most TFs reported in Table 2 have a confirmed oncogenic role in different types of neoplasia, including melanoma.
The most frequent TFs were retrieved by using the most stringent and evaluable parameters of TF querying by VECTOR (TF–mRNA = correlation coefficient <−0.9 and >0.9; TF–miRNA = correlation coefficient <−0.6 and >0.7; TF–lncRNA = correlation coefficient <−0.7 and >0.8). From this output, the TFs shows that at least three targets were retrieved. The negative expression correlation between TFs and targets are indicated with a minus symbol between brackets (-) next to the target name. / = no data. Literature regulation mechanisms are highlighted in bold; references reporting the role in cancer or the regulation of the transcript are shown as PubMed IDs.

4. Discussion

In the last decade, a growing number of experimental studies have demonstrated that RNA-RNA crosstalk is implicated in cell-fate determination and in various human diseases, including cancer. CeRNA mechanisms are able to modulate concentration and functions of RNA elements from specific molecular axes and, accordingly, regulate essential biological processes [9,30]. The combination of expression relationships among RNA molecules and their complementarity-based binding provides a reliable scenario of RNA network structure and represents a pivotal starting point for planning experimental procedures to validate and functionally analyze RNA circuits. This methodological approach is very common in most studies concerning the role of lncRNAs in cancer: researchers retrieve expression and interaction data from different public databases and then compute and integrate them to obtain an RNA signaling to experimentally evaluate [31,32,33,34]. This computational method could take a few hours, depending on the amount of information available for a specific cancer model and the researchers’ expertise. Based on these considerations, we created VECTOR, a simple and intuitive database containing the elaboration and integration of expression correlations and experimental and predicted interactions among lncRNAs, miRNAs and mRNAs. Users are allowed to inspect expression of mRNAs, lncRNAs or miRNAs not only in the UM TCGA dataset, but also in other physiological tissues, to evaluate ubiquitarity or specificity of expression. Expression data are shown as histograms. Additionally, VECTOR allows to observe potential association trends between expression and clinical features of UM patients: indeed, the expression data of UM samples can be shown by grouping or ordering samples according to a selected clinicopathological parameter. This option allows to observe an increasing/decreasing expression trend in association with tumor stage, tumor size (thickness or basal diameter), metastasis, and other features. To investigate ceRNA networks, the query can be customized by choosing the type and magnitude of correlations. The relationships among RNAs are visualized as graphs featuring information about correlation and physical interaction in order to make the output easier to understand; expression of each member of the network are represented below as heatmaps. In addition to the classical ceRNA hypothesis view based on the lncRNA-miRNA-mRNA axis, VECTOR also queries the relationships between sense-antisense transcripts and the potential transcriptional regulations by transcription factors in order to obtain a more systemic view of RNA signaling. The choice of UM as the cancer model of VECTOR was dictated by the fact that it is a rare tumor and that there are not many studies on the ncRNAs involved in the onset and progression of this neoplasm. Thanks to UM TCGA findings, it is now possible to statistically elaborate these data to identify the expression relationships among RNA molecules and correlative interaction data from other sources. Other previous studies reported a systemic integrated view of RNA correlations in UM [35,36]; however, in this present work we propose the most comprehensive analysis and make these elaborations available through an interactive web-based user interface. The approach used to build VECTOR was already used in our previous papers on both UM and other cancer models [12,15,37]. In these previous studies, data about expression, interactions, and potential regulation operated by TFs were retrieved from different sources and databases, making this analysis complicated and time-consuming. The great advantage of VECTOR is the possibility to perform this analysis with only a few clicks. The strength of our approach is given by several matches observed between results obtained by using VECTOR and literature evidence. Indeed, among the triangle-shaped network motifs resulting from the application of the most stringent threshold of expression correlation, a molecular axis involving a miRNA sponge function of LINC00518 (long intergenic non-protein coding RNA 518) for miR-199a-5p was already reported in breast cancer [38]. Moreover, our previous paper on UM also suggested that LINC00518 may exert a miRNA sponge function on six miRNAs, including miR-199a-5p [15]. Similarly, several TFs identified by VECTOR as potential regulators of mRNA expression are confirmed by reports in the literature: an autoregulative loop was previously reported for CREB1 (cAMP responsive element binding protein 1) [39,40], GABPA (GA binding protein transcription factor subunit α) [41,42,43], JUND (JunD proto-oncogene, AP-1 transcription factor subunit) [44], MAZ (MYC associated zinc finger protein) [45], RELA (RELA proto-oncogene, NF-kB subunit) [46], SOX2 (SRY-box transcription factor 2) [47,48], and SPI1 (Spi-1 proto-oncogene) [49,50,51]. Additionally, CREB1 was shown to induce MEF2A (myocyte enhancer factor 2A) expression in human trophoblast stem cells and T-cells [52,53].
Another feature of VECTOR is the possibility to identify sense-antisense pairs to experimentally investigate their expression relationship. This aspect has been investigated since the early 2000s, and several papers have reported that the majority of sense-antisense pairs exhibit a positive correlation of expression [54,55,56,57,58]. A recent study showed that a positive correlation occurred in divergent pairs more frequently than in convergent pairs, likely because head-to-head overlap implies that genetic loci share a region with an open chromatin structure and the same regulation; on the contrary, convergent pairs showed both a positive and negative correlation [59]. Our analysis confirmed these observations in UM, where a positive correlation of expression is the most common in all the overlapping classes, but especially in divergent pairs (more than 80% showing a positive correlation of expression). Moreover, in agreement with the literature, a negative correlation was more frequent among convergent than divergent pairs (28.3% vs. 16.8%, respectively). The exact molecular mechanism underlying the co-regulation or inverse correlation of sense and antisense transcripts is still under investigation. Some studies showed that transient silencing of sense or antisense transcript did not affect the expression of the other one [60], while other papers demonstrated that the antisense transcript is responsible for the regulation of sense transcript expression at both RNA [61] and protein levels [62]. This evidence suggests very complex regulatory mechanisms that still need to be investigated. We also analyzed expression correlation for pairs including a shorter RNA molecule that totally overlapped a longer transcript. Our data showed that lncRNAs totally overlapping mRNAs are more frequent that mRNAs totally overlapping lncRNAs (40 vs. 15, respectively), with a slightly higher frequency of negative correlation in the first class (35% vs. 26.7%, respectively). For these two overlapping classes, we investigated whether overlapping regions fell within introns or exons, but the existence of multiple splicing variants for both sense and antisense transcripts created a very complex and heterogeneous scenario, where no clear classification (and consequent analysis) was possible. To our knowledge, no similar analysis has been performed to date on human cancer.
VECTOR was built according to our experience in ceRNA network investigations. However, some limits deriving from the available data should be discussed. The most evident limit is related to the type of samples included in the UM TCGA dataset. Indeed, when investigating the molecular bases of carcinogenesis, a researcher would first of all perform a comparison between tumor and normal tissue to identify differentially expressed molecules. Unfortunately, the UM TCGA dataset includes tumor tissue samples, but lacks in data on physiological tissue. Therefore, such an analysis is impossible to perform with this dataset. Other interesting analyses could be performed through the stratification of tumor samples according to clinicopathological data; however, stratification would create several subgroups consisting of a low number of samples, impairing the statistical power of such analysis. For this reason, stratified analyses were not performed, but VECTOR allows the users to inspect expression trends in the different subgroups obtained by stratifying samples for the available clinicopathological features of patients.

5. Conclusions

Comprehensively, VECTOR provides a very detailed picture of ceRNA networks in UM and we believe that it will represent a very useful tool for researchers studying RNA signaling in UM. Moreover, the VECTOR approach could be used to build other tools for different cancer models in order to make the tumor-related ceRNA circuits easily accessible to non-expert biocomputational researchers willing to devise an experimental plan.

Author Contributions

Conceptualization: A.P. and M.R.; methodology and software: A.D.M., S.A. and A.P.; formal analysis: C.B., A.D.M., A.A. and M.R.; investigation: C.B., A.D.M., A.A.; resources: A.D.M., S.A., A.F. and A.P.; data curation: C.B., A.A., D.B., L.C. and M.R.; Writing—original draft preparation: M.R., C.B. and A.D.M.; writing—review and editing: C.D.P., M.P., A.F. and A.P.; visualization: C.B., A.D.M. and D.B.; supervision: M.R.; project administration: M.R.; funding acquisition: M.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The web version of Vector is freely available at the following URL: https://vectordb.it/.

Acknowledgments

The results published here are in part based upon data generated by the TCGA Research Network: https://www.cancer.gov/tcga (accessed on 28 December 2020). The authors wish to thank the Scientific Bureau of the University of Catania for language support. This study was supported by “linea intervento 3 Open Access Piano PIACERI 2020-2022” from the University of Catania.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Nichols, E.E.; Richmond, A.; Daniels, A.B. Tumor Characteristics, Genetics, Management, and the Risk of Metastasis in Uveal Melanoma. Semin. Ophthalmol. 2016, 31, 304–309. [Google Scholar] [CrossRef] [Green Version]
  2. Singh, M.; Durairaj, P.; Yeung, J. Uveal Melanoma: A Review of the Literature. Oncol. Ther. 2018, 6, 87–104. [Google Scholar] [CrossRef] [Green Version]
  3. van den Bosch, T.; van Beek, J.G.; Vaarwater, J.; Verdijk, R.M.; Naus, N.C.; Paridaens, D.; de Klein, A.; Kilic, E. Higher percentage of FISH-determined monosomy 3 and 8q amplification in uveal melanoma cells relate to poor patient prognosis. Investig. Ophthalmol. Vis. Sci. 2012, 53, 2668–2674. [Google Scholar] [CrossRef] [Green Version]
  4. Amaro, A.; Gangemi, R.; Piaggio, F.; Angelini, G.; Barisione, G.; Ferrini, S.; Pfeffer, U. The biology of uveal melanoma. Cancer Metastasis Rev. 2017, 36, 109–140. [Google Scholar] [CrossRef]
  5. Sharma, S.; Kelly, T.K.; Jones, P.A. Epigenetics in cancer. Carcinogenesis 2010, 31, 27–36. [Google Scholar] [CrossRef]
  6. Brait, M.; Sidransky, D. Cancer epigenetics: Above and beyond. Toxicol. Mech. Methods 2011, 21, 275–288. [Google Scholar] [CrossRef]
  7. Meseure, D.; Drak Alsibai, K.; Nicolas, A.; Bieche, I.; Morillon, A. Long Noncoding RNAs as New Architects in Cancer Epigenetics, Prognostic Biomarkers, and Potential Therapeutic Targets. BioMed Res. Int. 2015, 2015, 320214. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  8. Ling, H.; Fabbri, M.; Calin, G.A. MicroRNAs and other non-coding RNAs as targets for anticancer drug development. Nat. Rev. Drug Discov. 2013, 12, 847–865. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  9. Ragusa, M.; Barbagallo, C.; Brex, D.; Caponnetto, A.; Cirnigliaro, M.; Battaglia, R.; Barbagallo, D.; Di Pietro, C.; Purrello, M. Molecular Crosstalking among Noncoding RNAs: A New Network Layer of Genome Regulation in Cancer. Int. J. Genom. 2017, 2017, 4723193. [Google Scholar] [CrossRef]
  10. Miao, H.; Lu, J.; Guo, Y.; Qiu, H.; Zhang, Y.; Yao, X.; Li, X.; Lu, Y. LncRNA TP73-AS1 enhances the malignant properties of pancreatic ductal adenocarcinoma by increasing MMP14 expression through miRNA-200a sponging. J. Cell. Mol. Med. 2021, 25, 3654–3664. [Google Scholar] [CrossRef] [PubMed]
  11. Cardoso, A.M.; Morais, C.M.; Rebelo, O.; Tao, H.; Barbosa, M.; Pedroso de Lima, M.C.; Jurado, A.S. Downregulation of long non-protein coding RNA MVIH impairs glioblastoma cell proliferation and invasion through an miR-302a-dependent mechanism. Hum. Mol. Genet. 2021, 30, 46–64. [Google Scholar] [CrossRef]
  12. Brex, D.; Barbagallo, C.; Mirabella, F.; Caponnetto, A.; Battaglia, R.; Barbagallo, D.; Caltabiano, R.; Broggi, G.; Memeo, L.; Di Pietro, C.; et al. LINC00483 Has a Potential Tumor-Suppressor Role in Colorectal Cancer Through Multiple Molecular Axes. Front. Oncol. 2020, 10, 614455. [Google Scholar] [CrossRef] [PubMed]
  13. Kong, X.; Duan, Y.; Sang, Y.; Li, Y.; Zhang, H.; Liang, Y.; Liu, Y.; Zhang, N.; Yang, Q. LncRNA-CDC6 promotes breast cancer progression and function as ceRNA to target CDC6 by sponging microRNA-215. J. Cell. Physiol. 2019, 234, 9105–9117. [Google Scholar] [CrossRef] [PubMed]
  14. Lu, Q.; Zhao, N.; Zha, G.; Wang, H.; Tong, Q.; Xin, S. LncRNA HOXA11-AS Exerts Oncogenic Functions by Repressing p21 and miR-124 in Uveal Melanoma. DNA Cell. Biol. 2017, 36, 837–844. [Google Scholar] [CrossRef] [PubMed]
  15. Barbagallo, C.; Caltabiano, R.; Broggi, G.; Russo, A.; Puzzo, L.; Avitabile, T.; Longo, A.; Reibaldi, M.; Barbagallo, D.; Di Pietro, C.; et al. LncRNA LINC00518 Acts as an Oncogene in Uveal Melanoma by Regulating an RNA-Based Network. Cancers 2020, 12, 3867. [Google Scholar] [CrossRef]
  16. Robertson, A.G.; Shih, J.; Yau, C.; Gibb, E.A.; Oba, J.; Mungall, K.L.; Hess, J.M.; Uzunangelov, V.; Walter, V.; Danilova, L.; et al. Integrative Analysis Identifies Four Molecular and Clinical Subsets in Uveal Melanoma. Cancer Cell 2017, 32, 204–220. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Ludwig, N.; Leidinger, P.; Becker, K.; Backes, C.; Fehlmann, T.; Pallasch, C.; Rheinheimer, S.; Meder, B.; Stahler, C.; Meese, E.; et al. Distribution of miRNA expression across human tissues. Nucleic Acids Res. 2016, 44, 3865–3877. [Google Scholar] [CrossRef] [PubMed]
  18. Uhlen, M.; Fagerberg, L.; Hallstrom, B.M.; Lindskog, C.; Oksvold, P.; Mardinoglu, A.; Sivertsson, A.; Kampf, C.; Sjostedt, E.; Asplund, A.; et al. Proteomics. Tissue-based map of the human proteome. Science 2015, 347, 1260419. [Google Scholar] [CrossRef] [PubMed]
  19. Habuka, M.; Fagerberg, L.; Hallstrom, B.M.; Ponten, F.; Yamamoto, T.; Uhlen, M. The Urinary Bladder Transcriptome and Proteome Defined by Transcriptomics and Antibody-Based Profiling. PLoS ONE 2015, 10, e0145301. [Google Scholar] [CrossRef]
  20. Naranbhai, V.; Fletcher, H.A.; Tanner, R.; O’Shea, M.K.; McShane, H.; Fairfax, B.P.; Knight, J.C.; Hill, A.V. Distinct Transcriptional and Anti-Mycobacterial Profiles of Peripheral Blood Monocytes Dependent on the Ratio of Monocytes: Lymphocytes. EBioMedicine 2015, 2, 1619–1626. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  21. Karagkouni, D.; Paraskevopoulou, M.D.; Chatzopoulos, S.; Vlachos, I.S.; Tastsoglou, S.; Kanellos, I.; Papadimitriou, D.; Kavakiotis, I.; Maniou, S.; Skoufos, G.; et al. DIANA-TarBase v8: A decade-long collection of experimentally supported miRNA-gene interactions. Nucleic Acids Res. 2018, 46, D239–D245. [Google Scholar] [CrossRef] [Green Version]
  22. Chou, C.H.; Shrestha, S.; Yang, C.D.; Chang, N.W.; Lin, Y.L.; Liao, K.W.; Huang, W.C.; Sun, T.H.; Tu, S.J.; Lee, W.H.; et al. miRTarBase update 2018: A resource for experimentally validated microRNA-target interactions. Nucleic Acids Res. 2018, 46, D296–D302. [Google Scholar] [CrossRef]
  23. Li, J.H.; Liu, S.; Zhou, H.; Qu, L.H.; Yang, J.H. starBase v2.0: Decoding miRNA-ceRNA, miRNA-ncRNA and protein-RNA interaction networks from large-scale CLIP-Seq data. Nucleic Acids Res. 2014, 42, D92–D97. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Paraskevopoulou, M.D.; Georgakilas, G.; Kostoulas, N.; Reczko, M.; Maragkakis, M.; Dalamagas, T.M.; Hatzigeorgiou, A.G. DIANA-LncBase: Experimentally verified and computationally predicted microRNA targets on long non-coding RNAs. Nucleic Acids Res. 2013, 41, D239–D245. [Google Scholar] [CrossRef]
  25. Tong, Z.; Cui, Q.; Wang, J.; Zhou, Y. TransmiR v2.0: An updated transcription factor-microRNA regulation database. Nucleic Acids Res. 2019, 47, D253–D258. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Consortium, E.P.; Moore, J.E.; Purcaro, M.J.; Pratt, H.E.; Epstein, C.B.; Shoresh, N.; Adrian, J.; Kawli, T.; Davis, C.A.; Dobin, A.; et al. Expanded encyclopaedias of DNA elements in the human and mouse genomes. Nature 2020, 583, 699–710. [Google Scholar] [CrossRef]
  27. Chen, E.Y.; Tan, C.M.; Kou, Y.; Duan, Q.; Wang, Z.; Meirelles, G.V.; Clark, N.R.; Ma’ayan, A. Enrichr: Interactive and collaborative HTML5 gene list enrichment analysis tool. BMC Bioinform. 2013, 14, 128. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  28. Kuleshov, M.V.; Jones, M.R.; Rouillard, A.D.; Fernandez, N.F.; Duan, Q.; Wang, Z.; Koplev, S.; Jenkins, S.L.; Jagodnik, K.M.; Lachmann, A.; et al. Enrichr: A comprehensive gene set enrichment analysis web server 2016 update. Nucleic Acids Res. 2016, 44, W90–W97. [Google Scholar] [CrossRef] [Green Version]
  29. Kent, W.J.; Sugnet, C.W.; Furey, T.S.; Roskin, K.M.; Pringle, T.H.; Zahler, A.M.; Haussler, D. The human genome browser at UCSC. Genome Res. 2002, 12, 996–1006. [Google Scholar] [CrossRef] [Green Version]
  30. Sanchez-Mejias, A.; Tay, Y. Competing endogenous RNA networks: Tying the essential knots for cancer biology and therapeutics. J. Hematol. Oncol. 2015, 8, 30. [Google Scholar] [CrossRef] [Green Version]
  31. Xu, W.; Yu, S.; Xiong, J.; Long, J.; Zheng, Y.; Sang, X. CeRNA regulatory network-based analysis to study the roles of noncoding RNAs in the pathogenesis of intrahepatic cholangiocellular carcinoma. Aging 2020, 12, 1047–1086. [Google Scholar] [CrossRef] [PubMed]
  32. Dai, M.; Chen, X.; Mo, S.; Li, J.; Huang, Z.; Huang, S.; Xu, J.; He, B.; Zou, Y.; Chen, J.; et al. Meta-signature LncRNAs serve as novel biomarkers for colorectal cancer: Integrated bioinformatics analysis, experimental validation and diagnostic evaluation. Sci. Rep. 2017, 7, 46572. [Google Scholar] [CrossRef]
  33. Yang, M.; Xiong, X.; Chen, L.; Yang, L.; Li, X. Identification and validation long non-coding RNAs of oral squamous cell carcinoma by bioinformatics method. Oncotarget 2017, 8, 107469–107476. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  34. Yu, H.; Xu, Q.; Liu, F.; Ye, X.; Wang, J.; Meng, X. Identification and validation of long noncoding RNA biomarkers in human non-small-cell lung carcinomas. J. Thorac. Oncol. 2015, 10, 645–654. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  35. Falzone, L.; Romano, G.L.; Salemi, R.; Bucolo, C.; Tomasello, B.; Lupo, G.; Anfuso, C.D.; Spandidos, D.A.; Libra, M.; Candido, S. Prognostic significance of deregulated microRNAs in uveal melanomas. Mol. Med. Rep. 2019, 19, 2599–2610. [Google Scholar] [CrossRef] [Green Version]
  36. Yu, H.; Cheng, Y.; Li, W.; Li, Z.; Wu, P.; Qiu, S.; Zeng, B.; Huang, B. A novel lncRNA-miRNA-mRNA competitive endogenous RNA network for uveal melanoma prognosis constructed by weighted gene co-expression network analysis. Life Sci. 2020, 260, 118409. [Google Scholar] [CrossRef] [PubMed]
  37. Barbagallo, C.; Brex, D.; Caponnetto, A.; Cirnigliaro, M.; Scalia, M.; Magnano, A.; Caltabiano, R.; Barbagallo, D.; Biondi, A.; Cappellani, A.; et al. LncRNA UCA1, upregulated in CRC Biopsies and Downregulated in Serum Exosomes, Controls mRNA Expression by RNA-RNA Interactions. Mol. Ther. Nucleic Acids 2018, 12, 229–241. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  38. Chang, L.; Hu, Z.; Zhou, Z.; Zhang, H. Linc00518 Contributes to Multidrug Resistance Through Regulating the MiR-199a/MRP1 Axis in Breast Cancer. Cell. Physiol. Biochem. 2018, 48, 16–28. [Google Scholar] [CrossRef]
  39. Rothem, L.; Stark, M.; Assaraf, Y.G. Impaired CREB-1 phosphorylation in antifolate-resistant cell lines with down-regulation of the reduced folate carrier gene. Mol. Pharmacol. 2004, 66, 1536–1543. [Google Scholar] [CrossRef] [Green Version]
  40. Bartsch, D.; Casadio, A.; Karl, K.A.; Serodio, P.; Kandel, E.R. CREB1 encodes a nuclear activator, a repressor, and a cytoplasmic modulator that form a regulatory unit critical for long-term facilitation. Cell 1998, 95, 211–223. [Google Scholar] [CrossRef] [Green Version]
  41. Yang, Z.F.; Mott, S.; Rosmarin, A.G. The Ets transcription factor GABP is required for cell-cycle progression. Nat. Cell Biol. 2007, 9, 339–346. [Google Scholar] [CrossRef]
  42. Yu, S.; Cui, K.; Jothi, R.; Zhao, D.M.; Jing, X.; Zhao, K.; Xue, H.H. GABP controls a critical transcription regulatory module that is essential for maintenance and differentiation of hematopoietic stem/progenitor cells. Blood 2011, 117, 2166–2178. [Google Scholar] [CrossRef]
  43. Patton, J.; Block, S.; Coombs, C.; Martin, M.E. Identification of functional elements in the murine Gabp α/ATP synthase coupling factor 6 bi-directional promoter. Gene 2006, 369, 35–44. [Google Scholar] [CrossRef] [PubMed]
  44. Berger, I.; Shaul, Y. The human junD gene is positively and selectively autoregulated. DNA Cell Biol. 1994, 13, 249–255. [Google Scholar] [CrossRef] [PubMed]
  45. Song, J.; Ugai, H.; Kanazawa, I.; Sun, K.; Yokoyama, K.K. Independent repression of a GC-rich housekeeping gene by Sp1 and MAZ involves the same cis-elements. J. Biol. Chem. 2001, 276, 19897–19904. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  46. Sung, M.H.; Li, N.; Lao, Q.; Gottschalk, R.A.; Hager, G.L.; Fraser, I.D. Switching of the relative dominance between feedback mechanisms in lipopolysaccharide-induced NF-kappaB signaling. Sci. Signal. 2014, 7, ra6. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Boyer, L.A.; Lee, T.I.; Cole, M.F.; Johnstone, S.E.; Levine, S.S.; Zucker, J.P.; Guenther, M.G.; Kumar, R.M.; Murray, H.L.; Jenner, R.G.; et al. Core transcriptional regulatory circuitry in human embryonic stem cells. Cell 2005, 122, 947–956. [Google Scholar] [CrossRef] [Green Version]
  48. Tomioka, M.; Nishimoto, M.; Miyagi, S.; Katayanagi, T.; Fukui, N.; Niwa, H.; Muramatsu, M.; Okuda, A. Identification of Sox-2 regulatory region which is under the control of Oct-3/4-Sox-2 complex. Nucleic Acids Res. 2002, 30, 3202–3213. [Google Scholar] [CrossRef] [PubMed]
  49. Chen, H.; Ray-Gallet, D.; Zhang, P.; Hetherington, C.J.; Gonzalez, D.A.; Zhang, D.E.; Moreau-Gachelin, F.; Tenen, D.G. PU.1 (Spi-1) autoregulates its expression in myeloid cells. Oncogene 1995, 11, 1549–1560. [Google Scholar]
  50. Okuno, Y.; Huang, G.; Rosenbauer, F.; Evans, E.K.; Radomska, H.S.; Iwasaki, H.; Akashi, K.; Moreau-Gachelin, F.; Li, Y.; Zhang, P.; et al. Potential autoregulation of transcription factor PU.1 by an upstream regulatory element. Mol. Cell. Biol. 2005, 25, 2832–2845. [Google Scholar] [CrossRef] [Green Version]
  51. Breig, O.; Theoleyre, O.; Douablin, A.; Baklouti, F. Subtle distinct regulations of late erythroid molecular events by PI3K/AKT-mediated activation of Spi-1/PU.1 oncogene autoregulation loop. Oncogene 2010, 29, 2807–2816. [Google Scholar] [CrossRef] [PubMed]
  52. Tsai, E.M.; Wang, Y.C.; Lee, T.T.; Tsai, C.F.; Chen, H.S.; Lai, F.J.; Yokoyama, K.K.; Hsieh, T.H.; Wu, R.M.; Lee, J.N. Dynamic Trk and G Protein Signalings Regulate Dopaminergic Neurodifferentiation in Human Trophoblast Stem Cells. PLoS ONE 2015, 10, e0143852. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  53. Jain, P.; Lavorgna, A.; Sehgal, M.; Gao, L.; Ginwala, R.; Sagar, D.; Harhaj, E.W.; Khan, Z.K. Myocyte enhancer factor (MEF)-2 plays essential roles in T-cell transformation associated with HTLV-1 infection by stabilizing complex between Tax and CREB. Retrovirology 2015, 12, 23. [Google Scholar] [CrossRef] [Green Version]
  54. Morrissy, A.S.; Griffith, M.; Marra, M.A. Extensive relationship between antisense transcription and alternative splicing in the human genome. Genome Res. 2011, 21, 1203–1212. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  55. Katayama, S.; Tomaru, Y.; Kasukawa, T.; Waki, K.; Nakanishi, M.; Nakamura, M.; Nishida, H.; Yap, C.C.; Suzuki, M.; Kawai, J.; et al. Antisense transcription in the mammalian transcriptome. Science 2005, 309, 1564–1566. [Google Scholar] [CrossRef]
  56. Chen, J.; Sun, M.; Hurst, L.D.; Carmichael, G.G.; Rowley, J.D. Genome-wide analysis of coordinate expression and evolution of human cis-encoded sense-antisense transcripts. Trends Genet. 2005, 21, 326–329. [Google Scholar] [CrossRef]
  57. Wenric, S.; ElGuendi, S.; Caberg, J.H.; Bezzaou, W.; Fasquelle, C.; Charloteaux, B.; Karim, L.; Hennuy, B.; Freres, P.; Collignon, J.; et al. Transcriptome-wide analysis of natural antisense transcripts shows their potential role in breast cancer. Sci. Rep. 2017, 7, 17452. [Google Scholar] [CrossRef] [Green Version]
  58. Grigoriadis, A.; Oliver, G.R.; Tanney, A.; Kendrick, H.; Smalley, M.J.; Jat, P.; Neville, A.M. Identification of differentially expressed sense and antisense transcript pairs in breast epithelial tissues. BMC Genom. 2009, 10, 324. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  59. Hu, H.Y.; He, L.; Khaitovich, P. Deep sequencing reveals a novel class of bidirectional promoters associated with neuronal genes. BMC Genom. 2014, 15, 457. [Google Scholar] [CrossRef] [Green Version]
  60. Goyal, A.; Fiskin, E.; Gutschner, T.; Polycarpou-Schwarz, M.; Gross, M.; Neugebauer, J.; Gandhi, M.; Caudron-Herger, M.; Benes, V.; Diederichs, S. A cautionary tale of sense-antisense gene pairs: Independent regulation despite inverse correlation of expression. Nucleic Acids Res. 2017, 45, 12496–12508. [Google Scholar] [CrossRef] [PubMed]
  61. Faghihi, M.A.; Zhang, M.; Huang, J.; Modarresi, F.; Van der Brug, M.P.; Nalls, M.A.; Cookson, M.R.; St-Laurent, G., 3rd; Wahlestedt, C. Evidence for natural antisense transcript-mediated inhibition of microRNA function. Genome Biol. 2010, 11, R56. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  62. Beltran, M.; Puig, I.; Pena, C.; Garcia, J.M.; Alvarez, A.B.; Pena, R.; Bonilla, F.; de Herreros, A.G. A natural antisense transcript regulates Zeb2/Sip1 gene expression during Snail1-induced epithelial-mesenchymal transition. Genes Dev. 2008, 22, 756–769. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Architecture of VECT model. Cylinders represent databases (both external databases and VECTOR); rectangles depict graphical interface modules or database processing applications. Neo4j database logo is shown within the VECTOR database.
Figure 1. Architecture of VECT model. Cylinders represent databases (both external databases and VECTOR); rectangles depict graphical interface modules or database processing applications. Neo4j database logo is shown within the VECTOR database.
Genes 12 01004 g001
Figure 2. Menu section of the VECTOR database. (A) The “Circuit” menu queries the database by choosing at least an RNA molecule (lncRNA and/or miRNA and/or mRNA), a minimum correlation value for each pair and the number of network motifs to be shown. Alternatively, user can filter the output by p-value. (B) The “Antisense” menu identifies sense-antisense pairs by choosing a lncRNA and/or mRNA. (C) The “TF Search” menu retrieves information on potential TFs regulating lncRNA, miRNA or mRNA expression in UM by choosing a TF and/or a lncRNA, a miRNA, or a mRNA. (D) The “Expression” menu allows to inspect the expression of a chosen lncRNA, miRNA or mRNA in UM samples or several physiological tissues.
Figure 2. Menu section of the VECTOR database. (A) The “Circuit” menu queries the database by choosing at least an RNA molecule (lncRNA and/or miRNA and/or mRNA), a minimum correlation value for each pair and the number of network motifs to be shown. Alternatively, user can filter the output by p-value. (B) The “Antisense” menu identifies sense-antisense pairs by choosing a lncRNA and/or mRNA. (C) The “TF Search” menu retrieves information on potential TFs regulating lncRNA, miRNA or mRNA expression in UM by choosing a TF and/or a lncRNA, a miRNA, or a mRNA. (D) The “Expression” menu allows to inspect the expression of a chosen lncRNA, miRNA or mRNA in UM samples or several physiological tissues.
Genes 12 01004 g002
Figure 3. Result section of the VECTOR database for the “Circuits” query. Output data based on miRNA sponge activity of lncRNAs are depicted as an interactive network. UM expression data of each member of the circuits is shown as heatmap.
Figure 3. Result section of the VECTOR database for the “Circuits” query. Output data based on miRNA sponge activity of lncRNAs are depicted as an interactive network. UM expression data of each member of the circuits is shown as heatmap.
Genes 12 01004 g003
Figure 4. Results section of the VECTOR database for the “Antisense” query. Output data based on gene overlapping and expression correlation are depicted as an interactive network. A pop-up window appears by clicking on the edge and shows expression correlation, p-value and details about the overlapping of the mRNA lncRNA pair.
Figure 4. Results section of the VECTOR database for the “Antisense” query. Output data based on gene overlapping and expression correlation are depicted as an interactive network. A pop-up window appears by clicking on the edge and shows expression correlation, p-value and details about the overlapping of the mRNA lncRNA pair.
Genes 12 01004 g004
Figure 5. Results section of the VECTOR database for “TF search” query. The table shows expression correlation among the TF and the RNA species, with the associated p-value. The presence of an experimentally validated TF binding site according to Chea and ENCODE is reported as 1 (yes) or 0 (no).
Figure 5. Results section of the VECTOR database for “TF search” query. The table shows expression correlation among the TF and the RNA species, with the associated p-value. The presence of an experimentally validated TF binding site according to Chea and ENCODE is reported as 1 (yes) or 0 (no).
Genes 12 01004 g005
Figure 6. Results section of the VECTOR database for the “Expression” query. (A) The expression of the selected RNA molecule in UM samples from TCGA is shown as histograms. The color of the histogram allows to classify each sample according to the selected clinicopathological parameter. (B) The expression of the selected RNA molecule in a set of physiological tissues is shown as histograms.
Figure 6. Results section of the VECTOR database for the “Expression” query. (A) The expression of the selected RNA molecule in UM samples from TCGA is shown as histograms. The color of the histogram allows to classify each sample according to the selected clinicopathological parameter. (B) The expression of the selected RNA molecule in a set of physiological tissues is shown as histograms.
Genes 12 01004 g006
Figure 7. Generation of lncRNA–miRNA–mRNA axes stored in VECTOR. LncRNA–miRNA–mRNA axes (triangle-shaped network motifs) were computed by using different correlation coefficient thresholds (blue bars) and matched with the same network motifs also featuring the predicted or validated interactions in the miRNA–lncRNA and miRNA–mRNA axes (red bars).
Figure 7. Generation of lncRNA–miRNA–mRNA axes stored in VECTOR. LncRNA–miRNA–mRNA axes (triangle-shaped network motifs) were computed by using different correlation coefficient thresholds (blue bars) and matched with the same network motifs also featuring the predicted or validated interactions in the miRNA–lncRNA and miRNA–mRNA axes (red bars).
Genes 12 01004 g007
Figure 8. Sense-antisense lncRNA:mRNA overlapping pairs in UM. (A) Sense-antisense pairs were classified according to overlapping features in: convergent (lncRNA and mRNA overlap in their 3′ regions), divergent (lncRNA and mRNA overlap in their 5′ regions), lncRNA within mRNA (a shorter lncRNA totally overlapping a longer mRNA) and mRNA within lncRNA (a shorter mRNA totally overlapping a longer lncRNA). (B) Sense-antisense pairs divided in subgroups according to the direction of expression correlation: positive correlation is represented as a checked fill pattern of the histogram, negative correlation as a striped fill pattern. Data are shown as percentage calculated on the total number of 198 pairs; the number of pairs included in each overlapping class or subgroup is reported above each histogram. (C) Correlation between length of overlapping regions (mean) and expression correlation among sense and antisense transcripts.
Figure 8. Sense-antisense lncRNA:mRNA overlapping pairs in UM. (A) Sense-antisense pairs were classified according to overlapping features in: convergent (lncRNA and mRNA overlap in their 3′ regions), divergent (lncRNA and mRNA overlap in their 5′ regions), lncRNA within mRNA (a shorter lncRNA totally overlapping a longer mRNA) and mRNA within lncRNA (a shorter mRNA totally overlapping a longer lncRNA). (B) Sense-antisense pairs divided in subgroups according to the direction of expression correlation: positive correlation is represented as a checked fill pattern of the histogram, negative correlation as a striped fill pattern. Data are shown as percentage calculated on the total number of 198 pairs; the number of pairs included in each overlapping class or subgroup is reported above each histogram. (C) Correlation between length of overlapping regions (mean) and expression correlation among sense and antisense transcripts.
Genes 12 01004 g008
Figure 9. Transcription factors and their potential targets in uveal melanoma retrieved by VECTOR. TFs regulating (a) mRNA coding genes, (b) miRNA coding genes, (c) lncRNA coding genes are reported according to correlation coefficients as the number of correlated TF:xRNA pairs (blue bar) and the number of correlated TF–xRNA pairs whose TF potentially binds the promoter of xRNA genes (red bar).
Figure 9. Transcription factors and their potential targets in uveal melanoma retrieved by VECTOR. TFs regulating (a) mRNA coding genes, (b) miRNA coding genes, (c) lncRNA coding genes are reported according to correlation coefficients as the number of correlated TF:xRNA pairs (blue bar) and the number of correlated TF–xRNA pairs whose TF potentially binds the promoter of xRNA genes (red bar).
Genes 12 01004 g009
Table 1. LncRNA–miRNA–mRNA axes featuring physical interactions calculated by using the most stringent threshold (correlation coefficient > 0.6).
Table 1. LncRNA–miRNA–mRNA axes featuring physical interactions calculated by using the most stringent threshold (correlation coefficient > 0.6).
miRNAPearson miRNA–mRNAmRNATarBmirTarlncRNAPearson miRNA–lncRNAmiRclncB-VlncB-PEnPearson lncRNA–mRNA
hsa-miR-199a-5p−0.65CDCA7L10LINC00518−0.6610000.65
hsa-miR-199a-5p−0.65CDCA7L10SNHG7−0.6910000.73
hsa-miR-195-5p−0.60SDC310LINC01128−0.6700010.72
hsa-miR-199a-5p−0.66RPL1510LINC00518−0.6610000.64
hsa-miR-199a-5p−0.66RPL1510SNHG7−0.6910000.80
hsa-miR-199a-5p−0.66RPL1510WDFY3-AS2−0.6310000.62
hsa-miR-199a-5p−0.63ZNF41501LINC00518−0.6610000.72
hsa-miR-195-5p−0.66TPRG1L10LINC01128−0.6600010.82
hsa-miR-508-3p−0.61GPR17601HCP5−0.7010000.63
hsa-miR-195-5p–0.65BSDC110LINC01128–0.6700010.61
hsa-miR-195-5p–0.65CTNNBIP110LINC01128–0.6700010.72
lncRNA-miRNA-mRNA network motifs were calculated by (1) retrieving the miRNA–mRNA, miRNA–lncRNA, and lncRNA–mRNA pairs with correlation coefficients of <−0.6 and >0.6, respectively; (2) identifying miRNA–mRNA and miRNA–lncRNA axes with at least one predicted or validated interaction from Tarbase (TarB), miRTarBase (mirTar), miRcode (miRc), lncBase (lncB-V: validated modules; lncB-P: predicted modules), and Encori (En).
Table 2. The most frequent TFs regulating mRNA, miRNA and lncRNA coding genes in UM according to the most stringent parameters of VECTOR.
Table 2. The most frequent TFs regulating mRNA, miRNA and lncRNA coding genes in UM according to the most stringent parameters of VECTOR.
TFsmRNAsmiRNAslncRNAsRole in Cancer
CDC73ATF2, BACH1, CREB1, ELF1, MEF2A//Oncogene (29221126)/tumor-suppressor (24145611)
COG6BACH1, CREB1, ELF1, GABPA///
CREB1CREB1 (15340044, 9790528), MEF2A (26606046, 25809782)/SEPT7P2, SUZ12P1, ZNF252P, ZNF37BPOncogene (17786359, 28498439, 27801665)
EPC2ATF1, BACH1, CREB1, ELF1, MEF2A, SMAD4//Oncogene (24166297)
GABPAGABPA (17277770, 21139080, 16309857)/LOC407835(-), CCT6P1, LOC100190986, SUZ12P1, ZNF37BPTumor-suppressor (31802036, 28549418)
JUNDJUND (8172655)/FAM35BP(-), FAM35DP(-)Oncogene (30763715, 27358408)/tumor-suppressor (18454173)
MAZMAZ (11259406)/BDNF-AS(-), CCT6P1(-), SBDSP1(-), SEPT7P2(-)Oncogene (31488180, 29414775)
MORC3BACH1, CREB1, ELF1, GABPA///
NARFLATF2(-), BACH1(-)///
PIKFYVEATF2, CREB1, MEF2A, ZFX//Oncogene (17909029, 24840251, 23154468)
RELARELA (24425788)/SEPT7P2(-), SNHG10(-), ZNF37BP(-)Oncogene (17622249, 12615723)/tumor-suppressor (11747334)
SF3B1ATF2, BACH1, CREB1, ZNF143///
SOX2SOX2 (16153702, 12136102)hsa-miR-124-3p, hsa-miR-183-5p, hsa-miR-96-5p/Oncogene (31412296, 31748974, 30518951)
SP3ATF2, CLOCK, CREB1, MEF2A, YY1//Oncogene (20810260, 26967243, 26352013)
SPI1SPI1 (7478579, 15767686, 20190819)hsa-miR-146b-3p, hsa-miR-146b-5p, hsa-miR-150-5pLOC606724, NCF1B, NCF1COncogene (28415748)
TFAP2A/hsa-miR-145-3p(-), hsa-miR-199a-5p(-), hsa-miR-4709-3p(-), hsa-miR-708-5p(-), hsa-miR-887-3p(-), hsa-miR-937-3p(-), hsa-miR-181a-5p/Oncogene (31772149, 30824562, 28412966)/tumor-suppressor (30824562)
TRAPPC8ATF1, ATF2, CREB1, MEF2A, YY1///
USF2USF2/SBDSP1(-), SEPT7P2(-)Oncogene (30244169)/tumor-suppressor (16186802)
XPO1ATF2, BACH1, CEBPZ, CREB1//Oncogene (32487143, 30976603, 24431073)
ZBTB45CREB1(-), MEF2A(-)//
ZFRATF2, BACH1, CLOCK, CREB1, ELF1, YY1//Oncogene (31010678)
ZNF143ZNF143/LOC407835(-), SEPT7P2, SUZ12P1, ZNF37BPOncogene (27449034, 20860770, 32312832)
ZNF791ATF2, BACH1, CREB1, SP4, YY1///
The most frequent TFs were retrieved by using the most stringent and evaluable parameters of TF querying by VECTOR (TF–mRNA= correlation coefficient <−0.9 and >0.9; TF–miRNA = correlation coefficient <−0.6 and >0.7; TF–lncRNA= correlation coefficient <−0.7 and >0.8). From this output, the TFs shows that at least three targets were retrieved. The negative expression correlation between TFs and targets are indicated with a minus symbol between brackets (-) next to the target name. / = no data. Literature regulation mechanisms are highlighted in bold; references reporting the role in cancer or the regulation of the transcript are shown as PubMed IDs.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Barbagallo, C.; Di Maria, A.; Alecci, A.; Barbagallo, D.; Alaimo, S.; Colarossi, L.; Ferro, A.; Di Pietro, C.; Purrello, M.; Pulvirenti, A.; et al. VECTOR: An Integrated Correlation Network Database for the Identification of CeRNA Axes in Uveal Melanoma. Genes 2021, 12, 1004. https://doi.org/10.3390/genes12071004

AMA Style

Barbagallo C, Di Maria A, Alecci A, Barbagallo D, Alaimo S, Colarossi L, Ferro A, Di Pietro C, Purrello M, Pulvirenti A, et al. VECTOR: An Integrated Correlation Network Database for the Identification of CeRNA Axes in Uveal Melanoma. Genes. 2021; 12(7):1004. https://doi.org/10.3390/genes12071004

Chicago/Turabian Style

Barbagallo, Cristina, Antonio Di Maria, Adriana Alecci, Davide Barbagallo, Salvatore Alaimo, Lorenzo Colarossi, Alfredo Ferro, Cinzia Di Pietro, Michele Purrello, Alfredo Pulvirenti, and et al. 2021. "VECTOR: An Integrated Correlation Network Database for the Identification of CeRNA Axes in Uveal Melanoma" Genes 12, no. 7: 1004. https://doi.org/10.3390/genes12071004

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop