Next Article in Journal
HRD Testing of Ovarian Cancer in Routine Practice: What Are We Dealing With?
Next Article in Special Issue
The Role of TRPM4 in Cardiac Electrophysiology and Arrhythmogenesis
Previous Article in Journal
ARL6IP5 Ameliorates α-Synuclein Burden by Inducing Autophagy via Preventing Ubiquitination and Degradation of ATG12
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bioinformatic Identification of Potential RNA Alterations on the Atrial Fibrillation Remodeling from Human Pulmonary Veins

1
Department of Cardiovascular Surgery, Akita University Graduate School of Medicine, 1-1-1 Hondo, Akita 010-8543, Japan
2
Center for Genomic Medicine, Graduate School of Medicine, Kyoto University, Shogoinkawahara-cho, Kyoto 606-8507, Japan
3
Department of Cell Physiology, Akita University Graduate School of Medicine, 1-1-1 Hondo, Akita 010-8543, Japan
4
Research Department, Ono Pharmaceutical Co., Ltd., Kyutaromachi, Osaka 541-0056, Japan
5
Department of Pharmacology, Faculty of Medicine, Yamagata University, Iida-Nishi, Yamagata 990-9585, Japan
*
Author to whom correspondence should be addressed.
These authors contributed equally to this paper.
Int. J. Mol. Sci. 2023, 24(13), 10501; https://doi.org/10.3390/ijms241310501
Submission received: 18 March 2023 / Revised: 16 May 2023 / Accepted: 20 June 2023 / Published: 22 June 2023
(This article belongs to the Special Issue Cardiac Arrhythmia: Molecular Mechanisms and Therapeutic Strategies)

Abstract

:
Atrial fibrillation (AF) is the most frequent persistent arrhythmia. Many genes have been reported as a genetic background for AF. However, most transcriptome analyses of AF are limited to the atrial samples and have not been evaluated by multiple cardiac regions. In this study, we analyzed the expression levels of protein-coding and long noncoding RNAs (lncRNAs) in six cardiac regions by RNA-seq. Samples were donated from six subjects with or without persistent AF for left atria, left atrial appendages, right atria, sinoatrial nodes, left ventricles, right ventricles, and pulmonary veins (PVs), and additional four right atrial appendages samples were collected from patients undergoing mitral valve replacement. In total, 23 AF samples were compared to 23 non-AF samples. Surprisingly, the most influenced heart region in gene expression by AF was the PV, not the atria. The ion channel-related gene set was significantly enriched upon analysis of these significant genes. In addition, some significant genes are cancer-related lncRNAs in PV in AF. A co-expression network analysis could detect the functional gene clusters. In particular, the cancer-related lncRNA, such as SAMMSON and FOXCUT, belong to the gene network with the cancer-related transcription factor FOXC1. Thus, they may also play an aggravating role in the pathogenesis of AF, similar to carcinogenesis. In the least, this study suggests that (1) RNA alteration is most intense in PVs and (2) post-transcriptional gene regulation by lncRNA may contribute to the progression of AF. Through the screening analysis across the six cardiac regions, the possibility that the PV region can play a role other than paroxysmal triggering in the pathogenesis of AF was demonstrated for the first time. Future research with an increase in the number of PV samples will lead to a novel understanding of the pathophysiology of AF.

1. Introduction

Atrial fibrillation (AF) is the most common cardiac arrhythmia. Researchers worldwide have devoted much effort to finding its etiology. Although we currently know that AF has a complex pathology, investigation of the specific molecular mechanisms of atrial fibrillation began with identifying the causative gene for familial atrial fibrillation in the last century. From the ancestry of familial AF, linkage analysis identified causal loci on the genetic map [1]. In 2003, it was reported that one of the causal genes in familial AF was due to mutations in the K+ channel KCNQ1 [2]. KCNQ1 abnormality was known to cause long QT syndrome [3], so this finding attracted much interest. Subsequently, many mutations in ion channel genes causing familial AF have been reported, and the genetic mutations leading to familial AF are well summarized in the review by Ragab et al. [4]. However, familial AF is a rare genetic disorder that hardly explains the pathogenesis of common AF.
For this reason, genome-wide association analysis (GWAS), a comprehensive analysis of single-gene polymorphisms (SNPs), has been repeatedly performed to explore the genetic background of the disease as a more common condition, i.e., non-familial AF. First 2007, an SNP at locus 4q25 was reported [5]. This locus locates between the PITX2 gene and the translational region of the C4orf32 gene, whose function is poorly understood, and its association with AF is overwhelmingly significant. Furthermore, the transcription factor PITX2 has attracted attention in the understanding of the pathophysiology of AF [6], as it is involved in the development of the pulmonary veins (PVs) [7], the regional origin of AF. Subsequently, similar analyses have reported the association of the transcription factors ZFHX3 [8,9] and the K+ channel KCNN3 [10], and now nearly 120 loci have been reported to be associated with AF [11]. The problem is that SNPs at each locus have a low relative risk (RR) for developing AF. Most loci have an RR between 1.0 and 1.1 [12]. In other words, AF-associated loci explain the inherited susceptibility to AF, but a single SNP alone does not lead to the development of the disease.
Therefore, the inherited combination of several SNPs or the involvement of environmental factors leads to the onset of the disease. Transcriptome analysis helps elucidate the direct pathophysiology. Hitherto, oxidative stresses [13], metabolic changes [14], and inflammations [11] have been suggested from transcriptome analysis. Recent advances in sequencing technology allow the transcriptome and non-coding RNAs to be analyzed together, allowing even post-transcriptional considerations [15,16]. However, transcriptome analysis generally has been limited to human patient samples from the atria and has yet to elucidate the pathogenesis by multi-cardiac regions. In the present study, we compared six different cardiac regions of donated samples. Surprisingly, the PV was the cardiac region most affected by atrial fibrillation in gene expressions. Moreover, our results have the potential to explain that AF is induced by an accumulation of acquired environmental stresses, similar to carcinogenesis.

2. Results

2.1. General Overview of Analysis Results

In this study, a total of 46 samples were obtained for RNA-seq; three samples each of left atrium (LA), left atrial appendage (LAA), left ventricle (LV), pulmonary vein (PV), right atrium (RA), right ventricle (RV) and sinoatrial node (SAN) were obtained from donors with or without persistent atrial fibrillation (AF); right atrial appendages (RAAs) were excised during mitral valve replacement surgery for two samples each from patients with or without AF. Patient information is summarized in Table 1. From 33535 RNAs detected by RNA-seq, statistical testing of expression differences between controls and AFs using the false discovery ratio (FDR) < 0.05 criterion resulted in 413 RNA hits, of which 201 were protein-coding, and 212 were long non-coding RNAs (lncRNAs). Using the R package, we applied the authorized method to compare two groups by computing differential expression genes (DEGs). The full results of the analysis can be viewed in Supplementary File S1. The hierarchical clustering of the DEGs in protein-coding and lncRNA are shown in Figure 1A,B, respectively. Besides the presence or absence of AF, differences in expression patterns were observed among cardiac regions. Therefore, each cardiac region was evaluated separately. Hereafter, LAA and RAA data were included in LA and RA, respectively.
First, principal component analysis was applied to the gene expression matrices of each region. Wilks’ lambda was calculated to indicate the distance between groups with or without AF in the two-dimensional space defined by the first and second principal components. Wilks’ lambda is a statistic that measures how groups are separated in space; it is expressed from 0 to 1, with the closer to 0, the greater the degree of separation. Figure 1C displays the negative logarithm of Wilks’ lambda for protein-coding (gray) and lncRNAs (orange). The most considerable difference between groups was demonstrated in PVs. Except for the SAN, differences between groups were more remarkable for lncRNAs than for protein-coding genes. It was suggested that the degree of association with the disease might differ from region to region. There is little difference in RV because the development of AF has little effect on gene expression in RV.

2.2. Differential Expression Genes (DEGs) in PV and LV

DEG analysis was performed by cardiac region (FDR < 0.05). The results showed that 145 DEGs were identified in the PV and 14 in the LV, but not in other regions. This supports the results from the analysis of Wilks’ lambda. Four genes belonging to both PV and LV DEGs were S100A1, GABARAP, CCDC144A, and ENSG00000289474 (Figure 2). Among the DEGs, the top 50 coding RNAs in PV, all lncRNAs in PV, and all DEGs in LV are listed in Figure 3A–C, according to statistical significance. The full results for DEGs in PV and LV can be viewed in Supplementary File S2. In addition, the Gene Ontology (GO) analysis was performed on the PV and LV gene sets to determine the functional annotation of protein-coding RNAs among the DEGs. Significantly enriched ontologies were identified in the PV DEGs, including “ion channel activity”. The GO IDs, descriptions, and statistics of the identified GOs are graphed in Figure 4. The gene names of the DEGs belonging to each GO are listed in Table 2.
In contrast, no significantly enriched Ontology was detected in DEGs of LV. Although most lncRNAs have poorly understood functions and cannot be generally annotated like coding RNAs, cancer-associated lncRNAs have been studied relatively well. In Table 3, lncRNAs corresponding to PV DEGs are addressed to cancer-associated lncRNAs in the database. Significant lncRNAs in PV in AF are also comparable in cancer.

2.3. Co-Expression Network Analysis on DEGs

To explore functional relationships between DEGs, co-expression network analysis was performed on 155 DEGs detected in PV and LV. This method is a valuable data analysis technique in characterizing gene function globally [17] and is also used in investigations of lncRNA function [18]. By clustering Pearson correlation coefficients between gene expression levels, compartmentalized groups of genes with correlated expressions are visualized (Figure 5A). A gene network was constructed by linking pairs of these gene groups with correlation coefficients >0.8. In Figure 5A, the largest compartment (a), followed by compartment (b), and the negatively linked compartments (c1) and (c2) are constructed as subnetworks A, B, and C. Subnetwork A, B, and C are extracted into Figure 6A–C, respectively. Figure 5B displays the distribution of gene degree (number of intergenic connections). These networks have typical gene network properties, with a few genes having many connections and most genes having few connections, referred to as scale-free [19]. Genes with a degree above 30 are listed in Table 4 as hub genes, which play essential roles in the gene network. In Figure 6, subnetwork A is a population of up-regulated 62 genes, subnetwork B is a population of down-regulated 40 genes, and subnetwork C is mixed by up-regulated five and down-regulated ten genes. Subnetworks A and C have no known genes that reportedly induce AF and have several lncRNAs, which are thought to be involved in regulating post-acquired gene expression. In contrast, subnetwork B has known AF-related genes such as NKX2-5, KCNH2, and SCN2B and no lncRNAs.

3. Discussion

Although there have been many comprehensive RNA expression analyses on atrial fibrillation (AF), most studies have limited their analysis to atrial samples on the assumption that pathological RNA alterations occur in the atria [13,14,15,16]. We are the first to perform comparisons in six different heart regions. Surprisingly, the most robust RNA alterations occurred in the pulmonary veins (PVs) among the cardiac regions (Figure 1B). Of these, coding RNAs related to ion channels are particularly enriched as RNAs influenced by AF (Figure 4). This may illustrate the basis of electrical remodeling in AF patients. In addition, the expression of the transcription factor FOXC1 was strongly up-regulated with long non-coding RNA (lncRNA) FOXCUT in AF patients (Figure 6A), which may be involved in structural remodeling by post-acquired gene regulation as discussed below.

3.1. The Priority of PV among Cardiac Regions on AF Remodeling

AF is known to be mainly triggered by PV sleeves [20]. Nevertheless, AF itself is defined as an atrial arrhythmia, and it has long been considered that substrate changes in the atria are responsible for the persistence of the arrhythmia [21]. Thus, most AF researchers believe that the initiation and maintenance of AF result from different molecular mechanisms occurring in different cardiac regions [22]. However, intense fibrosis, representing the substrate changes in AF patients, also develops in PV [23,24]. If substrate changes provide the basis for AF persistence, the atria would not be the only cardiac region contributing to AF persistence. Instead, the current analytic results indicate that biological change is more potent in PVs (Figure 1B). In our analysis, the cardiac regions with statistically significant differences between AF and sinus rhythm patients were PV and LV (Figure 2). There were 145 significant hits in the PV samples, including coding RNAs and long non-coding RNAs (lncRNAs), compared to 14 gene hits in the LV. Furthermore, samples from donors with coronary artery disease are largely unaffected in atrial and PV samples, while ventricular samples may be affected by ischemic changes in the data.

3.2. The Possible Basis of the Electrical Remodeling in PV in AF

Gene ontology (GO) analysis detected several ion transport-related genes from the enriched gene sets on PV, such as a subunit of mitochondrial ATPase (ATP5FD1), endoplasmic reticulum proton pump (ATP6V0C), hERG channel (KCNH2), an auxiliary subunit of BK channel (LRRC38), voltage-gated Ca2+ channel (CACNA1E), ion channel scaffold molecule (PKD2), β subunit of voltage-gated Na+ channel (SCN2B), β subunit of voltage-gated K+ channel (KCNIP2), a part of ion channel-type glutamate receptors (GRIA2, GRIN2c) and GABA receptors (GABRR1).
ATP5FD1, ATP6V0C, KCNH2, SCN2B, and KCNIP2 are essential to myocardial functions and significantly downregulated in AF samples. In addition, these cardiac function-related proteins are all on subnetwork B in Figure 6B. As noted, the co-expression network analysis was calculated from n = 46. Subnetwork B contains the transcription factor NKX2-5. NKX2-5 is a well-established transcription factor that determines morphological and functional features of the myocardium [25] and is also down-regulated in AF samples. It is, thus, estimated that cardiomyocyte-like characteristics might be lost in the PVs of AF patients. Among these genes, NKX2-5, KCNH2, and SCN2B have already been reported as single nucleotide polymorphisms for AF [26,27,28]. Unlike mutations, polymorphisms by themselves do not induce disease and are therefore explained as a genetic background rather than a causative gene. In the present analysis, the simultaneous finding of the down-regulation of these genes is meaningful in considering the cause of the proarrhythmic state. Namely, the down-regulation of NKX2-5 indicates a deviation from cardiac tissue, accompanied by electrical remodelings, i.e., the down-regulation of KCNH2 and SCN2B, resulting in the formation of atrial fibrillation. Also, the present analysis, which does not assume genetic mutations, shares results with the analysis of congenital causes, making it reasonable to assume that the polymorphisms identified in the previous reports are responsible for the proarrhythmic state.
Other ion transport proteins belonging to subnetworks A and C are tentatively ion channel-related molecules expressed in the smooth muscle layer and the pulmonary veins’ nerve plexus.

3.3. Possible Classifying Post-Acquired Disturbances in AF by the Network Analysis

In the co-expression network analysis in Figure 6, subnetwork A is the extensive network with the highest number of correlated RNAs. It exhibits up-regulated expressions of the cancer-associated lncRNAs FOXCUT, SAMMSON, ENSG00000251320, and ENSG00000248927, as well as ion channels in nerve fibers, such as GRIA2 and GABARR1. In addition, the expression of FOXC1, a FOXCUT-related transcript, is also up-regulated. Furthermore, prothrombotic factors, such as PDGFD and ALCAM, and molecules in the inflammatory response, such as receptors for IL-13 and 31, are also up-regulated. On the other hand, unlike subnetwork B, none of the known AF-causing genes are listed.
Given the above, it is likely that subnetwork A reflects post-acquired pathological disturbances due to the disease rather than inherited conditions. LncRNAs are often influential in regulating acquired gene expression through epigenomic regulation and post-transcriptional and post-translational modifications [29], e.g., FOXCUT binds to FOXC1 mRNA and promotes protein translation [30]. The transcription factor FOXC1 is known to contribute to the formation of the heart tube outflow tract early in development in the cardiovascular research field [31]. The PV is the cardiac region that forms a part of the outflow tract [32]. However, considering the age of the donors (Table 1), the increased expression in AF may represent a pathological rather than a developmental change. As for pathological changes, FOXC1 is up-regulated in many carcinomas. It plays a unique role in promoting the epithelial–mesenchymal transition (EMT) [33]; If EMT occurs in the pulmonary veins, it is assumed that endothelial cells will be converted to fibroblasts. Some reports have already found that EMT occurs in the atria of AF patients [34,35]. Then, fibroblasts secrete cytokines and adhesion molecules that may influence the substrate changes in the patients. Indeed, PDGE and ALCAM, prothrombotic factors in subnetwork A, can be secreted by fibroblasts. Thus, fibroblasts would be expected to proliferate and the substrate alteration to progress irreversibly. Our analysis indicates that the focus of the substrate changes is the PV rather than the atria, which may be related to thrombi being more likely to occur in the left atrium (~90%) than in the right atrium [36].
SAMMSON is also a well-studied lncRNA initially described as up-regulated in malignant melanoma [37]. The SAMMSON is located 30 kilobase pairs downstream of MITF, a melanoma-specific transcriptional regulator, whose expression correlates with MITF. However, unlike the relationship between FOXCUT and FOXC1, SAMMSON does not directly contact MITF but epigenetically regulates transcription factors influencing MITF expression. While MITF is a melanoma-specific factor, SAMMSON has also been observed to be up-regulated in thyroid cancer [38] and glioblastoma [39]. What genes SAMMSON couples to in the pulmonary veins of patients with AF is a subject for future research.
Furthermore, lncRNAs can imply regulation of gene expression by environmental factors, but the specific environmental factors themselves are often unknown. On the other hand, increased cytokine receptor expression has been detected in sub-network A, indicating that one of the environmental factors is inflammation or oxidative stress. Both IL13RA2 and IL31RA are receptor genes for inflammatory cytokines, IL13 and IL31. These cytokines play essential roles in acquired immunity [40,41]. In recent years, IL13RA2 has also been reported as a marker for several cancer cells [42,43,44,45].
Although few studies have investigated the analogies between AF and cancer, both diseases are characterized by a high prevalence in the elderly and an increased incidence with age. Previous studies have revealed that the transcriptome profile of age-related changes is characterized by increased inflammation-related genes, among other features [46]. Therefore, age-related changes in molecular profiles may contribute to the risk of age-related diseases through increased inflammation. On the other hand, we do not know of any other transcriptomic studies of human cardiac PV, so we do not know the relationship between the molecules identified in this study and aging. Further profiling of pulmonary veins will lead to a better understanding of age-related diseases.
In this way, it has been explained that sub-network A represents environmental factors and sub-network B myocardial functional factors. In contrast, sub-network C is a mixture of up-and-down-regulated genes and a few genes with a low degree, so biological considerations are challenging. Increasing the number of significant genes would be necessary to make sense of these complementary networks by increasing the sample size.

3.4. Limitations

The primary limitation of the current study is the limited size of sample numbers. The present analysis has been performed on three AF samples and three normal heartbeat samples. It is not guaranteed that the results would be replicated if the sample size were huge or if the property of the domination were aligned by sex, age, or race. However, our study focuses on comparing multiple cardiac regions, and it is not easy to include more replicates. In the RNA-seq research community, a sample size of n = 3 is often considered a minimum requirement. This is even clearly stated in the submission guidelines of specialized omics journals such as Genomics (https://www.sciencedirect.com/journal/genomics, accessed on 3 May 2023). Thus, our study design is considered acceptable. Besides, we took advantage of the properties of omics data to make our analysis more robust with the GO and co-expression gene network analyses. GO analysis uses genome-wide information and provides more reliable biological findings even with small sample sizes. The gene network analysis identifies essential genes using genome-wide information from all n = 46 samples.
Second, we are unable to analyze molecules regulated by micro RNAs because our analysis depends on detecting poly A tails. Some connexin genes, one of the AF-related gene family [47,48,49], are regulated by micro RNAs [50,51], but unfortunately, post-transcriptional modifications through micro RNAs are outside the focus of this study. Moreover, our hypothesis discussed above should also be confirmed in vitro to see how endothelial cells transform into fibroblasts in the future.

4. Materials and Methods

4.1. RNA Extractions from Donated Hearts and Right Atrial Appendage Collections

Human heart samples from subjects with or without persistent atrial fibrillation (AF) for left atrium (LA), left atrial appendage (LAA), right atrium (RA), sinoatrial node (SAN), left ventricle (LV), right ventricle (RV), and pulmonary vein (PV) were purchased from AnaBios Corporation (San Diego, CA, USA), which provides heart organs through the organ procurement organization of the USA in compliance with the Health Insurance Portability and Accountability Act. All personal information of donors is protected. Donated hearts were separated into SAN, LA, LAA, RA, SAN, LV, RV, and PV. These samples were frozen in RNAlater (Thermo Fisher Science, Waltham, MA, USA) at −20 °C. Our laboratory extracted RNAs using QIAGEN RNeasy mini columns (QIAGEN, Venlo, Netherland). Right atrial appendage (RAA) samples were obtained from four patients undergoing mitral valve surgery. Two of these patients had permanent atrial fibrillation; the other two were in sinus rhythm with no history of atrial fibrillation. After excision, all tissue specimens were frozen in liquid nitrogen and stored at −80 °C for later use. RNAs were extracted using ISOGEN II (NIPPON GENE CO., LTD. Kandashiki-machi, Tokyo, Japan).

4.2. RNA-Seq and Data Processing on the Samples

The high-throughput sequence was demonstrated with NovaSeq 6000 (Illumina, San Diego, CA, USA). First, extracted RNAs were purified by poly(A) capture. Resultant mRNAs were then fragmented and reverse-transcribed into single-stranded complementary DNAs (cDNAs). Subsequently, cDNAs were double-stranded by a DNA polymerase. Deoxy UTPs (dUTPs) were mixed in nucleotide materials during the polymerase reactions. Both double-stranded DNA (ds DNA) ends were ligated to a 13 bp adapter sequence. Both double-stranded DNA (ds DNA) ends were ligated to a 13 bp adapter sequence. Next, the ds DNAs were subjected to PCR amplification to prepare the multi-sized DNA library. NovaSeq Control software v1.4.0 analyzed the sequencing runs and tag sequences classified each read in the raw sequencing data. fastp software (version 0.12.4) was used for read quality control and adapter removal [52]. Reads were aligned using STAR software (version 2.7.0a) [53]. The fasta and gtf files of GRCh38/release105 obtained from the Ensemble database were used as reference genome and gene annotation information. The count values of each gene were quantified from the alignment results by featureCounts software (see [54]). We calculated the Transcripts Per Kilobase Million (TPM) values based on the calculated gene lengths and gene counts [55,56]. The current study targeted eight heart regions: left atrium (LA), left atrial appendage (LAA), left ventricle (LV), pulmonary vein (PV), right atrium (RA), right atrial appendage (RAA), right ventricle (RV), and sinoatrial node (SAN). Twenty-three samples from sinus rhythm (SR) and 23 samples from AF patients, a total of 46 samples of RNA-seq data, were used for analysis. Of these, the 21 control samples other than RAA are the same as in the previous report [57] and have been validated by a different experiment (GSE112339 in Gene Expression Omnibus (GEO) database). The TPM values were transformed by log2(intensity + 1) and used in subsequent analyses as gene expression intensity. Genes with zero TPM values in all 46 samples were excluded from the analysis. Protein coding genes and long non-coding RNA (lncRNA) genes were extracted based on gene feature format files provided by the Ensemble database in the same version used for alignment. Finally, 33,535 genes were included in this analysis, including 18,899 protein-coding genes and 14,636 lncRNA genes. The obtained data were subsequently analyzed in the following order: (1) differential expression gene (DEG) analysis, (2) multivariate analysis-of-covariance by manova function in R to calculate Wilks’ lambda, (3) coding RNA annotation by gene ontology (GO) analysis, (4) cancer-related lncRNA annotation, (5) co-expression network analysis.

4.3. Differential Expression Gene (DEG) Analysis

DEG analysis between SR and AF was performed using TCC (an acronym for Tag Count Comparison) in the R package. Here, the trimmed mean of M values (TMM) and edgeR methods were employed for normalization and testing, respectively [58]; genes with FDR < 0.05 were identified as significant DEGs. Gene Ontology (GO) analysis was performed using R’s clusterProfiler package to determine the GO-associated gene set (Benjamini-Hochberg adjusted p-value < 0.05) [59]. In the GO analysis, all genes included in the analysis were set to background. The R package’s org.Hs.eg.db (version 3.13.0) was used as the GO and gene annotation database. The list of transcription factors in humans was obtained from AnimalTFDB4.0 [60]. A gene co-expression network analysis was performed to perform a functional analysis of the identified DEGs. Pearson correlation coefficient matrices were calculated for DEGs. A gene network was constructed by connecting gene pairs with absolute values of correlation coefficients >0.8. The Cytoscape software (version 3.9.1, https://cytoscape.org/) visualizes the network.

4.4. Clustering and Heatmap Visualization

Clustering and heatmap drawings of the samples were applied to the gene expression intensity matrix. Hierarchical clustering of samples was obtained by applying the dist and hclust functions in R with default settings. The heatmap.2 function and dendextend packages [61] in the gplots package of the R were used for heatmap and clustering drawing (https://CRAN.R-project.org/package=gplots). Each expression data was standardized before the heatmap drawing.

5. Conclusions

At least, the current study suggests two findings: (1) the pulmonary veins (PVs) were the regions with the most intensive gene expression alterations by the atrial fibrillation (AF) incidence. Therefore, the PV region should be focused more on researching the etiology of both paroxysmal and chronic AF in the future. (2) the AF remodeling is attributable to a post-transcriptional genetic regulation similar to carcinogenesis, e.g., the FOXCUT-FOXC1 axis, SAMMSON, and IL13RA2.
Thus, our analytic results may suggest that rather than searching the inborn genetic backgrounds, a healthy lifestyle, such as smoking cessation and exercise, is a more protective strategy against AF progression in analogy with cancer prevention. This is because that familial AF is a sporadic disease, and the relative risk of AF-associated SNPs is small enough. Future research with an increase in the number of PV samples will lead to a novel understanding of the pathophysiology of AF.

Supplementary Materials

The supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/ijms241310501/s1.

Author Contributions

W.I. wrote the manuscript and performed experiments and performed the data analysis. D.T. funded and supervised the study. D.O. is the main analyzer of this project. D.K. performed the experiment. T.I. designed the experiment and funding. M.O. processed RNA-seq data. K.I. and H.Y. supervised the study. K.O. funded and supervised the study. Y.O. wrote the manuscript and designed the study and performed experiments and analyzed the data and supervised the study. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by JSPS KAKENHI (Grant-in-Aid for Young Scientists) grant number 20K16116 to Y.O., JSPS KAKENHI (Grant-in-Aid for Young Scientists) Grant number 21K16045 to D.T., JSPS KAKENHI (Grant-in-Aid for Scientific Research C) Grant number 22K08093 to K.O., and by collaboration with Ono Pharmaceutical Cooperation, Ltd. (Japan).

Institutional Review Board Statement

The protocols used in this study were approved by the Ethics Review Committees of the Yamagata University Faculty of Medicine (#110;June 2nd 2017), the Ethics Committees of Kyoto University Graduate School and Faculty of Medicine (R3118;September 17th 2021).
Akita University Certified Clinical Research Revies Board (#1982;May 21st 2021, and #2496; April 25th 2022).

Informed Consent Statement

For all samples except right atrial appendage (RAA), informed consent was obtained appropriately from all subjects involved in the study by AnaBios Corporation, along with the Health Insurance Portability and Accountability Act. For the RAA, patients gave written informed consent, and these samples were obtained with the approval of Akita University’s Ethics Review Board, as mentioned above.

Data Availability Statement

The analytic results in this study are available on a request basis. All RNA-seq data we analyzed in the current study are uploaded into the Gene Expression Omnibus (GEO) database. GSE203367 and GSE226283 are accession numbers for RNA-seq on seven cardiac regions from sinus rhythm control and atrial fibrillation patients, respectively. GSE226282 is the accession number for RNA-seq on cut-off right atrial appendages.

Acknowledgments

We thank Steve Wilhite for curating the GEO submissions.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Brugada, R.; Tapscott, T.; Czernuszewicz, G.Z.; Marian, A.; Iglesias, A.; Mont, L.; Brugada, J.; Girona, J.; Domingo, A.; Bachinski, L.L.; et al. Identification of a Genetic Locus for Familial Atrial Fibrillation. N. Engl. J. Med. 1997, 336, 905–911. [Google Scholar] [CrossRef] [PubMed]
  2. Chen, Y.-H.; Xu, S.-J.; Bendahhou, S.; Wang, X.-L.; Wang, Y.; Xu, W.-Y.; Jin, H.-W.; Sun, H.; Su, X.-Y.; Zhuang, Q.-N.; et al. KCNQ1 Gain-of-Function Mutation in Familial Atrial Fibrillation. Science 2003, 299, 251–254. [Google Scholar] [CrossRef] [PubMed]
  3. Wang, Q.; Curran, M.E.; Splawski, I.; Burn, T.C.; Millholland, J.M.; VanRaay, T.J.; Shen, J.; Timothy, K.W.; Vincent, G.M.; De Jager, T.; et al. Positional cloning of a novel potassium channel gene: KVLQT1 mutations cause cardiac arrhythmias. Nat. Genet. 1996, 12, 17–23. [Google Scholar] [CrossRef] [PubMed]
  4. Ragab, A.A.Y.; Sitorus, G.D.S.; Brundel, B.B.; Groot, N.M.S.D. The Genetic Puzzle of Familial Atrial Fibrillation. Front. Cardiovasc. Med. 2020, 7, 14. [Google Scholar] [CrossRef]
  5. Gudbjartsson, D.F.; Arnar, D.O.; Helgadottir, A.; Gretarsdottir, S.; Holm, H.; Sigurdsson, A.; Jonasdottir, A.; Baker, A.; Thorleifsson, G.; Kristjansson, K.; et al. Variants conferring risk of atrial fibrillation on chromosome 4q25. Nature 2007, 448, 353–357. [Google Scholar] [CrossRef]
  6. Wang, J.; Klysik, E.; Sood, S.; Johnson, R.L.; Wehrens, X.H.T.; Martin, J.F. Pitx2 prevents susceptibility to atrial arrhythmias by inhibiting left-sided pacemaker specification. Proc. Natl. Acad. Sci. USA 2010, 107, 9753–9758. [Google Scholar] [CrossRef] [Green Version]
  7. Mommersteeg, M.T.M.; Brown, N.A.; Prall, O.W.; de Gier-de Vries, C.; Harvey, R.P.; Moorman, A.F.M.; Christoffels, V.M. Pitx2c and Nkx2-5 are required for the formation and identity of the pulmonary myocardium. Circ. Res. 2007, 101, 902–909. [Google Scholar] [CrossRef] [Green Version]
  8. Gudbjartsson, D.F.; Holm, H.; Gretarsdottir, S.; Thorleifsson, G.; Walters, G.B.; Thorgeirsson, G.; Gulcher, J.; Mathiesen, E.B.; Njølstad, I.; Nyrnes, A.; et al. A sequence variant in ZFHX3 on 16q22 associates with a trial fibrillation and ischemic stroke. Nat. Genet. 2009, 41, 876–878. [Google Scholar] [CrossRef]
  9. Benjamin, E.J.; Rice, K.M.; Arking, D.E.; Pfeufer, A.; van Noord, C.; Smith, A.V.; Schnabel, R.B.; Bis, J.C.; Boerwinkle, E.; Sinner, M.F.; et al. Variants in ZFHX3 are associated with atrial fibrillation in individuals of European ancestry. Nat. Genet. 2009, 41, 879–881. [Google Scholar] [CrossRef]
  10. Ellinor, P.T.; Lunetta, K.L.; Glazer, N.L.; Pfeufer, A.; Alonso, A.; Chung, M.; Sinner, M.; De Bakker, P.; Mueller, M.; Lubitz, S.; et al. Common variants in KCNN3 are associated with lone atrial fibrillation. Nat. Genet. 2010, 42, 240–244. [Google Scholar] [CrossRef]
  11. Miyazawa, K.; Ito, K.; Ito, M.; Zou, Z.; Kubota, M.; Nomura, S.; Matsunaga, H.; Koyama, S.; Ieki, H.; Akiyama, M.; et al. Cross-ancestry genome-wide analysis of atrial fibrillation unveils disease biology and enables cardioembolic risk prediction. Nat. Genet. 2023, 55, 187–197. [Google Scholar] [CrossRef]
  12. Ellinor, P.T.; Lunetta, K.L.; Albert, C.M.; Glazer, N.L.; Ritchie, M.D.; Smith, A.V.; Arking, D.E.; Müller-Nurasyid, M.; Krijthe, B.P.; Lubitz, S.A.; et al. Meta-analysis identifies six new susceptibility loci for atrial fibrillation. Nat. Genet. 2012, 44, 670–675. [Google Scholar] [CrossRef] [Green Version]
  13. Kim, Y.H.; Lee, J.H.; Lim, D.S.; Shim, W.J.; Ro, Y.M.; Park, G.H.; Becker, K.G.; Cho-Chung, Y.S.; Kim, M.-K. Gene expression profiling of oxidative stress on atrial fibrillation in humans. Exp. Mol. Med. 2003, 35, 336–349. [Google Scholar] [CrossRef] [Green Version]
  14. Barth, A.S.; Merk, S.; Arnoldi, E.; Zwermann, L.; Kloos, P.; Gebauer, M.; Steinmeyer, K.; Bleich, M.; Kääb, S.; Hinterseer, M.; et al. Reprogramming of the human atrial transcriptome in permanent atrial fibrillation: Expression of a ventricular-like genomic signature. Circ. Res. 2005, 96, 1022–1029. [Google Scholar] [CrossRef] [Green Version]
  15. Zhang, J.; Huang, X.; Wang, X.; Gao, Y.; Liu, L.; Li, Z.; Chen, X.; Zeng, J.; Ye, Z.; Li, G. Identification of potential crucial genes in atrial fibrillation: A bioinformatic analysis. BMC Med. Genom. 2020, 13, 1–8. [Google Scholar] [CrossRef]
  16. Xie, L.; Huang, G.; Gao, M.; Huang, J.; Li, H.; Xia, H.; Xiang, X.; Wu, S.; Ruan, Y. Identification of Atrial Fibrillation-Related lncRNA Based on Bioinformatic Analysis. Dis. Markers 2022, 2022, 8307975. [Google Scholar] [CrossRef]
  17. Stuart, J.M.; Segal, E.; Koller, D.; Kim, S.K. A Gene-Coexpression Network for Global Discovery of Conserved Genetic Modules. Science 2003, 302, 249–255. [Google Scholar] [CrossRef] [Green Version]
  18. Li, J.; Li, C.; Wang, J.; Song, G.; Zhao, Z.; Wang, H.; Wang, W.; Li, H.; Li, Z.; Miao, Y.; et al. Genome-wide analysis of differentially expressed lncRNAs and mRNAs in primary gonadotrophin adenomas by RNA-seq. Oncotarget 2017, 8, 4595–4606. [Google Scholar] [CrossRef] [Green Version]
  19. Jordan, I.K.; Mariño-Ramírez, L.; Wolf, Y.I.; Koonin, E.V. Conservation and Coevolution in the Scale-Free Human Gene Coexpression Network. Mol. Biol. Evol. 2004, 21, 2058–2070. [Google Scholar] [CrossRef]
  20. Haïssaguerre, M.; Jaïs, P.; Shah, D.C.; Takahashi, A.; Hocini, M.; Quiniou, G.; Garrigue, S.; Le Mouroux, A.; Le Métayer, P.; Clémenty, J. Spontaneous initiation of atrial fibrillation by ectopic beats originating in the pulmonary veins. N. Engl. J. Med. 1998, 339, 659–666. [Google Scholar] [CrossRef] [Green Version]
  21. Nattel, S.; Heijman, J.; Zhou, L.; Dobrev, D. Molecular Basis of Atrial Fibrillation Pathophysiology and Therapy: A Translational Perspective. Circ. Res. 2020, 127, 51–72. [Google Scholar] [CrossRef] [PubMed]
  22. Michaud, G.F.; Stevenson, W.G. Atrial Fibrillation. N. Engl. J. Med. 2021, 384, 353–361. [Google Scholar] [CrossRef] [PubMed]
  23. Roney, C.H.; Bayer, J.D.; Cochet, H.; Meo, M.; Dubois, R.; Jaïs, P.; Vigmond, E.J. Variability in pulmonary vein electrophysiology and fibrosis determines arrhythmia susceptibility and dynamics. PLoS Comput. Biol. 2018, 14, e1006166. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Steiner, I.; Hájková, P.; Kvasnička, J.; Kholová, I. Myocardial sleeves of pulmonary veins and atrial fibrillation: A postmortem histopathological study of 100 subjects. Virchows Arch. 2006, 449, 88–95. [Google Scholar] [CrossRef] [PubMed]
  25. Akazawa, H.; Komuro, I. Cardiac transcription factor Csx/Nkx2-5: Its role in cardiac development and diseases. Pharmacol. Ther. 2005, 107, 252–268. [Google Scholar] [CrossRef] [PubMed]
  26. Roselli, C.; Chaffin, M.D.; Weng, L.C.; Aeschbacher, S.; Ahlberg, G.; Albert, C.M.; Almgren, P.; Alonso, A.; Anderson, C.D.; Aragam, K.G.; et al. Multi-ethnic genome-wide association study for atrial fibrillation. Nat. Genet. 2018, 50, 1225–1233. [Google Scholar] [CrossRef] [Green Version]
  27. Sinner, M.F.; Pfeufer, A.; Akyol, M.; Beckmann, B.-M.; Hinterseer, M.; Wacker, A.; Perz, S.; Sauter, W.; Illig, T.; Näbauer, M.; et al. The non-synonymous coding IKr-channel variant KCNH2-K897T is associated with atrial fibrillation: Results from a systematic candidate gene-based analysis of KCNH2 (HERG). Eur. Heart J. 2008, 29, 907–914. [Google Scholar] [CrossRef] [Green Version]
  28. Watanabe, H.; Darbar, D.; Kaiser, D.W.; Jiramongkolchai, K.; Chopra, S.; Donahue, B.S.; Kannankeril, P.J.; Roden, D.M. Mutations in Sodium Channel β1- and β2-Subunits Associated With Atrial Fibrillation. Circ. Arrhythmia Electrophysiol. 2009, 2, 268–275. [Google Scholar] [CrossRef] [Green Version]
  29. Statello, L.; Guo, C.-J.; Chen, L.-L.; Huarte, M. Gene regulation by long non-coding RNAs and its biological functions. Nat. Rev. Mol. Cell Biol. 2021, 22, 96–118, Correction in Nat. Rev. Mol. Cell Biol. 2021, 22, 159. [Google Scholar] [CrossRef]
  30. Yang, Z.; Jiang, S.; Cheng, Y.; Li, T.; Hu, W.; Ma, Z.; Chen, F.; Yang, Y. FOXC1 in cancer development and therapy: Deciphering its emerging and divergent roles. Ther. Adv. Med. Oncol. 2017, 9, 797–816. [Google Scholar] [CrossRef] [Green Version]
  31. Seo, S.; Kume, T. Forkhead transcription factors, Foxc1 and Foxc2, are required for the morphogenesis of the cardiac outflow tract. Dev. Biol. 2006, 296, 421–436. [Google Scholar] [CrossRef] [Green Version]
  32. Peng, T.; Tian, Y.; Boogerd, C.J.; Lu, M.M.; Kadzik, R.S.; Stewart, K.M.; Evans, S.M.; Morrisey, E.E. Coordination of heart and lung co-development by a multipotent cardiopulmonary progenitor. Nature 2013, 500, 589–592. [Google Scholar] [CrossRef] [Green Version]
  33. Debnath, P.; Huirem, R.S.; Dutta, P.; Palchaudhuri, S. Epithelial–mesenchymal transition and its transcription factors. Biosci. Rep. 2022, 42, 1–29. [Google Scholar] [CrossRef]
  34. Kato, T.; Sekiguchi, A.; Sagara, K.; Tanabe, H.; Takamura, M.; Kaneko, S.; Aizawa, T.; Fu, L.-T.; Yamashita, T. Endothelial–mesenchymal transition in human atrial fibrillation. J. Cardiol. 2017, 69, 706–711. [Google Scholar] [CrossRef] [Green Version]
  35. Lai, Y.J.; Tsai, F.C.; Chang, G.J.; Chang, S.H.; Huang, C.C.; Chen, W.J.; Yeh, Y.H. miR-181b targets semaphorin 3A to mediate TGF-β–induced endothelial-mesenchymal transition related to atrial fibrillation. J. Clin. Investig. 2022, 132, e142548. [Google Scholar] [CrossRef]
  36. Klein, A.L.; Grimm, R.A.; Murray, R.D.; Apperson-Hansen, C.; Asinger, R.W.; Black, I.W.; Davidoff, R.; Erbel, R.; Halperin, J.L.; Orsinelli, D.A.; et al. Use of Transesophageal Echocardiography to Guide Cardioversion in Patients with Atrial Fibrillation. N. Engl. J. Med. 2001, 344, 1411–1420. [Google Scholar] [CrossRef]
  37. Leucci, E.; Vendramin, R.; Spinazzi, M.; Laurette, P.; Fiers, M.; Wouters, J.; Radaelli, E.; Eyckerman, S.; Leonelli, C.; Vanderheyden, K.; et al. Melanoma addiction to the long non-coding RNA SAMMSON. Nature 2016, 531, 518–522. [Google Scholar] [CrossRef]
  38. Shao, L.; Sun, W.; Wang, Z.; Dong, W.; Qin, Y. Long noncoding RNA SAMMSON promotes papillary thyroid carcinoma progression through p300/Sp1 axis and serves as a novel diagnostic and prognostic biomarker. IUBMB Life 2020, 72, 237–246. [Google Scholar] [CrossRef]
  39. Ni, H.; Wang, K.; Xie, P.; Zuo, J.; Liu, W.; Liu, C. LncRNA SAMMSON Knockdown Inhibits the Malignancy of Glioblastoma Cells by Inactivation of the PI3K/Akt Pathway. Cell. Mol. Neurobiol. 2021, 41, 79–90. [Google Scholar] [CrossRef]
  40. Khurana Hershey, G.K. IL-13 receptors and signaling pathways: An evolving web. J. Allergy Clin. Immunol. 2003, 111, 677–690. [Google Scholar] [CrossRef]
  41. Cornelissen, C.; Lüscher-Firzlaff, J.; Baron, J.M.; Lüscher, B. Signaling by IL-31 and functional consequences. Eur. J. Cell Biol. 2012, 91, 552–566. [Google Scholar] [CrossRef] [PubMed]
  42. Zeng, J.; Zhang, J.; Yang, Y.Z.; Wang, F.; Jiang, H.; Chen, H.-D.; Wu, H.-Y.; Sai, K.; Hu, W.-M. IL13RA2 is overexpressed in malignant gliomas and related to clinical outcome of patients. Am. J. Transl. Res. 2020, 12, 4702–4714. [Google Scholar] [PubMed]
  43. Sai, K.K.S.; Sattiraju, A.; Almaguel, F.G.; Xuan, A.; Rideout, S.; Krishnaswamy, R.S.; Zhang, J.; Herpai, D.M.; Debinski, W.; Mintz, A. Peptide-based PET imaging of the tumor restricted IL13RA2 biomarker. Oncotarget 2017, 8, 50997–51007. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Chong, S.T.; Tan, K.M.; Kok, C.Y.L.; Guan, S.P.; Lai, S.H.; Lim, C.; Hu, J.; Sturgis, C.; Eng, C.; Lam, P.Y.P.; et al. IL13RA2 Is Differentially Regulated in Papillary Thyroid Carcinoma vs. Follicular Thyroid Carcinoma. J. Clin. Endocrinol. Metab. 2019, 104, 5573–5584. [Google Scholar] [CrossRef]
  45. Shibasaki, N.; Yamasaki, T.; Kanno, T.; Arakaki, R.; Sakamoto, H.; Utsunomiya, N.; Inoue, T.; Tsuruyama, T.; Nakamura, E.; Ogawa, O.; et al. Role of IL13RA2 in sunitinib resistance in clear cell renal cell carcinoma. PLoS ONE 2015, 10, e0130980. [Google Scholar] [CrossRef] [Green Version]
  46. de Magalhães, J.P.; Curado, J.; Church, G.M. Meta-analysis of age-related gene expression profiles identifies common signatures of aging. Bioinformatics 2009, 25, 875–881. [Google Scholar] [CrossRef] [Green Version]
  47. Kato, T.; Iwasaki, Y.K.; Nattel, S. Connexins and atrial fibrillation: Filling in the gaps. Circulation 2012, 125, 203–206. [Google Scholar] [CrossRef] [Green Version]
  48. Gollob, M.H.; Jones, D.L.; Krahn, A.D.; Danis, L.; Gong, X.-Q.; Shao, Q.; Liu, X.; Veinot, J.P.; Tang, A.S.; Stewart, A.F.; et al. Somatic Mutations in the Connexin 40 Gene (GJA5) in atrial fibrillation. N. Engl. J. Med. 2006, 354, 2677–2688. [Google Scholar] [CrossRef]
  49. Tribulova, N.; Egan Benova, T.; Szeiffova Bacova, B.; Viczenczova, C.; Barancik, M. New aspects of pathogenesis of atrial fibrillation: Remodelling of intercalated discs. J. Physiol. Pharmacol. 2015, 66, 625–634. [Google Scholar]
  50. Xu, H.F.; Ding, Y.J.; Shen, Y.W.; Xue, A.-M.; Xu, H.-M.; Luo, C.-L.; Li, B.-X.; Liu, Y.-L.; Zhao, Z.-Q. MicroRNA-1 represses Cx43 expression in viral myocarditis. Mol. Cell. Biochem. 2012, 362, 141–148. [Google Scholar] [CrossRef]
  51. Yang, B.; Lin, H.; Xiao, J.; Lu, Y.; Luo, X.; Li, B.; Zhang, Y.; Xu, C.; Bai, Y.; Wang, H.; et al. The muscle-specific microRNA miR-1 regulates cardiac arrhythmogenic potential by targeting GJA1 and KCNJ2. Nat. Med. 2007, 13, 486–491. [Google Scholar] [CrossRef]
  52. Chen, S.; Zhou, Y.; Chen, Y.; Gu, J. Fastp: An ultra-fast all-in-one FASTQ preprocessor. Bioinformatics 2018, 34, i884–i890. [Google Scholar] [CrossRef]
  53. Dobin, A.; Davis, C.A.; Schlesinger, F.; Drenkow, J.; Zaleski, C.; Jha, S.; Batut, P.; Chaisson, M.; Gingeras, T.R. STAR: Ultrafast universal RNA-seq aligner. Bioinformatics 2013, 29, 15–21. [Google Scholar] [CrossRef]
  54. Liao, Y.; Smyth, G.K.; Shi, W. featureCounts: An efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 2014, 30, 923–930. [Google Scholar] [CrossRef] [Green Version]
  55. Li, B.; Ruotti, V.; Stewart, R.M.; Thomson, J.A.; Dewey, C.N. RNA-Seq gene expression estimation with read mapping uncertainty. Bioinformatics 2010, 26, 493–500. [Google Scholar] [CrossRef] [Green Version]
  56. Wagner, G.P.; Kin, K.; Lynch, V.J. Measurement of mRNA abundance using RNA-seq data: RPKM measure is inconsistent among samples. Theory Biosci. 2012, 131, 281–285. [Google Scholar] [CrossRef]
  57. Okada, D.; Okamoto, Y.; Io, T.; Oka, M.; Kobayashi, D.; Ito, S.; Yamada, R.; Ishii, K.; Ono, K. Comparative Study of Transcriptome in the Hearts Isolated from Mice, Rats, and Humans. Biomolecules 2022, 12, 859. [Google Scholar] [CrossRef]
  58. Sun, J.; Nishiyama, T.; Shimizu, K.; Kadota, K. TCC: An R package for comparing tag count data with robust normalization strategies. BMC Bioinform. 2013, 14, 1–14. [Google Scholar] [CrossRef] [Green Version]
  59. Wu, T.; Hu, E.; Xu, S.; Chen, M.; Guo, P.; Dai, Z.; Feng, T.; Zhou, L.; Tang, W.; Zhan, L.; et al. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data clusterProfiler4.0: A universal enrichment tool for interpreting omics data. Innovation 2021, 2, 100141. [Google Scholar]
  60. Shen, W.; Chen, S.-Y.; Gan, Z.; Zhang, Y.-Z.; Yue, T.; Chen, M.-M.; Xue, Y.; Hu, H.; Guo, A.-Y. AnimalTFDB 4.0: A comprehensive animal transcription factor database updated with variation and expression annotations. Nucleic Acids Res. 2023, 51, D39–D45. [Google Scholar] [CrossRef]
  61. Galili, T. Dendextend: An R package for visualizing, adjusting and comparing trees of hierarchical clustering. Bioinformatics 2015, 31, 3718–3720. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Figure 1. Overview of the obtained data. Twenty-three samples, each with or without AF, were compared, and (A) 201 coding RNA and (B) 212 long non-coding RNA (lncRNA) were detected as differential expression genes (DEGs). Samples were hierarchically clustered, and the origin of the samples was color coded below the dendrogram. Black codes control and blue codes AF. (C) Degree of separation between control and AF samples in each cardiac region. The acquired data were analyzed using principal component analysis to construct a two-dimensional space with first and second principal components. Wilks’ lambda calculates how much each cardiac region data is separated between control and AF. The smaller the value, the greater the degree of separation. The negative logarithm is scaled to present the degree of separation.
Figure 1. Overview of the obtained data. Twenty-three samples, each with or without AF, were compared, and (A) 201 coding RNA and (B) 212 long non-coding RNA (lncRNA) were detected as differential expression genes (DEGs). Samples were hierarchically clustered, and the origin of the samples was color coded below the dendrogram. Black codes control and blue codes AF. (C) Degree of separation between control and AF samples in each cardiac region. The acquired data were analyzed using principal component analysis to construct a two-dimensional space with first and second principal components. Wilks’ lambda calculates how much each cardiac region data is separated between control and AF. The smaller the value, the greater the degree of separation. The negative logarithm is scaled to present the degree of separation.
Ijms 24 10501 g001
Figure 2. Populations of differential expression genes (DEGs) in the pulmonary vein (PV) and left ventricle (LV). 145 DEGs in PV and 14 in LV share four genes, S100A1, GABARAP, CCDC144A, and long non-coding ENSG00000289474.
Figure 2. Populations of differential expression genes (DEGs) in the pulmonary vein (PV) and left ventricle (LV). 145 DEGs in PV and 14 in LV share four genes, S100A1, GABARAP, CCDC144A, and long non-coding ENSG00000289474.
Ijms 24 10501 g002
Figure 3. Differential expression genes (DEGs) in the pulmonary veins (PVs) and left ventricles (LVs). DEGs are presented in order of statistical significance. The red dotted line indicates a p-value = 0.0005. Protein-coding RNAs and long non-coding RNAs (lncRNAs) are color-coded by gray and orange, respectively. (A) Top 50 protein-coding DEGs in PV. (B) lncRNAs as DEGs in PV. (C) All DEGs in the left ventricle.
Figure 3. Differential expression genes (DEGs) in the pulmonary veins (PVs) and left ventricles (LVs). DEGs are presented in order of statistical significance. The red dotted line indicates a p-value = 0.0005. Protein-coding RNAs and long non-coding RNAs (lncRNAs) are color-coded by gray and orange, respectively. (A) Top 50 protein-coding DEGs in PV. (B) lncRNAs as DEGs in PV. (C) All DEGs in the left ventricle.
Ijms 24 10501 g003
Figure 4. Functional annotation of differential expression genes (DEGs) in the pulmonary veins (PVs) by Gene Ontology (GO) analysis. Enriched GOs are presented in order of statistical significance. GO IDs are provided on the left. The corresponding negative logarithm of p-values and descriptions are in the middle and on the right, respectively. The corresponding genes are listed in Table 2.
Figure 4. Functional annotation of differential expression genes (DEGs) in the pulmonary veins (PVs) by Gene Ontology (GO) analysis. Enriched GOs are presented in order of statistical significance. GO IDs are provided on the left. The corresponding negative logarithm of p-values and descriptions are in the middle and on the right, respectively. The corresponding genes are listed in Table 2.
Ijms 24 10501 g004
Figure 5. The fundamental structures of the co-expression network in differential expression genes (DEGs). (A) Co-expressed gene pairs are clustered and visualized as a heatmap. The color scale corresponding to the correlation coefficient is inserted on the right. Based on absolute values of a correlation coefficient of 0.8 or higher, there are three subnetworks: a compartment containing 63 genes (a) and a compartment containing 46 genes (b), as well as a compartment (c1) and (c2), which are composed of 10 and 5 genes, respectively. (c1) and (c2) are linked by negative correlation. The rest of the subnetworks are isolated genes with fewer than two nodes. Compartment (a) is represented as subnetwork A, compartment (b) as sub-network B, and compartments (c1) and (c2) as sub-network C, which are represented in Figure 6A–C, respectively. (B) The histogram of gene degree. The degree is proportional to the number of intergenic connections. The distribution of this histogram indicates that the network is scale-free.
Figure 5. The fundamental structures of the co-expression network in differential expression genes (DEGs). (A) Co-expressed gene pairs are clustered and visualized as a heatmap. The color scale corresponding to the correlation coefficient is inserted on the right. Based on absolute values of a correlation coefficient of 0.8 or higher, there are three subnetworks: a compartment containing 63 genes (a) and a compartment containing 46 genes (b), as well as a compartment (c1) and (c2), which are composed of 10 and 5 genes, respectively. (c1) and (c2) are linked by negative correlation. The rest of the subnetworks are isolated genes with fewer than two nodes. Compartment (a) is represented as subnetwork A, compartment (b) as sub-network B, and compartments (c1) and (c2) as sub-network C, which are represented in Figure 6A–C, respectively. (B) The histogram of gene degree. The degree is proportional to the number of intergenic connections. The distribution of this histogram indicates that the network is scale-free.
Ijms 24 10501 g005
Figure 6. Subnetworks A, B, and C are expanded into (AC) in this figure. The diameter of the nodes is proportional to the degree that the gene has. Coding RNAs are grey, lncRNAs are orange nodes, and square nodes indicate ion channel-related genes enriched in the GO analysis with a marine blue border. Transcription factors are highlighted in red letters. Subnetwork A consists of 62 genes and is a network of genes whose expression is up-regulated compared to controls. Subnetwork B is a network of 40 genes whose expression is down-regulated. Subnetwork C consists of five up-regulated and ten down-regulated genes. All nodes represent differential expression genes of the pulmonary vein except PTGIS, which is in subnetwork A.
Figure 6. Subnetworks A, B, and C are expanded into (AC) in this figure. The diameter of the nodes is proportional to the degree that the gene has. Coding RNAs are grey, lncRNAs are orange nodes, and square nodes indicate ion channel-related genes enriched in the GO analysis with a marine blue border. Transcription factors are highlighted in red letters. Subnetwork A consists of 62 genes and is a network of genes whose expression is up-regulated compared to controls. Subnetwork B is a network of 40 genes whose expression is down-regulated. Subnetwork C consists of five up-regulated and ten down-regulated genes. All nodes represent differential expression genes of the pulmonary vein except PTGIS, which is in subnetwork A.
Ijms 24 10501 g006
Table 1. Patient information of donated or collected samples. SR and AF indicate sinus rhythm and persistent atrial fibrillation, respectively. For sex, M and F indicate male and female, respectively. For medical history: HD; heart disease, HT; hypertension, DM; diabetes, AF; atrial fibrillation, MR; mitral regurgitation, TR; tricuspid regurgitation, CAD; coronary artery disease, HCH; hypertensive cardiac hypertrophy. For heart samples: SAN; sinoatrial node, LA; left atrium, LAA; left atrial appendage, RA; right atrium, LV; the left ventricle, RV; the right ventricle, PV; pulmonary vein, RAA; right atrial appendage.
Table 1. Patient information of donated or collected samples. SR and AF indicate sinus rhythm and persistent atrial fibrillation, respectively. For sex, M and F indicate male and female, respectively. For medical history: HD; heart disease, HT; hypertension, DM; diabetes, AF; atrial fibrillation, MR; mitral regurgitation, TR; tricuspid regurgitation, CAD; coronary artery disease, HCH; hypertensive cardiac hypertrophy. For heart samples: SAN; sinoatrial node, LA; left atrium, LAA; left atrial appendage, RA; right atrium, LV; the left ventricle, RV; the right ventricle, PV; pulmonary vein, RAA; right atrial appendage.
NO.AF/SRAgeSexStrainTabacoAlcoholHDHTDMCancerSamples
1SR52MCaucasianYesYesNoNoNoNoSAN, LA, LAA, RA, LV, RV, PV
2SR46MFilipinoYesYesNomedicationmedicationNoSAN, LA, LAA, RA, LV, RV, PV
3SR23FCaucasianYesNoNoNoNoNoSAN, LA, LAA, RA, LV, RV, PV
4SR50MJapaneseYesYesMR, TRmedicationNoNoRAA
5SR50MJapaneseYesYesMRNoNoNoRAA
6AF58MCaucasianYesYesCAD, AFNoNoNoSAN, LA, LAA, RA, LV, RV, PV
7AF54MCaucasianYesYesAFNoNoNoSAN, LA, LAA, RA, LV, RV, PV
8AF58FCaucasianYesYesAFmedicationNoNoSAN, LA, LAA, RA, LV, RV, PV
9AF59MJapaneseYesYesMR, TR, AFmedicationNoNoRAA
10AF70FJapaneseNoYesMR,
HCH
AF
medicationNoNoRAA
Table 2. Enriched gene sets detected by gene ontology (GO) analysis in the pulmonary veins (PVs). The Detected GOs are on the left, and corresponding gene symbols are on the right. Most genes are significantly downregulated in PV of AF patients except GRIA2, GABRR1, and PKD2.
Table 2. Enriched gene sets detected by gene ontology (GO) analysis in the pulmonary veins (PVs). The Detected GOs are on the left, and corresponding gene symbols are on the right. Most genes are significantly downregulated in PV of AF patients except GRIA2, GABRR1, and PKD2.
DescriptionsGene Symbols
cation channel activityGRIA2, ATP5F1D, ATP6V0C, KCNH2, NALF2, LRRC38, CACNA1E, PKD2, SCN2B, KCNIP2, GRIN2C
ion channel activityGRIA2, GABRR1, ATP5F1D, ATP6V0C, KCNH2, NALF2, LRRC38, CACNA1E, PKD2, SCN2B, KCNIP2, GRIN2C
channel activityGRIA2, GABRR1, ATP5F1D, ATP6V0C, KCNH2, NALF2, LRRC38, CACNA1E, PKD2, SCN2B, KCNIP2, GRIN2C
passive transmembrane transporter activityGRIA2, GABRR1, ATP5F1D, ATP6V0C, KCNH2, NALF2, LRRC38, CACNA1E, PKD2, SCN2B, KCNIP2, GRIN2C
gated channel activityGRIA2, GABRR1, KCNH2, NALF2, LRRC38, CACNA1E, PKD2, SCN2B, KCNIP2, GRIN2C
voltage-gated cation channel activityKCNH2, LRRC38, CACNA1E, PKD2, KCNIP2, GRIN2C
voltage-gated ion channel activityKCNH2, LRRC38, CACNA1E, PKD2, SCN2B, KCNIP2, GRIN2C
voltage-gated channel activityKCNH2, LRRC38, CACNA1E, PKD2, SCN2B, KCNIP2, GRIN2C
metal ion transmembrane transporter activitySLC22A3, KCNH2, NALF2, LRRC38, CACNA1E, PKD2, SCN2B, KCNIP2, GRIN2C, SLC5A1
Table 3. Long non-coding RNAs (lncRNAs) annotation of atrial fibrillation-induced alterations in pulmonary veins based on cancer-related lncRNAs. Sixteen significant lncRNAs in PV were expressed on a log2 scale. Related carcinomas are listed based on two databases. (http://www.bio-bigdata.net/lnc2cancer/index.html, and http://fcgportal.org/TCLA/index.php, accessed on 1 February 2023). Red or blue letters and numbers indicate the increase or decrease in expression in the related disorder, respectively. GBM, glioblastoma; HCC, hepatocellular carcinoma; MME, malignant melanoma; NCP, nasopharyngeal carcinoma; GAC, gastric adenocarcinoma; BLBC, basal-like breast cancer; ESCC, esophageal squamous cell carcinoma; OSCC, oral squamous cell carcinoma; KIRC, kidney renal clear cell carcinoma; LUSC, lung squamous cell carcinoma; HNSC, head and neck squamous cell carcinoma; PRAD, prostate adenocarcinoma; BRCA, breast invasive carcinoma; COAD, colon adenocarcinoma; NA, not applicable.
Table 3. Long non-coding RNAs (lncRNAs) annotation of atrial fibrillation-induced alterations in pulmonary veins based on cancer-related lncRNAs. Sixteen significant lncRNAs in PV were expressed on a log2 scale. Related carcinomas are listed based on two databases. (http://www.bio-bigdata.net/lnc2cancer/index.html, and http://fcgportal.org/TCLA/index.php, accessed on 1 February 2023). Red or blue letters and numbers indicate the increase or decrease in expression in the related disorder, respectively. GBM, glioblastoma; HCC, hepatocellular carcinoma; MME, malignant melanoma; NCP, nasopharyngeal carcinoma; GAC, gastric adenocarcinoma; BLBC, basal-like breast cancer; ESCC, esophageal squamous cell carcinoma; OSCC, oral squamous cell carcinoma; KIRC, kidney renal clear cell carcinoma; LUSC, lung squamous cell carcinoma; HNSC, head and neck squamous cell carcinoma; PRAD, prostate adenocarcinoma; BRCA, breast invasive carcinoma; COAD, colon adenocarcinoma; NA, not applicable.
Unique Names/Ensembl IDsFold Changes (log2) in PV in AFExpressional Changes in Extra-Cardiovascular Carcinoma
SAMMSON9.29GBM, HCC, MME
FOXCUT8.41NCP, GAC, BLBC, ESCC, OSCC
ENSG000002513205.99KIRC, LUSC, HNSC
ENSG000002489274.16KIRC, PRAD
ENSG000002378072.46BRCA, COAD, PRAD
ENSG00000286429−4.73NA
SCTR-AS1−3.60LUSC
ENSG00000289474−9.35NA
CHL1-AS2−3.41NA
LINC02517−3.55NA
ENSG00000237596−3.72NA
ENSG00000289623−6.12NA
ENSG00000287592−3.61NA
ENSG00000254002−3.82NA
LINC01405−6.77HNSC
LINC01629−6.76NA
Table 4. Hub genes detected in the co-expression analysis (degree > 30). Marine blue and orange text color indicates ion channel activity and long non-coding RNA, respectively.
Table 4. Hub genes detected in the co-expression analysis (degree > 30). Marine blue and orange text color indicates ion channel activity and long non-coding RNA, respectively.
Unique NameGene typeDegree
NRXN3protein_coding50
TPH1protein_coding47
RTN1protein_coding47
PLD5protein_coding45
PCDH10protein_coding45
TRIM36protein_coding44
C8orf34protein_coding44
ACANprotein_coding43
ADAMTSL1protein_coding42
NT5DC3protein_coding41
CNTN1protein_coding40
MRAP2protein_coding39
GPR176protein_coding39
MYH10protein_coding37
ARPP21protein_coding36
CNTN4protein_coding35
GRIA2protein_coding35
HPSE2protein_coding35
RIMS1protein_coding34
CDH8protein_coding34
SUSD5protein_coding33
GABRR1protein_coding33
TMEM30Bprotein_coding33
PCDH11Yprotein_coding33
C1orf87protein_coding32
CYTL1protein_coding32
CARTPTprotein_coding32
IL31RAprotein_coding32
PCDH11Xprotein_coding32
SAMMSONlncRNA32
ST6GAL2protein_coding31
DSC3protein_coding31
FOXCUTlncRNA31
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Igarashi, W.; Takagi, D.; Okada, D.; Kobayashi, D.; Oka, M.; Io, T.; Ishii, K.; Ono, K.; Yamamoto, H.; Okamoto, Y. Bioinformatic Identification of Potential RNA Alterations on the Atrial Fibrillation Remodeling from Human Pulmonary Veins. Int. J. Mol. Sci. 2023, 24, 10501. https://doi.org/10.3390/ijms241310501

AMA Style

Igarashi W, Takagi D, Okada D, Kobayashi D, Oka M, Io T, Ishii K, Ono K, Yamamoto H, Okamoto Y. Bioinformatic Identification of Potential RNA Alterations on the Atrial Fibrillation Remodeling from Human Pulmonary Veins. International Journal of Molecular Sciences. 2023; 24(13):10501. https://doi.org/10.3390/ijms241310501

Chicago/Turabian Style

Igarashi, Wataru, Daichi Takagi, Daigo Okada, Daiki Kobayashi, Miho Oka, Toshiro Io, Kuniaki Ishii, Kyoichi Ono, Hiroshi Yamamoto, and Yosuke Okamoto. 2023. "Bioinformatic Identification of Potential RNA Alterations on the Atrial Fibrillation Remodeling from Human Pulmonary Veins" International Journal of Molecular Sciences 24, no. 13: 10501. https://doi.org/10.3390/ijms241310501

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop