Next Article in Journal
Insights on TAM Formation from a Boolean Model of Macrophage Polarization Based on In Vitro Studies
Next Article in Special Issue
Prior PSMA PET-CT Imaging and Hounsfield Unit Impact on Tumor Yield and Success of Molecular Analyses from Bone Biopsies in Metastatic Prostate Cancer
Previous Article in Journal
Three-Dimensional Imaging for Multiplex Phenotypic Analysis of Pancreatic Microtumors Grown on a Minipillar Array Chip
Previous Article in Special Issue
CT-Based Radiomics Analysis to Predict Malignancy in Patients with Intraductal Papillary Mucinous Neoplasm (IPMN) of the Pancreas
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Identifying Cross-Scale Associations between Radiomic and Pathomic Signatures of Non-Small Cell Lung Cancer Subtypes: Preliminary Results

by
Charlems Alvarez-Jimenez
1,2,
Alvaro A. Sandino
1,
Prateek Prasanna
3,
Amit Gupta
4,
Satish E. Viswanath
2 and
Eduardo Romero
1,*
1
Computer Imaging and Medical Application Laboratory, Universidad Nacional de Colombia, Bogotá 111321, Colombia
2
Department of Biomedical Engineering, Case Western Reserve University, Cleveland, OH 44106, USA
3
Department of Biomedical Informatics, Stony Brook University, Stony Brook, NY 11794, USA
4
Department of Radiology, University Hospitals Cleveland Medical Center, Cleveland, OH 44106, USA
*
Author to whom correspondence should be addressed.
Cancers 2020, 12(12), 3663; https://doi.org/10.3390/cancers12123663
Submission received: 18 September 2020 / Revised: 12 November 2020 / Accepted: 13 November 2020 / Published: 7 December 2020
(This article belongs to the Special Issue Radiomics and Cancers)

Abstract

:

Simple Summary

This work presents initial results for differentiating two major non-small cell lung cancer (NSCLC) subtypes by exploring cross-scale associations between Computed Tomography (CT) images and corresponding digitized pathology images. The analysis comprised three phases, (i) a multi-resolution cell density quantification to identify discriminant pathomic patterns, (ii) radiomic characterization of CT images by using Haralick descriptors, and (iii) quantitative correlation analysis between the multi-modal features to identify potential associations between them. This analysis was carried out using publicly available databases, two digitized pathology and two radiology cohorts. Preliminary but significant cross-scale associations were identified between cell density statistics and CT intensity values using matched specimens, which were used to significantly improve the overall discriminatory performance of radiomic features in differentiating NSCLC subtypes.

Abstract

(1) Background: Despite the complementarity between radiology and histopathology, both from a diagnostic and a prognostic perspective, quantitative analyses of these modalities are usually performed in disconnected silos. This work presents initial results for differentiating two major non-small cell lung cancer (NSCLC) subtypes by exploring cross-scale associations between Computed Tomography (CT) images and corresponding digitized pathology images. (2) Methods: The analysis comprised three phases, (i) a multi-resolution cell density quantification to identify discriminant pathomic patterns for differentiating adenocarcinoma (ADC) and squamous cell carcinoma (SCC), (ii) radiomic characterization of CT images by using Haralick descriptors to quantify tumor textural heterogeneity as represented by gray-level co-occurrences to discriminate the two pathological subtypes, and (iii) quantitative correlation analysis between the multi-modal features to identify potential associations between them. This analysis was carried out using two publicly available digitized pathology databases (117 cases from TCGA and 54 cases from CPTAC) and a public radiological collection of CT images (101 cases from NSCLC-R). (3) Results: The top-ranked cell density pathomic features from the histopathology analysis were correlation, contrast, homogeneity, sum of entropy and difference of variance; which yielded a cross-validated AUC of 0.72 ± 0.02 on the training set (CPTAC) and hold-out validation AUC of 0.77 on the testing set (TCGA). Top-ranked co-occurrence radiomic features within NSCLC-R were contrast, correlation and sum of entropy which yielded a cross-validated AUC of 0.72 ± 0.01. Preliminary but significant cross-scale associations were identified between cell density statistics and CT intensity values using matched specimens available in the TCGA cohort, which were used to significantly improve the overall discriminatory performance of radiomic features in differentiating NSCLC subtypes (AUC = 0.78 ± 0.01). (4) Conclusions: Initial results suggest that cross-scale associations may exist between digital pathology and CT imaging which can be used to identify relevant radiomic and histopathology features to accurately distinguish lung adenocarcinomas from squamous cell carcinomas.

Graphical Abstract

1. Introduction

According to the World Health Organization (WHO), lung cancer was the most common cancer diagnosed worldwide in 2018 [1]. Between 80% to 85% of lung cancers are non-small cell lung cancer (NSCLC), and the most prevalent subtypes are adenocarcinoma (ADC) and squamous cell carcinoma (SCC) [2,3]. ADC is the most commonly diagnosed type of lung cancer and frequently occurs along the outer periphery of the lung [4,5]. SCC comprises 25–30% of lung cancer cases and usually occurs in the central portion of the lung as well as being considered more aggressive than ADC [4]. Other clinicopathologic differences are highlighted in Table S1. Available treatment options include conventional chemotherapy and targeted therapies, and typically differ for ADC and SCC [6,7]. Therefore, early characterization of lung cancer subtypes can be highly relevant for guiding the therapeutic regimen (curative or palliative [8]). The key modalities in the lung cancer clinical protocol remain standard-of-care computed tomography (CT) imaging (acquired at diagnosis) as well as tissue biopsy specimens (acquired for disease confirmation); both typically requiring subjective expert evaluation in the clinical workflow [9].
Towards developing more objective measurements and approaches for these modalities, there has been a recent focus on developing radiomic and pathomic strategies. ‘Radiomics’ refers to the automated extraction of quantitative information associated with manifestations of pathologies on a radiographic imaging scale, specifically imaging attributes that are not visually identifiable [10,11]. Radiomic attributes have been demonstrated to be correlated with disease risk in prostate [12,13,14], breast [15,16,17], lung [9,18,19,20], brain [21,22], colorectal [23,24,25], renal cell carcinoma [26], and head and neck [27,28] cancers. Specifically, radiomic approaches to discriminate ADC from SCC on CT imaging primarily build upon morphometric and textural features [9,29,30,31,32]. ’Pathomics’ involves computational characterization of complex patterns on digitized histopathology slides [33,34], whole slide images (WSI), to describe diverse phenotypic characteristics of cancer and patient risk in a variety of organs [35,36], including lung [37,38]. On hematoxylin and eosin (H&E) digitized slides of lung cancer specimens, several pathomics strategies based on topological [39], morphological [40], and convolutional neural networks (CNNs) strategies, have been used to capture complex patterns characterizing different pathological subtypes [41,42]. As CT and digitized pathology offer complementary sources of information about the tumor in vivo, a natural question is whether these radiomics and pathomics features might be connected.
There have been recent efforts on identifying cross-scale associations between radiology and pathology scales in different cancers [43,44,45,46,47,48], albeit not specifically for the question of distinguishing ADC from SCC. While ADC is characterized by malign glandular structures and mucin, it grows following different patterns, including acinar, lepidic, papillary, micropapillary and solid [49,50,51]. By contrast, SCC infiltrates the peri-bronchial tissue and grows resembling a cauliflower [50,51]. An optimal computational strategy to differentiate these two lung cancer subtypes on CT should therefore be designed to exploit the underlying architectural differences in the internal organization of cell populations on pathology images.
This study presents initial results for a novel pathomic-radiomic association approach, i.e., identifying pathomic features from digitized histopathology that potentially reflect the tissue composition basis of radiomic descriptors from CT, towards improving the understanding and ability to discriminate ADC from SCC. The underlying hypothesis is that pathomics features that can differentiate ADC from SCC could be used to determine a pathologic or morphologic basis for radiomic expression patterns on CT. The presented approach will involve first evaluating co-occurrence-based texture representations of (a) cellular density on pathology, and (b) CT intensity values on imaging; for differentiating ADC from SCC using each modality independently. Finally, a preliminary exploration of the cross-scale relationships between the radiomic and pathomic representations will be tested by systematically comparing correlations between them.

2. Materials and Methods

2.1. Digitized Pathology Data

A total of 171 patients were included in this study from two public datasets: (i) The National Cancer Institute’s Clinical Proteomic Tumor Analysis Consortium (CPTAC) [52,53], and (ii) The Cancer Genome Atlas (TCGA) and Genomic Data Commons Data Portal [54]. The CPTAC dataset comprised 117 studies while 54 studies were obtained from TCGA. Here, the CPTAC database was used for training while hold-out testing was performed with the TCGA database. Each study had at least one whole slide specimen available that had been stained with hematoxylin and eosin (H&E) and digitized in the SVS format. Table 1 summarizes the scanning parameters and the study population curated for use in this study, based on information reported in the public databases.

2.2. Computed Tomography Imaging Data

A total of 146 patients were included in this investigation from two public datasets: (i) NSCLC-Radiomics, Maastro Lung1 dataset (NSCLC-R) [55] whose cases were collected between 2004 and 2014 at the Maastro Clinic in The Netherlands, and (ii) The Cancer Genome Atlas (TCGA) and Genomic Data Commons Data Portal [54] with cases captured between 1988 and 2005 from many sites all over the world. For each study, a computed tomography (CT) scan is available, for which the scanner, imaging, and population parameters are summarized in Table 2. Note that the CT scans in the TCGA dataset correspond to the same patients for whom WSI images were curated (Section 2.1). Pathologic subtype diagnosis for all CT scans was determined based on information reported in each database.
The NSCLC-R dataset included the associated tumor expert annotations, while for the TCGA dataset, manual delineation of the primary gross tumor volume was performed by an expert radiologist (AG). Crucially, there were important differences between the two databases; while NSCLC-R was quite homogeneous, TCGA studies were highly variable in terms of imaging details. Specifically, NSCLC-R studies were acquired with two different scanner manufacturers but with a similar in-plane resolution, slice thickness, field of view, and patient position across all studies. By contrast, most of these imaging parameters are highly variable in TCGA studies because of the 17-year period of acquisition and concomitant inter-site differences in voxel size, contrast, and imaging sequence. Consequently, classification experiments were performed with the NSCLC-R database alone, while the TCGA database was used to establish correlations between histopathological and radiological attributes.

2.3. Computing Cellular Density Map on Histopathology Slides

A patch-based strategy was employed to analyze each WSI. First, a histopathology grid with patches of 1 mm2 was constructed (equivalent to 2000 × 2000 pixels for a 20× magnification), which was in turn divided into tiles of 50 μm2. The cellular density was estimated in each of these tiles by segmenting nuclei [56] and assigning a gray level to each of these tiles based on the number of nuclei estimated in each tile. This resulted in a spatial map of the cellular density on the digitized histopathology, termed the cell density map. Figure 1b,e illustrates the associated cell density map for the histopathology patches in Figure 1a,d. In addition, Figure 1c,f depicts the tumor patch from the CT scan of the same patient. The top row shows a representative ADC while the second corresponds to a SCC study.

2.4. Pathomics Feature Extraction From Cellular Density Map

Although SCC is characterized by an accelerated cell growth compared to ADC [57] (and thus an increased cell density), first order statistics of the cell density map by themselves may not suffice to differentiate between these two lung cancer subtypes. Therefore, the spatial distribution of cell density on each WSI was computed using Haralick features extracted at three different scales, as illustrated in Figure 2. From each matrix of cell density values, two co-occurrence matrices were calculated per scale (horizontal-vertical and diagonal directions), to estimate how often a particular cell density value co-occurred with its neighbors. For each co-occurrence matrix, 12 Haralick texture features [58] were computed to obtain a feature vector of 72 descriptors per patch. To obtain a feature vector per case, considering that each WSI is represented by a different number of patches, five statistics (mean, median, variance, kurtosis, and skewness) were computed across the entire set of patches and each of the 72 descriptors. Therefore, a 360 × 1 pathomic feature vector was used to describe each study.

2.5. Radiomics Feature Extraction from CT Images

For radiomic feature extraction, the region of interest (ROI) was defined as a sub-volume of the largest annotated 2D tumor section together with the two adjacent Sections (3 consecutive 2D sections in total), as illustrated in Figure 3. This was the smallest sub-volume consistently available for all patients. Per section, 12 Haralick textural features were computed from two co-occurrence matrices (horizontal-vertical and diagonal directions) that had been computed based on CT image intensity values. Five statistics were then calculated to characterize the texture feature distribution in the CT sub-volume, resulting in 120 radiomic texture descriptors per study.

2.6. Experimental Design

2.6.1. Experiment 1: Identifying Pathomics Features to Differentiate ADC from SCC

To analyze the spatial distribution of cell density values across histopathology tiles, normalized co-occurrence matrices from cellular density maps were visualized as heatmaps where red values corresponded to higher likelihood of co-occurrence. Two sets of pathomics features were evaluated for differentiating the two non-small cell lung cancer subtypes, one using a set of Haralick-based pathomic descriptors (from the co-occurrence matrices, denoted FPH) as well as using the five statistics (mean, median, standard deviation, skewness and kurtosis) computed from the cell density distribution (denoted FPS). The CPTAC database was used as the training cohort to identify and evaluate these feature sets through a randomized 10-fold cross validation setting, with further validation using the TCGA database in a hold-out setting.

2.6.2. Experiment 2: Identifying Radiomics Features to Differentiate ADC from SCC

This experiment was carried out to assess the ability of radiomics features to discriminate ADC from SCC. Two classification tests were performed in the NSCLC-R dataset under a randomized 10-fold cross validation scheme, first with the five statistics computed from image intensities within the tumor ROI (denoted FRS), and next using a set of Haralick-based radiomic descriptors (denoted FRH).

2.6.3. Experiment 3: Exploratory Identification of Cross-Scale Pathomic-Radiomic Associations

This experiment was conducted to preliminarily identify and evaluate pathomic-radiomic associations, i.e., a set of correlated pathomics and radiomics features. The first test evaluated the correlation between cell density attributes and image intensities using the TCGA cohort (where both radiology and histopathology images were available), i.e., between FPS and FRS. Next, Haralick-based radiomic descriptors were limited to the specific feature types that had been identified as relevant in Experiment 1 (i.e., at the pathologic scale). This subset was then evaluated to discriminate ADC and SCC at the radiological scale and compared in performance to the descriptors identified in Experiment 2. This test was carried out using the NSCLC-R dataset under a 10-fold cross validation scheme.

2.7. Statistical Analysis

The selection of pathomics and radiomics features was performed as follows: (i) feature normalization was applied to ensure that all extracted descriptors were within the same range of values by subtracting the mean and dividing by the standard deviation, and (ii) discriminant features were selected using a combination of significance and correlation testing, to ensure feature relevance while removing potentially redundant and non-informative descriptors.
Each set of top-ranked radiomic and pathomic features were individually evaluated by a classification task at discriminating ADC from SCC using a support vector machine (SVM) model with a linear kernel. Model performance was quantified in terms of the area-under-receiver-operator-curve (AUC), together with confidence intervals.
Correlations were statistically evaluated as follows: (i) the correlation coefficient and the associated p-values were computed for each pair of FPS and FRS features using Spearman’s rank correlation test (to ensure robustness to outliers while quantifying monotonicity [59]), followed by (ii) identifying significant correlations using False Discovery Rate (FDR) correction for multiple hypotheses testing [60], with statistical significance when FDR ≤ 0.01. This threshold was selected to reflect the preliminary nature of this study.
Additional experiments were conducted to evaluate the robustness of the different feature sets as well as the cross-scale associations. Top-ranked pathomics features identified in Experiment 1 were assessed for statistical significance at different tile sizes (between 50 μm2, 33.3 μm2, and 25 μm2) via Wilcoxon ranksum testing while comparing ADC vs SCC in the CPTAC database. Similarly, top-ranked radiomic features from Experiment 2 were assessed for differences between the two manufacturers (Siemens, CMS) in the NSCLC-R cohort via Wilcoxon ranksum testing. Finally, as the number of patches available from each WSI varied by dataset (6–67 patches/dataset), trends in Spearman’s rank correlation testing between pathomic-radiomic features were re-evaluated within 2 subsets: for cases with <15 patches/study and for cases with ≥15 patches/study.

3. Results

3.1. Experiment 1: Identifying Pathomic Features to Differentiate ADC from SCC

Heat maps of normalized co-occurrences computed from cell density maps illustrate that ADC (Figure 4a,b) show a higher and more concentrated co-occurrences of cell density values between adjacent tiles. The opposite trend is observed in SCC (Figure 4c,d), i.e., co-ocurrences are more distributed with a smoother changes in cell density values between adjacent tiles. Correspondingly, Figure 5a–f shows the box and whisker plots of the top-ranked Haralick-based pathomics features that comprise FPH, together with their p-values from Wilcoxon ranksum testing between ADC (blue) and SCC (yellow); in both TCGA and CPTAC cohorts.
Using the six top-ranked pathomics features in FPH, the SVM classifier with a linear kernel resulted in a training AUC of 0.72 ± 0.02 (95% confidence interval: 0.65–0.77) and a testing AUC of 0.77. This corresponded to an accuracy of 0.69 ± 0.03 (95% confidence interval: 0.62–0.74) in the training set and a 0.74 accuracy in the testing set (0.72 sensitivity, 0.76 specificity). By contrast, when considering F P S , the linear SVM yielded a significantly lower training AUC of 0.62 ± 0.02 (CPTAC, 95% confidence interval: 0.55–0.68) and a testing AUC of 0.34 (TCGA).
Figure S1 shows the box plots for each of the three different tile sizes: 50 μm2, 33.3 μm2 and 25 μm2. These boxplots demonstrate no significant differences (all p > 0.05, Wilcoxon ranksum test) in any of the top-ranked Haralick-based pathomics features between tile sizes.

3.2. Experiment 2: Identifying Radiomic Features to Differentiate ADC from SCC

Figure 6a–f shows the box and whisker plots of the top-ranked radiomics descriptors that comprise F R H identified via cross-validation in the NSCLC-R dataset, together with their p-values in Wilcoxon ranksum testing between ADC (blue) and SCC (yellow). Using this set of discriminant radiomics descriptors, the SVM model yielded a cross-validated training AUC of 0.72 ± 0.01 (95% confidence interval: 0.65–0.77). This corresponded to an accuracy of 0.69 ± 0.01 (95% confidence interval: 0.61–0.74) in the NSCLC-R database, a sensitivity of 0.67 ± 0.01 (95% confidence interval: 0.59–0.73), and a specificity of 0.71 ± 0.02 (95% confidence interval: 0.63–0.76). By comparison, F R S yielded a significantly lower cross-validated AUC of 0.51 ± 0.04 (95% confidence interval: 0.43–0.58).
Figure S2 shows the box plots comparing top-ranked radiomic features between Siemens and CMS scans, demonstrating no significant differences in any of the top-ranked Haralick-based radiomic features between the manufacturers.

3.3. Experiment 3: Exploratory Identification of Pathomic-Radiomic Associations for Differentiating ADC from SCC

Figure 7 visualizes the correlation coefficients between F P S and F R S as well as adjusted p-values (after FDR correction) as heatmaps; for ADC and SCC separately. As observed in Figure 7a,c, six pairs of features showed significant cross-scale associations with correlation coefficients between 0.51 and 0.61 (denoted by *, adjusted p-values after FDR correction in Figure 7b,d). To further evaluate these relationships, Figure 8 illustrates scatter plots for each of these six pairs of features (pathomics features on X-axes, radiomics features on Y-axes) with corresponding histograms along the border of each plot.
Figure S3 additionally illustrates the correlation coefficients for two sub-groups: cases with <15 (top) and ≥15 (bottom) patches/dataset. While ADC shows a similar trend in correlation coefficients between Figure 7a and Figure S3a,b, SCC trends are only largely consistent between Figure 7c and Figure S3b but Figure S3c is slightly different (likely due to the fewer number of cases available in this sub-group).
When limiting F R H to the feature types identified in Experiment 1, classification performance significantly improved to an AUC of 0.78 ± 0.01 (95% confidence interval: 0.70–0.82), compared to Experiment 2. Figure 9 depicts the top-ranked radiomic features identified in this experiment as heatmaps (Haralick correlation and homogeneity) for representative ADC and SCC lung tumors.

4. Discussion

The association between certain spatial patterns of sub-visual information on radiographic or pathologic images with disease outcomes [10,20,33,34] has seen wide evaluation using radiomics and pathomics approaches. In this work, we presented initial results for evaluating cross-scale associations between radiomic and pathomic descriptors; these features were subsequently applied to distinguish the two major subtypes of non-small cell lung cancer, ADC and SCC. Despite the differences in scale, radiology and pathology datasets are complementary in nature, with both forming an integral component in most clinical decision making scenarios in oncology. The underlying relationship between the two scales of information is thus highly relevant to improving the understanding of the disease as well as the patient management. In this context, the goal of this work was to present preliminary results of quantitatively correlating features from these two image modalities to more accurately interrogate disease phenotypes in lung cancers.
The first part of this study was devoted to exploring how pathomics and radiomics features independently discriminated NSCLC subtypes. At both scales, using statistical measures did not yield good classification performance. By contrast, top-ranked co-occurrence-based radiomic (based on image intensities) and pathomic (based on cell density values) features provided a consistently higher performance in discriminating ADC from SCC. On histology, when compared with ADC, SCC was characterized by lower values of mean contrast, and lower values of both standard deviation of variance and homogeneity; suggesting fewer cell density variations in this type of cancer. In addition, ADC exhibited lower median correlation than SCC, suggesting SCC exhibits higher homogeneity in pathomic textures of cell density. On CT, when compared with SCC, ADC was described by lower values of standard deviation of homogeneity, entropy, and energy; indicating a markedly lower intensity variation in this NSCLC subtype. In addition, ADC exhibited higher standard deviation of correlation than SCC, suggesting poorly defined horizontal or vertical patterns in intensity co-occurrences. Top-ranked radiomics and pathomics features were also robust and maintained their statistical trends when evaluated between variations in underlying parameters or acquisition differences. These findings resonate with previous studies on lung cancer where investigations have either quantitatively described histopathological attributes [39,40], or have described visual patterns on CT [9,29,32].
The classifier performance of our radiomics and pathomics strategies were comparable to previous efforts in the literature. Linning et al. [31], Haga et al. [29] and Wu et al. [9] investigated associations between radiomic features such as tumor shape-size, intensity statistics, and texture, with histologic subtypes (ADC and SCC) and reported an AUC of 0.73 , 0.72 and 0.80 , respectively. Bashir et al. [30] analyzed the influence of texture and semantic descriptors; a random forest model with semantic features provided an AUC of 0.82 in an independent test set (n = 100). Using wavelet-based features, Zhu et al. [32] reported an AUC of 0.89 in differentiating the two classes using CT scans. At a histopathologic level, using digitized H&E slides, several pathomics strategies have studied topological [39] and morphological [40] features and reported classification results with up to an AUC of 0.95 . More recently, convolutional neural networks (CNNs) have been applied to capture complex patterns characterizing different pathological entities [41,42], reporting an AUC of 0.97 . While our pathomics features yielded a lower AUC overall, this may have been because our study primarily limited itself to evaluating spatial micro-patterns of cell density rather than more complex features (to ensure a similarity in formulation with radiomic features).
Preliminary pathomic-radiomic correlation analysis was conducted between cell density values (from pathology) and image intensity values (from CT scans) to identify a set of relationships between the modalities. In ADC, the means and medians of cell densities and image intensities were found to be significantly correlated. However, in SCC, CT intensity mean and median was found to be correlated with standard deviation of the cell density. While it is difficult to demonstrate a causal relationship between these features, we can hypothesize about the underlying connections between these features. For instance, homogeneous cell density (histopathology) appears to be associated with lower intensity CT values (radiology). Only a few works have similarly correlated radiology and histopathology information, and efforts have been rather limited to establishing qualitative clinical correlation. In the context of lung cancer, Ganeshan et al. [61] obtained CT images from a set of patients with NSCLC who later underwent an intravenous administration of a marker of tumor hypoxia and angiogenesis. They then assessed correlations between the CT-derived radiomic texture features and the histopathologic markers. They demonstrated radiomic features capture heterogeneity and have the potential to act as imaging correlates for tumor hypoxia and angiogenesis. Snoeckx et al. [62] studied visual pathology-radiology concordance by establishing a level of agreement between findings of radiologists and pathologists in the task of classifying pulmonary nodules into solid or subsolid phenotypes. Lederlin et al. [47] evaluated associations between qualitative CT morphological parameters (tumor shape, sphericity, location, margins and attenuation) and histomorphological growth patterns of pulmonary ADCs (lepidic, acinar, papillary, micropapillary and solid), reporting a set of specific CT features predictive of the histomorphological tumor growth patterns of pulmonary ADC. Most recently, Khorrami et al. [48] identified which radiomic patterns were associated with response and survival to immunotherapy for lung cancers, and determined that specific Gabor responses in the peritumoral region were associated with the density of tumor-infiltrating lymphocytes on diagnostic biopsy samples from these patients. Unlike these works, our study utilized similar formulations of co-occurrence features to establish potential associations between computational textural patterns on both radiology and digitized pathology. These associations were also largely consistent for each NSCLC subtype when evaluated between parameter variations. Despite the limited set of data, results suggest these associations could support clinical decisions. Such models are needed to map each modality onto the other and enrich a final radiology-pathology microscopic representation. To further demonstrate the advantage of exploiting such a pathomic-radiomic association, top-ranked co-occurrence descriptors identified at the pathologic scale were found to yield an improved performance in differentiating lung cancer subtypes when extracted at the radiologic scale.
We do acknowledge some shortcomings of our study. While we utilized large public databases, our final cohort sizes were still limited (n = 171 pathology speciments, n = 146 radiology datasets). However, we ensured our analysis was conducted as rigorously as possible with cross-validation as well as hold-out validation where possible. The radiology-pathology associations evaluated in this study only leveraged a subset of the total datasets considered, which is testament to the difficulty in collecting such matched specimens. We attempted to account for these limited numbers by appropriately adjusting the false discovery rate threshold. Finally, we only evaluated co-occurrence based features in this study rather than more complex features (e.g., architectural, topological from pathology; wavelet, gradients from radiology). This was done in order to be able to directly translate descriptors from one scale to the next as done in our pathomic-radiomic analysis experiments. Our framework may be used to study associations between an expanded suite of radiomic and pathomic measurements, as well as extended to characterizing other pathologic subtypes of lung cancers.

5. Conclusions

In this study, preliminary results were presented for a novel pathomic-radiomic correlation approach to interrogate cross-scale associations between cellular density and image heterogeneity across radiology and histopathology. The present investigation has demonstrated that spatial patterns of tumor heteogeneity may be shared between these modalities, as quantified via radiomics and pathomics features. Exploratory analysis showed that these associations may be further exploited to significantly improve classifier performance and quantitative characterization of lung adenocarcinoma from squamous cell carcinoma. Future work will involve validating our findings and approach on a larger cohort of data, as well as extending this approach to other cancers.

Supplementary Materials

The following are available online at https://www.mdpi.com/2072-6694/12/12/3663/s1, Table S1: Differentiating features between adenocarcinomas and squamous cell carcinomas with respect to demographics, routine imaging and pathology, Figure S1: Box plots (af) comparing the six top-ranked pathomics features at different tile sizes (50 μm2, 33.3 μm2 and 25 μm2), between ADC (blue) to SCC (yellow). Box plots are presented for the CPTAC database, with no significant differences observed between tile sizes, Figure S2: Box plots (af) of the six top-ranked radiomics features comparing between CT manufacturers, i.e., Siemens (blue) to CMS (red). Box plots are presented for the NSCLC-R dataset, showing no significant differences between the manufacturer sub-groups, Figure S3: Heatmap visualizations for (a,d) Spearman’s rank correlation coefficients between pathologic cell density and radiologic intensity statistics (red = negative correlation, blue = positive correlation). (a,c) correspond to cases with <15 patches/study while (b,d) correspond to those with ≥15 patches/study.

Author Contributions

Conceptualization, C.A.-J., A.A.S., P.P., S.E.V. and E.R.; methodology, C.A.-J., A.A.S., P.P., S.E.V. and E.R.; software, C.A.-J. and A.A.S.; validation, C.A.-J., A.A.S., P.P., S.E.V. and E.R.; formal analysis, C.A.-J., A.A.S., P.P., S.E.V. and E.R.; investigation, C.A.-J., A.A.S., P.P., S.E.V. and E.R.; resources, S.E.V. and E.R.; data curation, C.A.-J., A.A.S., P.P. and A.G.; writing—original draft preparation, C.A.-J. and E.R.; writing—review and editing, E.R., P.P. and S.E.V.; visualization, C.A.-J. and A.A.S.; supervision, P.P., S.E.V. and E.R.; project administration, E.R.; funding acquisition, S.E.V. All authors have read and agreed to the published version of the manuscript.

Funding

Research reported in this publication was supported by the National Cancer Institute of the National Institutes of Health under award numbers U01CA248226-01, the CDMRP Peer Reviewed Cancer Research Program (W81XWH-16-1-0329), and the iCorps@Ohio program. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health, the Department of Defense, or the United States Government.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:
NSCLCNon-small cell lung cancer
ADCAdenocarcinoma
SCCSquamous cell carcinoma
HVHorizonal-vertical directions
DDiagonal direction
SVMSupport vector machine
CTComputer tomography
FDRFalse discovery rate

References

  1. Bray, F.; Ferlay, J.; Soerjomataram, I.; Siegel, R.L.; Torre, L.A.; Jemal, A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA A Cancer J. Clin. 2018, 68, 394–424. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Blandin Knight, S.; Crosbie, P.A.; Balata, H.; Chudziak, J.; Hussell, T.; Dive, C. Progress and prospects of early detection in lung cancer. Open Biol. 2017, 7, 170070. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  3. Travis, W.D.; Brambilla, E.; Noguchi, M.; Nicholson, A.G.; Geisinger, K.R.; Yatabe, Y.; Beer, D.G.; Powell, C.A.; Riely, G.J.; Van Schil, P.E.; et al. International association for the study of lung cancer/american thoracic society/european respiratory society international multidisciplinary classification of lung adenocarcinoma. J. Thorac. Oncol. 2011, 6, 244–285. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  4. Wang, Z.; Li, M.; Huang, Y.; Ma, L.; Zhu, H.; Kong, L.; Yu, J. Clinical and radiological characteristics of central pulmonary adenocarcinoma: A comparison with central squamous cell carcinoma and small cell lung cancer and the impact on treatment response. OncoTargets Ther. 2018, 11, 2509. [Google Scholar] [CrossRef] [Green Version]
  5. Zappa, C.; Mousa, S.A. Non-small cell lung cancer: Current treatment and future advances. Transl. Lung Cancer Res. 2016, 5, 288–300. [Google Scholar] [CrossRef] [Green Version]
  6. Ma, Y.; Feng, W.; Wu, Z.; Liu, M.; Zhang, F.; Liang, Z.; Cui, C.; Huang, J.; Li, X.; Guo, X. Intra-tumoural heterogeneity characterization through texture and colour analysis for differentiation of non-small cell lung carcinoma subtypes. Phys. Med. Biol. 2018, 63, 165018. [Google Scholar] [CrossRef]
  7. Asamura, H.; Goya, T.; Koshiishi, Y.; Sohara, Y.; Eguchi, K.; Mori, M.; Nakanishi, Y.; Tsuchiya, R.; Shimokata, K.; Inoue, H.; et al. A Japanese Lung Cancer Registry study: Prognosis of 13,010 resected lung cancers. J. Thorac. Oncol. 2008, 3, 46–52. [Google Scholar] [CrossRef] [Green Version]
  8. Reck, M.; Rabe, K.F. Precision diagnosis and treatment for advanced non–small-cell lung cancer. N. Engl. J. Med. 2017, 377, 849–861. [Google Scholar] [CrossRef] [Green Version]
  9. Wu, W.; Parmar, C.; Grossmann, P.; Quackenbush, J.; Lambin, P.; Bussink, J.; Mak, R.; Aerts, H.J. Exploratory study to identify radiomics classifiers for lung cancer histology. Front. Oncol. 2016, 6, 71. [Google Scholar] [CrossRef] [Green Version]
  10. Lambin, P.; Leijenaar, R.T.; Deist, T.M.; Peerlings, J.; De Jong, E.E.; Van Timmeren, J.; Sanduleanu, S.; Larue, R.T.; Even, A.J.; Jochems, A.; et al. Radiomics: The bridge between medical imaging and personalized medicine. Nat. Rev. Clin. Oncol. 2017, 14, 749–762. [Google Scholar] [CrossRef]
  11. Gillies, R.J.; Kinahan, P.E.; Hricak, H. Radiomics: Images are more than pictures, they are data. Radiology 2016, 278, 563–577. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  12. Gnep, K.; Fargeas, A.; Gutiérrez-Carvajal, R.E.; Commandeur, F.; Mathieu, R.; Ospina, J.D.; Rolland, Y.; Rohou, T.; Vincendeau, S.; Hatt, M.; et al. Haralick textural features on T2-weighted MRI are associated with biochemical recurrence following radiotherapy for peripheral zone prostate cancer. J. Magn. Reson. Imaging 2017, 45, 103–117. [Google Scholar] [CrossRef] [PubMed]
  13. Wibmer, A.; Hricak, H.; Gondo, T.; Matsumoto, K.; Veeraraghavan, H.; Fehr, D.; Zheng, J.; Goldman, D.; Moskowitz, C.; Fine, S.W.; et al. Haralick texture analysis of prostate MRI: Utility for differentiating non-cancerous prostate from prostate cancer and differentiating prostate cancers with different Gleason scores. Eur. Radiol. 2015, 25, 2840–2850. [Google Scholar] [CrossRef]
  14. Khalvati, F.; Wong, A.; Haider, M.A. Automated prostate cancer detection via comprehensive multi-parametric magnetic resonance imaging texture feature models. BMC Med. Imaging 2015, 15, 27. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  15. Li, H.; Zhu, Y.; Burnside, E.S.; Drukker, K.; Hoadley, K.A.; Fan, C.; Conzen, S.D.; Whitman, G.J.; Sutton, E.J.; Net, J.M.; et al. MR imaging radiomics signatures for predicting the risk of breast cancer recurrence as given by research versions of MammaPrint, Oncotype DX, and PAM50 gene assays. Radiology 2016, 281, 382–391. [Google Scholar] [CrossRef] [Green Version]
  16. Rahbar, H.; McDonald, E.S.; Lee, J.M.; Partridge, S.C.; Lee, C.I. How can advanced imaging be used to mitigate potential breast cancer overdiagnosis? Acad. Radiol. 2016, 23, 768–773. [Google Scholar] [CrossRef] [Green Version]
  17. Wang, J.; Kato, F.; Oyama-Manabe, N.; Li, R.; Cui, Y.; Tha, K.K.; Yamashita, H.; Kudo, K.; Shirato, H. Identifying triple-negative breast cancer using background parenchymal enhancement heterogeneity on dynamic contrast-enhanced MRI: A pilot radiomics study. PLoS ONE 2015, 10, e0143308. [Google Scholar] [CrossRef]
  18. Coroller, T.P.; Grossmann, P.; Hou, Y.; Velazquez, E.R.; Leijenaar, R.T.; Hermann, G.; Lambin, P.; Haibe-Kains, B.; Mak, R.H.; Aerts, H.J. CT-based radiomic signature predicts distant metastasis in lung adenocarcinoma. Radiother. Oncol. 2015, 114, 345–350. [Google Scholar] [CrossRef]
  19. Depeursinge, A.; Yanagawa, M.; Leung, A.N.; Rubin, D.L. Predicting adenocarcinoma recurrence using computational texture models of nodule components in lung CT. Med. Phys. 2015, 42, 2054–2063. [Google Scholar] [CrossRef] [Green Version]
  20. Thawani, R.; McLane, M.; Beig, N.; Ghose, S.; Prasanna, P.; Velcheti, V.; Madabhushi, A. Radiomics and radiogenomics in lung cancer: A review for the clinician. Lung Cancer 2018, 115, 34–41. [Google Scholar] [CrossRef]
  21. Kickingereder, P.; Burth, S.; Wick, A.; Götz, M.; Eidel, O.; Schlemmer, H.P.; Maier-Hein, K.H.; Wick, W.; Bendszus, M.; Radbruch, A.; et al. Radiomic profiling of glioblastoma: Identifying an imaging predictor of patient survival with improved performance over established clinical and radiologic risk models. Radiology 2016, 280, 880–889. [Google Scholar] [CrossRef] [PubMed]
  22. Tiwari, P.; Prasanna, P.; Wolansky, L.; Pinho, M.; Cohen, M.; Nayate, A.; Gupta, A.; Singh, G.; Hatanpaa, K.; Sloan, A.; et al. Computer-extracted texture features to distinguish cerebral radionecrosis from recurrent brain tumors on multiparametric MRI: A feasibility study. Am. J. Neuroradiol. 2016, 37, 2231–2236. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  23. Antunes, J.T.; Ofshteyn, A.; Bera, K.; Wang, E.Y.; Brady, J.T.; Willis, J.E.; Friedman, K.A.; Marderstein, E.L.; Kalady, M.F.; Stein, S.L.; et al. Radiomic Features of Primary Rectal Cancers on Baseline T2-Weighted MRI Are Associated With Pathologic Complete Response to Neoadjuvant Chemoradiation: A Multisite Study. J. Magn. Reson. Imaging 2020. [Google Scholar] [CrossRef] [PubMed]
  24. Liang, C.; Huang, Y.; He, L.; Chen, X.; Ma, Z.; Dong, D.; Tian, J.; Liang, C.; Liu, Z. The development and validation of a CT-based radiomics signature for the preoperative discrimination of stage I-II and stage III-IV colorectal cancer. Oncotarget 2016, 7, 31401. [Google Scholar] [CrossRef] [Green Version]
  25. Huang, Y.Q.; Liang, C.H.; He, L.; Tian, J.; Liang, C.S.; Chen, X.; Ma, Z.L.; Liu, Z.Y. Development and Validation of a Radiomics Nomogram for Preoperative Prediction of Lymph Node Metastasis in Colorectal Cancer. J. Clin. Oncol. 2016, 34, 2157–2164. [Google Scholar] [CrossRef]
  26. Antunes, J.; Viswanath, S.; Rusu, M.; Valls, L.; Hoimes, C.; Avril, N.; Madabhushi, A. Radiomics analysis on FLT-PET/MRI for characterization of early treatment response in renal cell carcinoma: A proof-of-concept study. Transl. Oncol. 2016, 9, 155–162. [Google Scholar] [CrossRef] [Green Version]
  27. Parmar, C.; Grossmann, P.; Rietveld, D.; Rietbergen, M.M.; Lambin, P.; Aerts, H.J. Radiomic machine-learning classifiers for prognostic biomarkers of head and neck cancer. Front. Oncol. 2015, 5, 272. [Google Scholar] [CrossRef] [Green Version]
  28. Parmar, C.; Leijenaar, R.T.; Grossmann, P.; Velazquez, E.R.; Bussink, J.; Rietveld, D.; Rietbergen, M.M.; Haibe-Kains, B.; Lambin, P.; Aerts, H.J. Radiomic feature clusters and prognostic signatures specific for lung and head & neck cancer. Sci. Rep. 2015, 5, 11044. [Google Scholar]
  29. Haga, A.; Takahashi, W.; Aoki, S.; Nawa, K.; Yamashita, H.; Abe, O.; Nakagawa, K. Classification of early stage non-small cell lung cancers on computed tomographic images into histological types using radiomic features: Interobserver delineation variability analysis. Radiol. Phys. Technol. 2018, 11, 27–35. [Google Scholar] [CrossRef]
  30. Bashir, U.; Kawa, B.; Siddique, M.; Mak, S.M.; Nair, A.; Mclean, E.; Bille, A.; Goh, V.; Cook, G. Non-invasive classification of non-small cell lung cancer: A comparison between random forest models utilising radiomic and semantic features. Br. J. Radiol. 2019, 92, 20190159. [Google Scholar] [CrossRef]
  31. Linning, E.; Lu, L.; Li, L.; Yang, H.; Schwartz, L.H.; Zhao, B. Radiomics for classifying histological subtypes of lung cancer based on multiphasic contrast-enhanced computed tomography. J. Comput. Assist. Tomogr. 2019, 43, 300–306. [Google Scholar]
  32. Zhu, X.; Dong, D.; Chen, Z.; Fang, M.; Zhang, L.; Song, J.; Yu, D.; Zang, Y.; Liu, Z.; Shi, J.; et al. Radiomic signature as a diagnostic factor for histologic subtype classification of non-small cell lung cancer. Eur. Radiol. 2018, 28, 2772–2778. [Google Scholar] [CrossRef] [PubMed]
  33. Bera, K.; Schalper, K.A.; Rimm, D.L.; Velcheti, V.; Madabhushi, A. Artificial intelligence in digital pathology—New tools for diagnosis and precision oncology. Nat. Rev. Clin. Oncol. 2019, 16, 703–715. [Google Scholar] [CrossRef] [PubMed]
  34. Gupta, R.; Kurc, T.; Sharma, A.; Almeida, J.S.; Saltz, J. The emergence of pathomics. Curr. Pathobiol. Rep. 2019, 7, 73–84. [Google Scholar] [CrossRef] [Green Version]
  35. Tran, W.T.; Jerzak, K.; Lu, F.I.; Klein, J.; Tabbarah, S.; Lagree, A.; Wu, T.; Rosado-Mendez, I.; Law, E.; Saednia, K.; et al. Personalized breast cancer treatments using artificial intelligence in radiomics and pathomics. J. Med. Imaging Radiat. Sci. 2019, 50, S32–S41. [Google Scholar] [CrossRef] [Green Version]
  36. Saltz, J.; Almeida, J.; Gao, Y.; Sharma, A.; Bremer, E.; DiPrima, T.; Saltz, M.; Kalpathy-Cramer, J.; Kurc, T. Towards generation, management, and exploration of combined radiomics and pathomics datasets for cancer research. AMIA Summits Transl. Sci. Proc. 2017, 2017, 85. [Google Scholar]
  37. Corredor, G.; Wang, X.; Zhou, Y.; Lu, C.; Fu, P.; Syrigos, K.; Rimm, D.L.; Yang, M.; Romero, E.; Schalper, K.A.; et al. Spatial architecture and arrangement of tumor-infiltrating lymphocytes for predicting likelihood of recurrence in early-stage non–small cell lung cancer. Clin. Cancer Res. 2019, 25, 1526–1534. [Google Scholar] [CrossRef] [Green Version]
  38. Wang, X.; Janowczyk, A.; Zhou, Y.; Thawani, R.; Fu, P.; Schalper, K.; Velcheti, V.; Madabhushi, A. Prediction of recurrence in early stage non-small cell lung cancer using computer extracted nuclear features from digital H&E images. Sci. Rep. 2017, 7, 1–10. [Google Scholar]
  39. Yao, J.; Ganti, D.; Luo, X.; Xiao, G.; Xie, Y.; Yan, S.; Huang, J. Computer-assisted diagnosis of lung cancer using quantitative topology features. In International Workshop on Machine Learning in Medical Imaging; Springer: Berlin/Heidelberg, Germany, 2015; pp. 288–295. [Google Scholar]
  40. Wang, C.W.; Yu, C.P. Automated morphological classification of lung cancer subtypes using H&E tissue images. Mach. Vis. Appl. 2013, 24, 1383–1391. [Google Scholar]
  41. Khosravi, P.; Kazemi, E.; Imielinski, M.; Elemento, O.; Hajirasouliha, I. Deep convolutional neural networks enable discrimination of heterogeneous digital pathology images. EBioMedicine 2018, 27, 317–328. [Google Scholar] [CrossRef] [Green Version]
  42. Coudray, N.; Ocampo, P.S.; Sakellaropoulos, T.; Narula, N.; Snuderl, M.; Fenyö, D.; Moreira, A.L.; Razavian, N.; Tsirigos, A. Classification and mutation prediction from non–small cell lung cancer histopathology images using deep learning. Nat. Med. 2018, 24, 1559. [Google Scholar] [CrossRef] [PubMed]
  43. Penzias, G.; Singanamalli, A.; Elliott, R.; Gollamudi, J.; Shih, N.; Feldman, M.; Stricker, P.D.; Delprado, W.; Tiwari, S.; Böhm, M.; et al. Identifying the morphologic basis for radiomic features in distinguishing different Gleason grades of prostate cancer on MRI: Preliminary findings. PLoS ONE 2018, 13, e0200730. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Chatterjee, A.; Watson, G.; Myint, E.; Sved, P.; McEntee, M.; Bourne, R. Changes in epithelium, stroma, and lumen space correlate more strongly with Gleason pattern and are stronger predictors of prostate ADC changes than cellularity metrics. Radiology 2015, 277, 751–762. [Google Scholar] [CrossRef]
  45. Bobholz, S.A.; Lowman, A.K.; Barrington, A.; Brehler, M.; McGarry, S.; Cochran, E.J.; Connelly, J.; Mueller, W.M.; Agarwal, M.; O’Neill, D.; et al. Radiomic Features of Multiparametric MRI Present Stable Associations With Analogous Histological Features in Patients With Brain Cancer. Tomography 2020, 6, 160. [Google Scholar] [PubMed]
  46. Braman, N.; Prasanna, P.; Whitney, J.; Singh, S.; Beig, N.; Etesami, M.; Bates, D.D.; Gallagher, K.; Bloch, B.N.; Vulchi, M.; et al. Association of peritumoral radiomics with tumor biology and pathologic response to preoperative targeted therapy for HER2 (ERBB2)–positive breast cancer. JAMA Netw. Open 2019, 2, e192561. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  47. Lederlin, M.; Puderbach, M.; Muley, T.; Schnabel, P.A.; Stenzinger, A.; Kauczor, H.U.; Heussel, C.P.; Herth, F.J.; Hoffmann, H.; Dienemann, H.; et al. Correlation of radio-and histomorphological pattern of pulmonary adenocarcinoma. Eur. Respir. J. 2013, 41, 943–951. [Google Scholar] [CrossRef] [Green Version]
  48. Khorrami, M.; Prasanna, P.; Gupta, A.; Patil, P.; Velu, P.D.; Thawani, R.; Corredor, G.; Alilou, M.; Bera, K.; Fu, P.; et al. Changes in CT Radiomic Features Associated with Lymphocyte Distribution Predict Overall Survival and Response to Immunotherapy in Non–Small Cell Lung Cancer. Cancer Immunol. Res. 2020, 8, 108–119. [Google Scholar] [CrossRef]
  49. Solis, L.M.; Behrens, C.; Raso, M.G.; Lin, H.Y.; Kadara, H.; Yuan, P.; Galindo, H.; Tang, X.; Lee, J.J.; Kalhor, N.; et al. Histologic patterns and molecular characteristics of lung adenocarcinoma associated with clinical outcome. Cancer 2012, 118, 2889–2899. [Google Scholar] [CrossRef] [Green Version]
  50. Santos, M.K.; Muley, T.; Warth, A.; de Paula, W.D.; Lederlin, M.; Schnabel, P.A.; Schlemmer, H.P.; Kauczor, H.U.; Heussel, C.P.; Puderbach, M. Morphological computed tomography features of surgically resectable pulmonary squamous cell carcinomas: Impact on prognosis and comparison with adenocarcinomas. Eur. J. Radiol. 2014, 83, 1275–1281. [Google Scholar] [CrossRef]
  51. Hollings, N.; Shaw, P. Diagnostic imaging of lung cancer. Eur. Respir. J. 2002, 19, 722–742. [Google Scholar] [CrossRef] [Green Version]
  52. National Cancer Institute Clinical Proteomic Tumor Analysis Consortium (CPTAC). Radiology Data from the Clinical Proteomic Tumor Analysis Consortium Lung Squamous Cell Carcinoma [CPTAC-LSCC] Collection; National Cancer Institute Clinical Proteomic Tumor Analysis Consortium (CPTAC): Fayetteville, AR, USA, 2018. [CrossRef]
  53. National Cancer Institute Clinical Proteomic Tumor Analysis Consortium (CPTAC). Radiology Data from the Clinical Proteomic Tumor Analysis Consortium Lung Adenocarcinoma [CPTAC-LUAD] Collection; National Cancer Institute Clinical Proteomic Tumor Analysis Consortium (CPTAC): Fayetteville, AR, USA, 2018. [CrossRef]
  54. Weinstein, J.N.; Collisson, E.A.; Mills, G.B.; Shaw, K.R.M.; Ozenberger, B.A.; Ellrott, K.; Shmulevich, I.; Sander, C.; Stuart, J.M.; Cancer Genome Atlas Research Network; et al. The cancer genome atlas pan-cancer analysis project. Nat. Genet. 2013, 45, 1113. [Google Scholar] [CrossRef] [PubMed]
  55. Aerts, H.J.W.L.; Rios Velazquez, E.; Leijenaar, R.T.H.; Parmar, C.; Grossmann, P.; Carvalho, S.; Bussink, J.; Monshouwer, R.; Haibe-Kains, B.; Rietveld, D.; et al. Data From NSCLC-Radiomics-Genomics; Cancer Imaging Archive: Fayetteville, AR, USA, 2015. [Google Scholar] [CrossRef]
  56. Veta, M.; Van Diest, P.J.; Kornegoor, R.; Huisman, A.; Viergever, M.A.; Pluim, J.P. Automatic nuclei segmentation in H&E stained breast cancer histopathology images. PLoS ONE 2013, 8, e70221. [Google Scholar]
  57. Infante, M.; Berghmans, T.; Heuvelmans, M.A.; Hillerdal, G.; Oudkerk, M. Slow-growing lung cancer as an emerging entity: From screening to clinical management. Eur. Respir. J. 2013, 42, 1706–1722. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  58. Haralick, R.M. Statistical and structural approaches to texture. Proc. IEEE 1979, 67, 786–804. [Google Scholar] [CrossRef]
  59. Corder, G.W.; Foreman, D.I. Nonparametric Statistics: A Step-by-Step Approach; John Wiley & Sons: Hoboken, NJ, USA, 2014. [Google Scholar]
  60. Benjamini, Y.; Hochberg, Y. Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. R. Stat. Soc. Ser. B (Methodol.) 1995, 57, 289–300. [Google Scholar] [CrossRef]
  61. Ganeshan, B.; Goh, V.; Mandeville, H.C.; Ng, Q.S.; Hoskin, P.J.; Miles, K.A. Non–small cell lung cancer: Histopathologic correlates for texture parameters at CT. Radiology 2013, 266, 326–336. [Google Scholar] [CrossRef]
  62. Snoeckx, A.; Reyntiens, P.; Pauwels, P.; Van Schil, P.; Spinhoven, M.; Parizel, P. Radiologic-Pathologic Correlation in Lung Cancer Presenting as a Subsolid Nodule: Room for Improvement? Clin. Oncol. 2017, 2, 1290. [Google Scholar]
Figure 1. Estimating cellular density on histopathology slides: (a,d) illustrate a patch from each WSI, (b,e) show the associated cell density map, and (c,f) depict a corresponding tumor patch from CT. The top row shows a representative ADC while the second corresponds to a SCC study.
Figure 1. Estimating cellular density on histopathology slides: (a,d) illustrate a patch from each WSI, (b,e) show the associated cell density map, and (c,f) depict a corresponding tumor patch from CT. The top row shows a representative ADC while the second corresponds to a SCC study.
Cancers 12 03663 g001
Figure 2. Multi-scale texture analysis of cellular density on digitized pathology to compute pathomic features: the cell density map of a particular histopathology patch is used to compute two co-occurrence matrices (one along the horizontal-vertical direction and one along the diagonal direction) per scale. These are then utilized to compute 12 Haralick texture features per co-occurrence matrix, from which statistics are computed to result in 360 pathomic descriptors.
Figure 2. Multi-scale texture analysis of cellular density on digitized pathology to compute pathomic features: the cell density map of a particular histopathology patch is used to compute two co-occurrence matrices (one along the horizontal-vertical direction and one along the diagonal direction) per scale. These are then utilized to compute 12 Haralick texture features per co-occurrence matrix, from which statistics are computed to result in 360 pathomic descriptors.
Cancers 12 03663 g002
Figure 3. Texture analysis in CT to compute radiomic features: For each CT study, a sub-volume is defined as the largest annotated 2D section of the tumor together with two adjacent slices. For each sub-volume, co-occurrence matrices are computed based on CT image intensity values, based on which 12 Haralick features (HF) are calculated. Finally, five statistics are computed per Haralick feature to yield 120 radiomic descriptors.
Figure 3. Texture analysis in CT to compute radiomic features: For each CT study, a sub-volume is defined as the largest annotated 2D section of the tumor together with two adjacent slices. For each sub-volume, co-occurrence matrices are computed based on CT image intensity values, based on which 12 Haralick features (HF) are calculated. Finally, five statistics are computed per Haralick feature to yield 120 radiomic descriptors.
Cancers 12 03663 g003
Figure 4. Representative normalized co-occurrence matrices computed in horizontal-vertical directions from cell density maps: (a,b) correspond to ADC studies, and (c,d) to SCC studies. Note the more homogeneous pattern in SCC with smoother variation in cooccurrences compared to ADC.
Figure 4. Representative normalized co-occurrence matrices computed in horizontal-vertical directions from cell density maps: (a,b) correspond to ADC studies, and (c,d) to SCC studies. Note the more homogeneous pattern in SCC with smoother variation in cooccurrences compared to ADC.
Cancers 12 03663 g004
Figure 5. Box plots (af) of the six top-ranked pathomics features together with their p-value from Wilcoxon ranksum testing (unadjusted), when comparing ADC (blue) to SCC (yellow). Box plots are presented for both histopathological datasets considered, CPTAC (training) and TCGA (testing).
Figure 5. Box plots (af) of the six top-ranked pathomics features together with their p-value from Wilcoxon ranksum testing (unadjusted), when comparing ADC (blue) to SCC (yellow). Box plots are presented for both histopathological datasets considered, CPTAC (training) and TCGA (testing).
Cancers 12 03663 g005
Figure 6. Box plots (af) of the top-ranked radiomics features together with their p-value from Wilcoxon ranksum testing (unadjusted), when comparing ADC (blue) to SCC (yellow) non-small cell lung cancer subtypes in the NSCLC-R dataset.
Figure 6. Box plots (af) of the top-ranked radiomics features together with their p-value from Wilcoxon ranksum testing (unadjusted), when comparing ADC (blue) to SCC (yellow) non-small cell lung cancer subtypes in the NSCLC-R dataset.
Cancers 12 03663 g006
Figure 7. Heatmap visualizations for (a,c) Spearman’s rank correlation coefficients between pathologic cell density and radiologic intensity statistics (red = negative correlation, blue = positive correlation). (b,d) similarly visualized adjusted p-values for each pair of features as a heatmap, after False Discovery Rate (FDR) correction for multiple hypotheses testing.
Figure 7. Heatmap visualizations for (a,c) Spearman’s rank correlation coefficients between pathologic cell density and radiologic intensity statistics (red = negative correlation, blue = positive correlation). (b,d) similarly visualized adjusted p-values for each pair of features as a heatmap, after False Discovery Rate (FDR) correction for multiple hypotheses testing.
Cancers 12 03663 g007
Figure 8. Scatter plots of six pairs of correlated features identified via pathomic-radiomic association for (ad) ADC stuides, and (e,f) SCC studies. Also noted are correlated coefficients as well as FDR-adjusted p-values. The histogram of pathomic feature are plotted along the X-axes of each plot, while radiomic features are plotted along Y-axes.
Figure 8. Scatter plots of six pairs of correlated features identified via pathomic-radiomic association for (ad) ADC stuides, and (e,f) SCC studies. Also noted are correlated coefficients as well as FDR-adjusted p-values. The histogram of pathomic feature are plotted along the X-axes of each plot, while radiomic features are plotted along Y-axes.
Cancers 12 03663 g008
Figure 9. Top-ranked radiomic descriptors identified based on pathomic-radiomic associations and evaluated in Experiment 3, visualized as a heatmap (where red corresponds to over-expression): (a,d) Tumor on CT image, (b,e) Haralick correlation, (c,f) Haralick homogeneity for a representative ADC study (top row) as well as a representative SCC study (bottom row).
Figure 9. Top-ranked radiomic descriptors identified based on pathomic-radiomic associations and evaluated in Experiment 3, visualized as a heatmap (where red corresponds to over-expression): (a,d) Tumor on CT image, (b,e) Haralick correlation, (c,f) Haralick homogeneity for a representative ADC study (top row) as well as a representative SCC study (bottom row).
Cancers 12 03663 g009
Table 1. Summary of imaging parameters, demographic, and pathologic information of the whole slide image (WSI) cohort used in this study.
Table 1. Summary of imaging parameters, demographic, and pathologic information of the whole slide image (WSI) cohort used in this study.
ParameterCPTAC
(117)
TCGA
(54)
Magnification20×11729
40×025
Resolution (μm∖px)0.252025
0.4941170
0.502029
GenderMale8223
Female3531
Age at diagnosis 65.1 ± 9.368.6 ± 10.00
GradeWell differentiated8NA
Moderately differentiated109NA
Poorly differentiated0NA
Undifferentiated0NA
Pathologic StageI5819
II3818
III2112
IV03
Discrepancy32
NSCLC subtypeADC6125
SCC5629
Table 2. Summary of imaging parameters, demographics, and pathologic information of CT images used in this study.
Table 2. Summary of imaging parameters, demographics, and pathologic information of CT images used in this study.
Parameter NSCLC-R
(101)
TCGA
(45)
In-plane Resolution (mm) 0.970.6–0.97
Slice Thickness (mm) 32–5
Field of view (px) 512 × 512357–512 × 357–512
ScannerCMS, Inc.110
Siemens9010
GE Medical Systems033
Philips02
Patient Position Head First Supine (HFS)NA
GenderMale6420
Female3725
Age at diagnosis 68.5 ± 10.469.3 ± 9.8
Pathologic StageI1716
II1315
III719
IV03
Discrepancy02
SubtypeADC4919
SCC5226
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Alvarez-Jimenez, C.; Sandino, A.A.; Prasanna, P.; Gupta, A.; Viswanath, S.E.; Romero, E. Identifying Cross-Scale Associations between Radiomic and Pathomic Signatures of Non-Small Cell Lung Cancer Subtypes: Preliminary Results. Cancers 2020, 12, 3663. https://doi.org/10.3390/cancers12123663

AMA Style

Alvarez-Jimenez C, Sandino AA, Prasanna P, Gupta A, Viswanath SE, Romero E. Identifying Cross-Scale Associations between Radiomic and Pathomic Signatures of Non-Small Cell Lung Cancer Subtypes: Preliminary Results. Cancers. 2020; 12(12):3663. https://doi.org/10.3390/cancers12123663

Chicago/Turabian Style

Alvarez-Jimenez, Charlems, Alvaro A. Sandino, Prateek Prasanna, Amit Gupta, Satish E. Viswanath, and Eduardo Romero. 2020. "Identifying Cross-Scale Associations between Radiomic and Pathomic Signatures of Non-Small Cell Lung Cancer Subtypes: Preliminary Results" Cancers 12, no. 12: 3663. https://doi.org/10.3390/cancers12123663

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop