Polarimetric feature based microstructural mapping. Since Mueller matrix describes how the optical and structural properties affect polarization states as the photons propagate and scatter in the media, the differences in polarization feature imply differences in the microstructures. Polarization super-pixels obtained from Mueller images are clustered in polarization space, creating a map of tissue microstructural clusters. Our study reveals that the pixels from the sample clusters are spatially correlated, and each microstructural cluster corresponds with a specific pathological feature.
Several distinct clusters are observed, as shown in Fig. 3a. The clusters are clearly separated into two main groups, and detailed clusters can be seen in each group. Separated by a clear gap, two subgroups (cluster 1 and 2) are identified on the left, along with a few scattered clusters in the bot left corner. The gap separating the clusters indicates that there is a distinct difference in polarization characteristic between cluster 1 and 2. On the other hand, the clusters on the right side are more convoluted compared to the left side. Cluster 6 is noticeably separated from the other clusters, but cluster 3, 4 and 5 are close to each other, with many visible subclusters. To interpret these clusters pathologically, we project the points from the polarization feature space onto the co-registered histological images. After the observation and evaluation by pathologists, the sample segmented structures from each polarimetric cluster are identified as displayed in Fig. 3a. Pathologists infer that the two clusters 1 and 2 on the left correspond to normal and cancerous cell nuclei respectively. Furthermore, cluster 2 potentially correlates with differentiation degree, which will be discussed in the next section. Recall that the clear gap dividing cluster 1 and 2 indicates distinctive polarization characteristics, which in turn infer that the cancerous nuclei is microstructurally distinctive from benign nuclei. On the top-right corner, cluster 3 and 4 are involved with cell cytoplasm in liver H&E-stained pathological tissues. The two clusters are connected, indicating similarities in polarization feature, and the corresponding structural features, between the clusters. Unlike the other clusters, cluster 3 and 4 has many distinguishable detailed subsets. It suggests cell cytoplasm has various complicated subtypes of different microstructure or optical properties, which is plausible considering the rich variety in sizes and shapes for organelles within cells. Cluster 5 and 6 mainly consist of collagen fiber and fibrocytes respectively. In cluster 6, there are two identifiable subgroups, implying there are subtypes of fiber with different microstructure property. The unlabeled clusters correspond to noises or imaging artifacts. In short, the polarization pixels form clusters on the UMAP axes, and each cluster has a distinctive microstructure characterization. We discover that pixels clustered in polarization feature space are clustered spatially in histopathology images as well, segmenting pathologically meaningful structures such as normal and cancerous cell nucleus, cytoplasm, fibrocytes, and collagen fiber.
Lesion induced microstructural alteration. Pathological alteration of tissues may induce changes in tissue microstructure, and here we aim to visualize the microstructure transition at different stages of cellular differentiation. Specifically, cellular differentiation affects tissue microstructures, and such changes in microstructure composition can be revealed in the polarization feature space.
To explore the concept of visualizing microstructure transition during pathological changes, we collected HCC samples at different stages of differentiation degrees, and visualized the microstructure composition at each stage using a density heatmap on the UMAP atlas. Figure 3b demonstrates how ROIs from normal, well differentiation, moderate differentiation, and poor differentiation HCC samples differ in microstructural composition, represented by the UMAP density heatmap from the respective tissue samples. There seems to be a characteristic density distribution of polarization signatures at each stage of cellular differentiation degree, where the overall layout of the map remains stable, but the proportion and distribution of pixels that belongs to each cluster varies. The density ratios of several clusters appear to alter distinctly with different cellular differentiation degree: the density of cluster 2 (identified as cell nuclei by pathologists in pathological images) and cluster 6 (recognized as fibrocyte) both increase monotonically as the differentiation degree decreases. On the other hand, the lower right part of cluster 3 (realized as cytoplasm) vanishes gradually as differentiation degree decreases, while the upper left part of cluster 3 changes abruptly between well and moderately differentiated cellular states. Cluster 2 is clearly absent in benign tissues but present in malignant tissues, which implies that the presence of this specific microstructure characterization correlates strongly with tumor malignancy. Such polarization signature can be potentially used as the marker for HCC malignancy detection, providing a quantitative way to identify HCC tumor. The idea is thoroughly explored in the polarization marker section. Another point we noticed is that the configuration of the primary clusters is fixed, while the composition of clusters and the local density of data points varies with respect to cellular differentiation. To shortly summarize: (1) polarization based microstructural clusters contain pathologically significant information such as cellular differentiation degree, and (2) the principal configuration of the microstructural map in UMAP representation remains stable; only the inter-cluster proportion changes with pathological variation.
Cluster dendrogram. Each cluster has its unique polarization characteristic and corresponding pathological feature, and the dendrogram plot in Fig. 4 illustrates how the pixels are separated into clusters using polarization features. Dendrogram illustrates how the clusters are formed, where the node indicates two clusters are merged into one, and the abscissa-axis indicates the distance between clusters. Each branch corresponds to a certain variation in polarization features, and variation in polarization features infer variation in the microstructure. As shown in Fig. 4, the entire set first splits into two main groups, which contain nucleus structures (cluster 1, 2) and the rest (cluster 3, 4, 5, 6), including collagen fiber and cytoplasm. This indicates that cell nucleus and collagen fiber have very different microstructural properties, unsurprisingly. In the top strand, the two main nucleus clusters are extracted by stripping away the artifact clusters, mostly utilizing the equalities of Mueller matrices. In the bottom strand cluster 6 is the easiest to identify, as indicated by the between-cluster distance. The polarization microstructural map and dendrogram plot demonstrate that each subdivided cluster corresponds to a meaningful pathological microstructure in the H&E images, identifiable with distinctive polarization features.
Polarization markers for diagnosis. Tissue microstructure variation is pathologically meaningful. Here, we aim to extract polarization-based microstructure markers and explore their diagnostic value. Subwavelength microstructures, such as that of the cytoplasm, are potential polarization markers as well. Once the tissue microstructural subtypes are mapped using polarization features, it enables the visualization of pathological transition during primary liver cancer progression and the identification of potential polarization markers for tumor classification. To demonstrate, we attempt to tackle two different tasks: identification of HCC with different differentiation degrees, and classification of ICC from HCC.
We first study the microstructural variation during HCC progression and recognize cancerous tissues with different differentiation degrees. We observe the tissue microstructural composition alteration correlates with pathological variation in HCC, from normal liver tissues to well, moderate, and poor differentiation degree of HCC, which is clearly visualized in Fig. 5a, and the animated heatmap provided in Supplementary 1. As indicated in Fig. 4a and Fig. 5a, we identified cluster 2 and 6 as the polarization marker to distinguish HCC from normal tissue. The cluster area proportion (area of pixels belonging to that specific cluster divided by the total area in the ROI) of cluster 2 and cluster 6 are calculated for normal tissue, highly differentiated, moderately differentiated, and poorly differentiated cancer samples respectively. The resulting box-whisker plots are shown in Fig. 5b, and their corresponding p-values for the t-test are calculated to test for statistical significance. When using cluster 2 as the marker, the distinctions between neighboring sets are statistically significant, but not between the well differentiated and moderately differentiated samples (p = 2.22x10− 5 for normal vs well, p = 1.00 x10− 4 for moderate vs poor, p = 1.16x10− 1 for well vs moderate). The monotonically increasing trend of cluster 2 proportion is observed as well. To discriminate malignant HCC samples from normal tissues, we have separated all the samples into two sets, namely the malignant set (containing HCC samples of all three differentiation degrees) and the normal set (containing all the normal ROIs). Under this binary classification scenario, cluster 2 area proportion achieves an AUC of 94.84%. Using cluster 6 as the polarization marker shows a similar trend, as seen in Fig. 5c. The area proportions of cluster 6 in normal tissues are relatively low, most of which are under 5%. The separation between normal and highly differentiated samples (p = 2.69x10− 2), and between moderately and highly differentiated samples (p = 1.48x10− 4) are statistically significant, but not between highly and moderately differentiated samples (p = 5.87x10− 1).
As labeled in Fig. 5d, we identified a cytoplasm structure in cluster 3 that is present in moderately differentiated samples, but not in highly differentiated samples. Using the labeled region as polarization marker, an AUC of 88.59% is achieved for distinguishing well differentiated from moderately differentiated samples, as shown in Fig. 5e. This observation is quite unconventional and requires validation in future works.
We now study the differences in tissue microstructural composition between HCC and ICC pathological samples, in an attempt to distinguish them. Likewise, we can first visualize the variation of tissue microstructural composition by observing the animated heatmap (provided in Supplementary 2) that samples gradually from HCC to ICC tissues. The microstructures characterized by cluster 5, i.e., collagen fiber, is abundant in ICC samples, but not in HCC samples, as seen in Fig. 6a. In contrast, cluster 3, a subtype of cytoplasm, is abundant in HCC sample while not in ICC samples. Cluster 5 is the main focus as the polarization marker for HCC and ICC distinction. For comparison, Fig. 6b shows the box-whisker plot of the cluster 5 area proportion for both HCC and ICC samples. It is observed that most of HCC samples’ cluster 5 proportion ratio is less than 10%, while that of the majority of ICC samples are above 10%. For discriminating ICC from HCC samples, the area proportion of cluster 5 yields an AUC of 84.94%. We also experimented with cluster 3, using it as the polarization marker yields an AUC of 71.69%, as seen in Fig. 6c. This implies that to a certain degree, the cytoplasm composition in HCC and ICC are different.