The difference between MC and AD in CRC has already been recognized at clinical and molecular levels. However, existing studies cannot make the treatment for MC precise, and MC is still treated almost identically to AD(2). As far as we know, the most prominent feature of MC that distinguishes it from AD is the presence of abundant extracellular mucin. Considering the malignant clinical features such as drug resistance and more frequent metastasis in MC patients, the mucins may play an important role(24, 25), but few studies figure out the mechanism on genome/transcriptome leading to the clinical features. Meanwhile, the genesis mechanism of MC is still not clear. For more precise treatment, much needs to be explored at the molecular level of MC. This study attempts to explore the characteristics and different expression patterns of AD at the transcriptome level of MC, and to be the paving stone for identifying the genesis of MC, even making MC treatment precise in the future.
In our study of constructing WGCNA and module-trait correlation, it came to our notice that modules have similar direction and significance of correlation with MC, right-side colon and MSI-H. According to present studies, MC is more frequent in right-side colon and has more MSI-H than AD. Also, studies have reported that MSI-H are more frequently happened in right-side colon(26, 27). In our study, the relationship between MC, right-side colon and MSI-H also seems complicated, especially between MC and MSI-H at the transcriptome level. It is difficult to find the key module(s) of MC. For example, the tan, darkgrey and grey60 modules showed coefficient and P value of high similarity in MC and MSI-H. Although from the RNAs in each module we could say that the darkred module is the most likely key module of MC, because most of the known RNAs that are differentially expressed between MC and AD have been clustered in it.
From the Venn plots (Fig. 3), it appears that only a small fraction of the differential RNAs in the magenta and brown modules are differentially expressed between MC and AD. Combining the results of the module-trait correlations, it is consistent that the tan module doesn't show a better correlation with MC than MSI-H, while magenta shows a much better correlation with MSI-H. From this aspect, we hypothesize that magenta module could be a potential key module of MSI-H but not MC. Although several studies have mentioned the relationship between MC and MSI-H at the molecular level, the core reason for this appearance remains unclear(28, 29). Further studies focusing on RNAs in other modules and the mechanism of MC and MSI-H highly correlated should be worked on.
In the enrichment analysis, despite the common terms enriched in all four modules, the most specifically enriched terms of the darkred module are those with 'extracellular', such as extracellular region and extracellular space. This finding may be explained by the different hub RNAs associated with the extracellular mucins and is like other studies(30). The lightsteelblue1 and tan modules are both enriched for immune-associated terms, and the differential RNAs in them are mostly immune-associated RNAs. This may indicate that these two modules are both involved in the immune process. Also from the co-expression network, the RNAs in these two modules are closely connected, which means that lightsteelblue1 and tan modules may have a similar function pattern.
To further understand the result, we examined the expression level of these differential hub RNAs in human normal cells, and the result presents us with valuable points. Most of the differential hub RNAs in the darkred module were highly expressed in human normal intestinal goblet cells. It is well known that goblet cells are the major producers of mucins in the intestinal tract, and MUC2, the most famous mucin of MC, is secreted by goblet cells(31). Besides, the differential hub RNAs of the darkred module also contain many of the marker RNAs of the progenitor cells of intestinal goblet cells. We have summarized points from recent studies of intestinal goblet cell differentiation and found that parts of the darkred module differential hub RNAs appear in this process (Fig. 7)(32–35). ATOH1 and SPDEF, which are secretory lineage markers, together with other secretory and goblet progenitor markers such as ITLN1, TFF3 and GFI1 are all upregulated in our study. The mucin RNAs other than MUC2, such as AGR2, FCGBP and SPINK4, which mark intestinal goblet cells, also show this pattern. From this finding, we hypothesis that MC tumor cells share similar transcriptome features with intestinal goblet cells, and that the genesis of MC may be associated with a specific differentiation process to goblet cells other than other types of enterocytes. Also in our study, we found that mRNA FOXA2 with its neighbor lncRNA LINC00261 which is proved to induce FOXA2 expression epigenetically are also differential hub RNAs in darkred module. And FOXA2 is proved to control the differentiation of goblet in mice(36, 37). This is also evidence to support that genesis of colorectal MC may related with goblet direction of differentiation.
Figure 7The fate of goblet cells differentiation from crypt-base columnar cells (CBC) and the marker genes in different periods*.
In the tan and lightsteelblue1 modules, the differential hub RNAs showed high levels of expression in normal human macrophages. Although these RNAs were differentially expressed between MC and AD in our study, it's more likely that these RNAs are not from tumor cells but from immune cells like macrophages. To our knowledge, there is no evidence that MC or mucins have such effect like recruitment on macrophages that could explain the results in our study. However, there are studies that have found that MUC2 is associated with inflammation through IL-10, IL-6 and TNF-α et al. In our study, IL-6 is one of the differential hub RNAs in lightsteelblue1. In other studies, CRC patients showed an increasing trend of IL-6, and the silencing of MUC2 may increase the secretion of IL-6 by CRC cells(38, 39). These conclusions cannot explain our result because we found that both IL-6 and MUC2 were upregulated in MC compared to AD. As a conclusion of this finding, MC is somehow more closely associated with the immune process, and macrophages may play an important role in MC. However, whether the result is caused by macrophage recruitment around tumor cells or not and whether it occurs at the transcriptome level (mucin RNAs) or protein level (mucins) should be further investigated.
We then investigated the clinical value of these four modules. From the LASSO logistic regression analysis. The darkred and magenta modules show an acceptable AUC of the model. However, most of the RNAs in the models weren't reported to be associated with MC, except for TFF3, SPINK4 and REG4 in the darkred module. And the significance of each RNA in the TCGA and GEO datasets does not match. This result makes the clinical diagnostic value of the RNA signature evaluated in this study not sufficient for practice, but it indicates that the potential ability of the tan and lightsteelblue1 module to discriminate MC from AD is weak.
As for the survival analysis in this study, because the LASSO-Cox model could not be constructed for each module, the KM curves were plotted by the best cut-off calculated for each differential hub RNA. The result is that almost all RNAs have a better survival in the high expression group. Based on the fact that darkred, tan and lightsteelblue1 modules are positive with MC, the result indicates that MC should have a trend of better survival for these RNAs that are upregulated in MC. This finding may explain the fact of the conflicting survival difference between MC and AD(40–42). The malignant clinical features of drug resistance, advanced stage at diagnosis and more metastases could be a passive strength for patient survival. While the upregulation of survival associated RNAs found in our study shares a similar appearance with normal tissues and could be a positive strength. For example, MUC2 is decreased in AD but increased in MC(43). The MUC2 in normal tissues is prominent in anti-inflammatory, preventing invasion of foreign pathogenic organisms and keeping the intestinal microecology in balance(9). In AD, the decrease of MUC2 may lead to a loss of protective effect and AD(44), while on the contrary aspect, the overexpression of MUC2 may also lead to oncogenic effects by decreasing the innate and adaptive immune response with the appearance of increasing mucin secretion(45). And the mucus layer composed of mucin may act as a physical barrier and cause resistance to systemic treatment(46). These findings suggest that the differential hub RNAs in darkred, light steelblue1 and tan modules may act as MUC2, the high expression of which makes the transcriptome or function of MC cells more similar to normal cells.
When performing qRT-PCR, the selection of cell lines was a complicated task. As far as we know, MUC2 is the marker mucin protein of the intestinal tract and is particularly expressed in goblet cells. Knowing that AD expresses less MUC2 than normal tissue while MC expresses more, the better way to study the difference between two subtypes is to use cell lines with high MUC2 expression compared to those with low expression. Ls174T has been considered as a cell line with high MUC2 expression, whereas HT-29 and T84 also express MUC2 but at a moderate level(23, 47). From this point of view, the result of qRT-PCR suggests that lncRNAs (CTD-2547H18.1, CTD-2589M5.4, RP11-234B24.2, LINC00261, RP11-25K19.1) and mRNA CAPN9 are more highly expressed in the mucin-producing cell lines. However, these cell lines cannot fully represent MC in the human body and may only represent part of the pathway from drivers to MUC2. This means that it will be much easier to explain the difference between mucin-producing and non-mucin-producing cell lines than the difference within Ls174T, HT-29 and T84. Just like CTD-2547H18.1 and CTD-2589M5.4, both showed higher expression levels in mucin-producing cell lines, but the former had a peak in T84 while the latter had a peak in Ls174T. According to articles on the molecular differences between colorectal cell lines(48), Ls174T and HT-29 may differ in microsatellite status, CpG island methylator phenotype (CIMP) and mutations. However, T84 was not included in this study. Several articles had shown that T84 is a crypt-like cell line, which raised the question of whether there is similarity between T84 and CBC at the genome or transcriptome level. Also from our study, we make a reasonable speculation that the genesis of MC may be involved in the fate of CBC differentiation to goblet cells, the result of lncRNAs with higher expression in T84 may possibly be caused by this. In any case, further studies on the genesis of colorectal MC are needed to reach this conclusion.
Limitation still presented in our study. Most of the differential hub RNAs in the four modules are not well studied and the molecular function of them isn't clear, especially in MC. Despite the mRNAs, many lncRNAs appeared as differential hub RNAs with validation of qRT-PCR in the study, but we cannot explain the roles they play in the mechanism of MC exactly. Although our study of bioinformatics analysis provides some insights and hypotheses, many more experiments at different levels should be conducted to explore more deeply.