We have established a TMA resource to assess the in situ, inter-anatomic site, and temporal ITH of IHC biomarkers in HGSC. We observed variations of several biomarkers across different anatomical sites and temporal comparisons between primary and recurrence. Using clinically established IHC markers, we provided a benchmark for ITH that can be used to investigate new emerging biomarkers.
WT1 and p53 represent the minimal ancillary IHC panel to confirm a diagnosis of HGSC [13]. As expected, for the truncal founder mutation of HGSC [29], p53 showed extremely low ITH across anatomical sites and temporally distinct samples. A similar high agreement was previously demonstrated for pre- and post-chemotherapy HGSC samples [21]. Rarely, two different abnormal p53 staining patterns can be observed in what seems a single tumor, but this should raise suspicion for the dual primary origin of two HGSC from different STICs or co-occurrence of HGSC with an endometrial serous carcinoma [30] Subclonal p53 IHC, defined by the combination of abnormal with normal wild-type pattern, is inconsistent with a diagnosis of HGSC and should raise the possibility of an alternative diagnosis, such as endometrioid carcinoma. However, this must be distinguished from areas of poor fixation within HGSC [31]
Similarly, WT1 has also been shown to be robustly expressed in pre- and post-chemotherapy HGSC samples[21] and herein, we show almost perfect agreement using categorical data. However, in a continuous analysis, the agreement dropped to only moderate for some types of heterogeneity. Although pathologists consider the extent of expression in their interpretation (i.e., integrated with the morphological appearance of a tumor), this should only cause occasional practical problems since a categorized interpretation (absent versus present) remains the standard in clinical pathology reporting. Most HGSCs show diffuse WT1 in all tumor cells. Yet in a few HGSC samples, a subset of the tumor cells may lose WT1 expression (WT1 is expressed in a normal fallopian tube, the tissue of origin, in both secretory and ciliated cells), potentially due to the effects of chromosomal instability/copy number variations, including homozygous deletions of WT1[32]. While WT1 shows slight ITH as a continuous marker, it is very robust as a categorical marker.
The concordance for p16, sometimes used as a second-line diagnostic marker[14], was also almost perfect and showed only slightly lower agreement in comparison with p53. Abnormal IHC expression for p16 in HGSC is multicausal and includes homozygous deletion for the abnormal complete absence pattern, while abnormal block expression may be caused by alterations in the G1/S phase such as RB1 loss [33] and CCNE1 high-level amplification[34]. RB1 loss, for example, can occasionally occur in a subclonal fashion, which could result in ITH of p16[33]
The potentially prognostic markers showed a weaker categorical concordance, especially in the temporal context, when comparing primary and recurrence, potentially due to systematic effects of in-between treatment or oncogenic changes during tumor evolution and progression. For example, neoadjuvant chemotherapy lowers the Ki67 labelling index of HGSC[35] The primary samples from the COEUR cohort came exclusively from chemo-naïve primary debulking specimens[36] While we observed significant ITH when comparing primary and recurrence, we did not observe systematic shifts in the prevalence for the Ki67 labelling index on recurrence. This suggests that in contrast to the short time interval between neoadjuvant chemotherapy and interval debulking, proliferation rates recover in case of recurrence, which usually occurs sometime after chemotherapy exposure. There was also no change in the prevalence of PR in the temporal course, but CD8 counts increased on recurrence. While a favourable prognostic marker should not necessarily increase on recurrence[16] an increase of CD3 and CD4 lymphocytes in recurrent HGSC has been reported in another study[37] This poorer temporal agreement among the prognostic markers may require retesting on recurrence with the need to obtain additional longitudinal tissue biopsies.
Within the spatial context, however, the potential prognostic biomarker (PR, CD8, Ki67) generally showed a substantial agreement between anatomical sites, which supports using a single biopsy for adequate biomarker assessment. For CD8, this aligns with another study that concluded that small tissue biopsies represent the immune microenvironment adequately [38] Overall, all six tested IHC biomarkers showed substantial agreement with ICC > 0.60 in most spatial comparisons. This compares favourably to the recent ITH in HRD scores, with only 78% agreement [6] However, as in this example, results on ITH are often not comparable due to differences in reporting (% agreement, kappa, ICC). We recommend reporting ICC as the best way to report ITH, with values over 0.60 as being clinically robust. We provide a power calculation to estimate the number of samples needed to assess for ITH.
In addition to true biological factors, technical factors can contribute to ITH, such as the quantity and quality of tumor sampling. Tissue fixation and processing can impact biomarker expression levels, as non-standardized fixation protocols and processing methods may affect the stability and accessibility of biomolecules. Differences in the quantity of tumor cells available for analysis and tumor cellularity can also result in heterogeneity.
In conclusion, molecular analysis for diagnostic, prognostic or predictive biomarkers is typically conducted on one random tumor sample extracted from patients with HGSC. This method is valid if molecular alterations are truncal and present in every tumor cell. However, several driver alterations occur during tumor evolution in subclones [3], and their accurate assessment will be subject to sampling bias. While this bias is unavoidable, it should be at least measured, and if the acceptable ICC threshold of > 0.60 cannot be met, mitigating strategies such as increase in number of samples tested, minimum sample size, standardization of the sample site and time (primary, possible retesting on recurrence) should be established.