There is a growing understanding of the critical role of the breast tumor immune microenvironment in treatment response and outcomes in breast cancer, yet modifiable patient factors that impact immune infiltration in breast cancer remain poorly understood. The Nurses’ Health Study offers a large-scale, unique cohort to evaluate interactions between patient lifestyle exposures and the immune microenvironment. Specifically, this study is unique in its integration of multiple immune cell-specific immunohistochemistry (CD8, CD4, CD20, and CD163) with published immune gene expression signatures, including the IRIS gene sets, which we recently demonstrated can effectively interrogate the breast cancer tumor immune microenvironment in a large clinical trial [33]. Using paired IHC and gene expression data from over 250 unique patient tumors for each marker, we successfully derived improved expression-based predictors of immune cell subset infiltration, then evaluated the association with patient factors: BMI change from age 18, physical activity, and EDIP.
Higher BMI change since age 18 demonstrated the strongest association between immune cell-specific expression scores and patient factors, specifically GSAct, CD4 T-cell score, and CD163 macrophage score. BMI change in adulthood has been associated with breast cancer risk [34]. The role of obesity in the tumor immune microenvironment is of great interest, driven initially through evidence in melanoma that obesity was associated with longer survival in patients receiving immunotherapy [10]. Mechanistic work suggested that, paradoxically, obesity results in increased immune aging, tumor progression and PD-1-mediated T-cell dysfunction yet increased efficacy of PD-1/PD-L1 blockade in murine models and patients with cancer [35]. In HER2 positive breast cancer, higher BMI was independently associated with worse survival in early breast cancer, but was contradictorily associated with better survival in advanced breast cancer [36].
While most FDA-approved immunotherapies target CD8 + cytotoxic T-cells, the role of CD4 + helper T-cells [37, 38] and CD163 + macrophages [39–41] are less well defined but increasingly seen as key players in the breast cancer microenvironment. It is established that T-lymphocyte populations change with obesity (reviewed in [42]). In our model, for each 10 unit (kg/m2) increase in BMI – roughly equivalent to an individual going from BMI 20 (normal weight) to BMI 30 (obese), the percentage of cells positive for CD4 and CD163 increased 1.6% and 1.4%, respectively. In obesity, there is evidence that interferon gamma-producing pro-inflammatory CD4‐positive Th1 cells are increased, whereas anti‐inflammatory CD4‐positive Th2 and Treg cells are reduced [42]. Intriguingly, we have previously shown in the Nurses’ Health Study that higher BMI was associated with increased expression of genes associated with IFN alpha and gamma response in ER- tumor and ER- tumor-adjacent tissues [9].
Inflammatory diet was positively associated with GSAct in bivariable analyses with a trend in multivariable analyses, suggesting that a more pro-inflammatory diet is associated with higher immunity. This differs from our hypothesis, which was based on data in colorectal cancer, where inflammatory diet was associated with a higher risk of developing colorectal cancer only among tumors that had low tumor infiltrating lymphocytes [15]. It is likely that the distinct settings, for example direct exposure of colonic mucosa to dietary elements versus no exposure in breast tissue, may influence these effects. To our knowledge, this is the first study to examine the relationship between inflammatory diet and quantity of TILs and immune subsets in breast tumors.
We also did not find a correlation between physical activity and immune cell-specific expression scores, specifically not GSAct, CD8, or CD4, which had either bivariable or multivariable association with other patient factors. Physical activity has been hypothesized to be associated with biomarkers of inflammation in breast cancer survivors [43]. In animal breast cancer models, effect of physical activity on the amount of TILs is conflicting [44, 45]. In humans, exercise was not found to affect levels of circulating T-cells in patients receiving chemotherapy for breast cancer [46]. To our knowledge, this is the first study exploring the effect of physical activity on multiple immune cell subset infiltration in breast tumor tissue. While physical activity was not associated with immune cell infiltration in this study, overall, patients with breast cancer have been shown to benefit from exercise during and after cancer-directed therapy, and this study should not be used to justify a sedentary lifestyle for these patients [47, 48].
In this cohort of over 600 tumor-normal pairs and an independent validation cohort (TCGA), both GSAct and CD8A expression metrics show modest – but consistent – correlation between breast cancer and tumor-adjacent normal breast tissue. This suggests that the adjacent normal breast may reflect an altered immune microenvironment in the context of breast cancer. In a smaller cohort, inflammation expression was elevated in adjacent normal tissue relative to reduction mammoplasty tissues [49]. In the NHS, elevated inflammatory expression in adjacent normal breast tissues was associated with higher BMI and alcohol consumption specifically in ER- tumors [9, 22]. It is possible that tumor-adjacent normal inflammatory gene expression is a ‘bystander’ effect in response to the tumor. Additional work on the immune microenvironment of tumor adjacent breast is warranted to understand if adjacent normal immune infiltration is associated with breast carcinogenesis, immune infiltration of established tumors, or therapy response.
This study does have limitations. Only a subset of all the subjects enrolled in NHS/NHSII were included in the TMA due to limited tumor tissue availability. However, the characteristics of participants included in the TMA were very similar to those of all the eligible cases, including BMI, physical activity, EDIP score, and other breast cancer risk factors (e.g. first-degree family history and parity). Adiposity has differential associations with breast cancer risk based on menopausal status and the models were derived primarily in NHSII subjects and applied primarily in NHS subjects; importantly, menopausal status and NHS cohort were covariates included in multivariable models. We acknowledge that TILs in breast cancer are typically characterized using a standard approach based on the International TILs Working Group [50]; however, given the computer-based quantification approach and diversity around stromal versus epithelial correlation across immune markers, we used total positive cells. In addition, the correlation between novel expression signatures and infiltrating immune cell number was modest, though significantly outperformed multiple established immune cell-specific signatures. We hypothesize that this reflects the fact that bulk transcriptome signatures may not be an optimal way to represent discrete infiltrating immune cells numbers and supports further work on single cell-based technologies such as single cell RNA sequencing and highly multiplexed immunofluorescence profiling.
Future studies of the association of patient factors, specifically BMI and EDIP, and patient outcomes are warranted. It would be interesting to investigate the effect of modification of BMI on patient outcomes with different systemic treatments. Associations between BMI, physical activity, and diet with other immune cell subsets should also be investigated in order to understand the complex relationships between immune cell infiltration and modifiable patient factors. Further, in future studies, the association between immune infiltration signatures and patient prognosis will be assessed, but this requires careful analysis beyond the scope of the present study. In addition, to improve gene expression signatures of specific immune subsets we plan to utilize more complex modeling, such as machine learning; however, a strength of our linear model-based score approach is the ability to identify and quantify contribution of individual components.