Participants
This is a natural history, non-interventional study of 45 consecutive clinically and cognitively normal midlife women at different endocrine stages, including equal proportions of pre-menopausal (standardized to midcycle), peri-menopausal, and post-menopausal participants. Participants were recruited at Weill Cornell Medicine (WCM) between 2021-2022 from multiple community sources, including individuals interested in research participation, family members and caregivers of impaired patients at our institution, and by word of mouth10-13.
All participants gave written informed consent to participate in this 18F-fluoroestradiol (18F-FES) positron emission tomography (PET) study,which was approved by the WCM Institutional Review Board. Use of 18F-FES was carried out under WCM Radioactive Drug Research Committee and National Cancer Institute (NCI) Investigational New Drug (IND) #146703 approval.
Participants were 40-65 year-old women with >12 years of education and a diagnosis of normal cognition per physician’s assessment, with Montreal Cognitive Assessment (MoCA) scores >26 and cognitive test performance within normative values for age and education10-1312,50,51. Pre-established exclusion criteria included: (i) any significant neurological disease, such as dementia, normal pressure hydrocephalus, brain tumor, progressive supranuclear palsy, seizure disorder, subdural hematoma, multiple sclerosis, or history of significant head trauma followed by persistent neurologic deficits or known structural brain abnormalities; (ii) any significant psychiatric disease, such as major depression, bipolar disorder, schizophrenia, or psychotic features; (iii) T2 and/or FLAIR MRI brain scan evidence of infarction, lacunes or demyelination disease; (iii) systemic illnesses, unstable medical conditions or major medical complications such as treatment for neoplastic disease, unmanaged cardiovascular disease, diabetes, renal or liver disorder; (iv) history of drug or alcohol dependence; (v) current use of psychoactive medications (e.g. benzodiazepines, cholinesterase inhibitors, psychostimulants, etc.) or investigational agents; (vi) contraindications to MRI or PET imaging. Additional exclusion criteria included: (vii) history of oophorectomy or hysterectomy; (viii) use of hormonal therapy; (ix) active pregnancy.
All participants underwent clinical examinations including medical history, neurological exams, neuropsychological testing, blood analysis including genetics and sex steroid hormones, multi-modal MRI and 18F-FES PET imaging. The patients’ sex was determined by self-report.
Herein, we capitalized on the menopause transition as a natural experiment of estradiol decline. Participants were enrolled into three size-matched groups according to menopausal status based on the Stages of Reproductive Aging Workshop (STRAW)52 with hormone laboratory assessments as supportive criteria (pre-menopause: no change in menstrual regularity in the past 12 months; peri-menopause: no menses in the past 3–11 months; post-menopause: no menses for the past >12 months)52. Participants were therefore not randomly assigned to groups. Sex steroid hormone levels, including estradiol (E2), were measured by a commercial laboratory (Boston Heart Diagnostics, Framingham, MA).
All participants were asked to report the date of their last two menstrual periods for diagnostic purposes. PET studies of pre-menopausal participants were scheduled to coincide with the next nearest midcycle, when plasma E2 levels are highest. Cycle irregularities in peri-menopausal women prevented scheduling their PET scans according to a specific menstrual cycle phase. However, plasma E2 levels were included as a covariate in statistical analysis. Blood samples were taken on the day of the PET study for all participants except two who did the blood draw the day prior, and one that was done a week later. To test whether tracer binding at midcycle was impacted by tracer competition with endogenous E2, we included E2 as a covariate, which only enhanced differences in ER density between menopausal groups. Results remained unchanged excluding E2 data from the last participant.
Our final study cohort included 45 consecutive participants, divided into three size-matched groups of 15 participants each according to menopause status: 15 pre-menopausal, 15 peri-menopausal, and 15 post-menopausal participants. Three of the original 45 participants had to be excluded due to conditions encountered in the MRI scan (demyelination; 1 pre-menopausal and 1 post-menopausal participant) or positive pregnancy test (1 pre-menopausal participant). As such, 3 additional participants were enrolled in those respective groups. Participant characteristics of the final sample are shown in Supplementary Table 1. Unless explicitly stated otherwise (in case of analyses including measures only available for a subset of the participants), these represent the sample sizes used in analysis.
There were no differences in demographic measures between menopausal groups, except for an expected age difference between the pre-menopausal and post-menopausal groups (Supplementary Table 1), which was addressed according to published protocols12,53. Briefly, we used box plots and frequency diagrams to ensure that we had sufficient age overlap among different menopause statuses, which enabled us to test for effects of endocrine aging separately from chronological aging. Age was then set as a covariate in all analyses.
All participants completed menopause questionnaires, including rating scales for symptom clusters including presence of vasomotor symptoms, changes in mood, sleep, libido, and cognition10-13. A composite menopause symptom score was calculated as the sum of individual scores, with higher values reflecting more symptoms. These scores were higher in the post-menopausal (4.5±3.18) and peri-menopausal groups (5.3±3.1) as compared to the pre-menopausal group (1.7±1.3; p’s<0.010). Menopause symptom scores and symptom presence were examined as correlational outcomes.
Cognitive Testing
Participants underwent a cognitive testing battery assessing general cognitive functioning, memory [Rey Auditory Verbal Learning Test (RAVLT) total, immediate and delayed recall; Wechsler Memory Scale logical memory immediate and delayed recall], executive function [Trail Making Test (TMT)-B; FAS], and language [animal naming; WAIS object naming]10-13,50,51. We then calculated a composite verbal memory score by z-scoring both delayed recall tests and averaging across measures. Additionally, a global cognition score was obtained by z-scoring each of the remaining tests and averaging within and across domains. TMT-B scores were first inverted by multiplying by -1. Descriptive assessment of cognitive scores across menopause statuses showed no significant differences, adjusting by age and years of education (p’s>0.69; Supplementary table 1). As cognitive scores differed by less than 1 standard deviation between menopause groups, the variance in these measures was deemed too small to enable meaningful assessment of correlations with 18F-FES data.
Brain Imaging
Acquisition
All participants received MRI and PET scans following standardized protocols10-13. Scans were performed on consecutive days, except for 9 participants who completed FES an average of 0.8+1.9 months before or after MRI. Adjusting by time between scans as a covariate did not significantly impact correlational results.
All MR sequences were acquired on a 3.0 T MR750 Discovery scanner (General Electric, Waukesha, WI) using a 32-channel head coil in a single imaging session, including:
Volumetric MRI: 3D volumetric T1-weighted MRI [BRAVO; 1x1x1 mm resolution, 8.2 ms repetition time (TR), 3.2 ms echo time (TE), 12° flip angle, 25.6 cm field of view (FOV), 256x256 matrix with ARC acceleration].
Arterial Spin Labeling (ASL): acquired using a pseudo-continuous technique [4851 ms TR, 10.6 ms TE, 4 averages, 24 cm FOV, 2.0x2.0x3.8 mm resolution] to estimate cerebral blood flow (CBF) from arterial blood water54. One post-menopausal participant did not complete the scan due to technical issues.
31Phosphorus Magnetic Resonance Spectroscopy (31P-MRS): acquired on the same scanner using a dual tuned 31P/1H quadrature head coil (Clinical MR Solutions, Brookfield, WI). Prior to MRS scanning, shimming was performed using a 1H single voxel technique placed over the entire brain avoiding the air-tissue interfaces. Multiple 2D slices were acquired resulting in an 8x8x8 grid with a 24 cm FOV [2048 points, 5000 Hz sweep width, 2000 ms TR, 2 averages, 55° flip angle at 51.3 MHz] in the sagittal plane. An 8 slice sagittal T1-Fluid Attenuated Inversion Recovery sequence [FLAIR; 2200 ms TR, 12 ms TE, 780 ms inversion time (TI), 24 cm FOV, 0.94x0.94 mm] was acquired with a 5 mm slice thickness at exactly the same position as the center of each 31P MRS slice for reference. The central 4 slices were co-registered to MRI by using the 8-slice concordant image set acquired at the time of MRS. Two peri-menopausal and one post-menopausal participants did not complete the 31P-MRS procedure due to technical issues.
18F-Fluoroestradiol PET imaging. 16α-[18F]fluoro-17β-estradiol (18F-FES) was prepared by the WCM PET Radiochemistry Group using established methods for synthesis and quality assurance55,56 [https://imaging.cancer.gov/programs_resources/cancer-tracer-synthesis-resources/FES_documentation.htm]. 18F-FES scans were acquired using a Siemens BioGraph mCT 64-slice PET/CT scanner [70 cm transverse FOV, 16.2 cm axial FOV] operating in 3D mode. All scans were performed after a 4-hour fasting to decrease biliary uptake. One hour before PET imaging, an antecubital venous catheter was positioned for tracer injection. No arterial blood sampling was performed. Participants lied down on the scanner bed with eyes closed and ears unplugged, in the quiet and dimly lit scan room. Following a low-dose CT scan, a dose of approximately 6 mCi (222 MBq) of 18F-FES was infused intravenously in a volume of 20 mL isotonic phosphate buffered saline containing less than 15% of ethanol by volume over 2 minutes. Dynamic imaging was performed for 90 minutes, and consisting of 30 frames: 4x15, 4x30, 3x60, 2x120, 5x240, 12x300 sec. All images were corrected for attenuation, scatter and radioactive decay.
18F-FES analysis
Regions of Interest
Target regions. While ERs are widely expressed throughout the brain, their density varies by isoform and region1,2. As 18F-FES selectively binds ERα, and tracer uptake in white matter is confounded by non-specific binding23,24, we focused on predominantly gray matter regions with high ERα expression. These included pituitary, hypothalamus, thalamus, hippocampus, amygdala, midbrain, striatum, cingulate, medial and orbitofrontal cortex28-31. These regions were carried into hypothesis testing. Additionally, we examined regions with lower ERα expression but known E2-related effects: superior, middle, inferior and precentral frontal cortex; precuneus; entorhinal cortex, parahippocampal gyrus; inferior, middle and superior temporal gyri; fusiform; superior and inferior parietal lobule28-31.
Reference region. We chose the cerebellum as the anatomical reference region based on evidence that it is generally void of ERα17,20-22. We then developed a probabilistic cluster-based cerebellar ROI, using the following procedures, detailed in Supplementary Figure 1: (i) to be maximally conservative, as ERβ and GPER-1 are expressed in the innermost portion of cerebellar white matter and adjacent gray matter (corresponding to human middle cerebellar peduncle, culmen, arbor vitae, dentate nucleus, and medullary cortex)17,20-22, the cerebellar ROI was restricted to the outermost portion of cerebellar crus II gray matter, which is generally free of ERs; (ii) voxel-based machine learning with intensive iterative data resampling implemented in NPAIRS (nonparametric prediction, activation, influence, and reproducibility resampling)57 identified the inferior portion of cerebellar crus II as showing invariant tracer uptake across menopause classes, which is a pre-requisite for data normalization48. The final cerebellar ROI is illustrated in Supplementary Figure 1.
Kinetic modeling and simplified reference-tissue analysis
18F-FES data were examined as distribution volume ratios (DVR) and standardized uptake value ratios (SUVR) using ROIs based upon anatomical labeling atlas (AAL3)58 ROIs restricted to grey matter using a smoothed gray segment image from each participant’s volumetric MRI. The pituitary ROI was manually delineated on the coregistered anatomical MRI by two expert raters (A.P. and L.M.) using a 1 cm radius sphere, according to published criteria59. ROI placement, thresholding, and sampling were conducted using PMOD v4.1 (PMOD Technologies).
Distribution volume ratios. ROIs were applied to motion-corrected dynamic PET images to obtain regional time activity curves (TAC, mCi/mL) of tissue radioactivity concentration across all slices sampled. Graphic reference-tissue Logan plots47 were used to estimate 18F-FES DVR (1+BP, binding potential) as the concentration of the radioligand in each target region relative to tracer concentration in the cerebellar reference ROI, as implemented in PMOD 4.1.
Standardized uptake value ratios. For comparisons with previous work21, and to assess whether brain ER expression could be estimated from a single, late-scan static PET image, summed PET images corresponding to 30-90, 30-60, and 50-70 minutes of 18F-FES data were converted to standardized uptake values [SUV = activity (Bq/g)/[injected activity (Bq)/body weight (g)]. SUVMax were extracted from pituitary ROI, and average SUV (SUVMean) from other regions, and normalized to cerebellar cortex ROI to obtain SUVR for each timeframe. SUVR are more suitable for clinical application.
Reliability analysis. DVR and SUVR values were compared in SPSS v.28 (IBM) using Intra-class Correlation Coefficients (ICC) with Cronbach’s Alpha as reporting criterion, p<0.05. SUVR50-70 measures yielded comparable estimates to DVR, with ICC=0.89 (SD 0.03), range 0.82 in medial frontal cortex to 0.93 in putamen (p’s<0.001; Supplementary Figure 1b, Supplementary Table 6). Across brain regions, the mean difference was 6%+3%, which was comparable between menopausal groups (pre-menopause: 4%+3%, peri-menopause 6%+3%, post-menopause 7%+3%), and was deemed physiologically acceptable. This time window is also consistent with previous studies and observations that 18F-FES returns to sub-physiologic levels within ~1 hour post-injection21,60. Subsequent analyses are obtained using SUVR ROI data and voxel-wise SUVR50-70 images generated for all participants.
Multiparametric Analysis
Image processing was performed using a semi-automated pipeline10-13. 18F-FES PET, ASL and MRS images were realigned to the corresponding T1-weighted MRI using the surface-fitting Normalized Mutual Information algorithm implemented in SPM1227 running on Matlab 2021 (MathWorks; Natick.MA) to enable accurate sampling.
Regions-of-interest. All ROIs except the manually delineated pituitary were quantified from MRI-coregistered 18F-FES images using the segmentation tools implemented in FreeSurfer 7.2 and Desikan-Killiany Atlas25,26 applied to the MRI. FreeSurfer was also used to derive total intracranial volume (TIV). Given its small size, the pituitary was only examined for 18F-FES effects. 31P-MRS data was processed via our proprietary XSOS written in IDL (Excelis Visual, Boulder, CO) using Hamming and Fermi k-space filters, a 7.5 mm center voxel shift, 20 Hz exponential filtering and zero-filling in time, x and y-domains prior to 3D Fast Fourier Transformation. A fixed first order phase of 4200° was applied to all spectra and data was automatically phased in zero order. The phospho-creatine (PCr) peak was set at 0.0 ppm and the central spectrum set as a reference, and susceptibility corrections performed throughout the data set. Baseline correction was applied to all other voxels in the chemical shift imaging (CSI) dataset by an experienced analyst (JPD). Peak area integration was performed around four well-resolved resonance peaks yielding PCr and total ATP (sum of α, β and γ-ATP moieties) expressed as percent area of the total phosphorous signal in the corresponding spectrum, and the PCr/ATP ratio (e.g. a marker of ATP re-synthesis) was computed. For 31P-MRS analysis, medial temporal lobe regions were averaged, while hypothalamus and accumbens were excluded, due to size considerations.
Voxel-based analysis. MRI scans were spatially normalized to the template-normalized tissue probabilistic map (TPM) image included in SPM12 conforming to the Montreal Neurological Institute (MNI) space, and processed using voxel-based morphometry (VBM) including image segmentation, Jacobian modulation, high-dimensional warping (DARTEL) of the segments, and application of an 8mm full-width at half maximum smoothing kernel27. Gray matter volume (GMV) segments were retained for statistical analysis. The MRI-coregistered 18F-FES SUVR and ASL scans were spatially normalized to the TPM image using MRI-derived subject-specific transformation matrices and smoothed at 10-mm FWHM27.
Covariates
Brain imaging analyses were adjusted by age and modality-specific confounders (cerebellar 18F-FES uptake; MRI total intracranial volume; global ASL blood flow; PCr/ATP). 18F-FES analyses were also adjusted by plasma E2 levels.
Statistical Analysis
Analyses were performed in SPSS v.25, R v.4.2.0 and SPM12. Clinical measures were examined with general linear models or chi-squared tests as appropriate. Our primary outcome was examination of 18F-FES SUVR (e.g. ER density) differences by menopause status. Additional outcomes included correlational analysis of 18F-FES SUVR with GMV, CBF, PCr/ATP, and menopause symptom scores.
ER density by menopause status
ROI analysis. Multivariate analysis of variance was used to test the statistical significance of the effect of menopause status (pre-, peri-, or post-menopausal) on 18F-FES SUVR in the set of target ROIs: pituitary, hypothalamus, accumbens, amygdala, hippocampus, cingulate, orbital and middle frontal gyrus. Post-hoc testing was performed to identify which pairwise comparisons in menopause status significantly differed across brain region measures. Regression models were constructed to obtain global P values for multivariate pair-wise outcomes across brain regions, followed by forest plot analysis for assessment of individual regions, at p<0.05. For completeness, SUVR in exploratory regions are provided in Supplementary Table 7 for descriptive purposes.
Voxel-based analysis. We used factorial models with post-hoc t-contrasts to test for 18F-FES differences between menopause groups, adjusting for age and cerebellar uptake. Statistical maps were constructed by applying a stringent voxel-level Gaussian random field theory–based threshold of p<0.05, cluster-level corrected for Family-Wise Type Error (FWE) within a binary masking image consisting of the full set of a priori defined regions used for sampling. Only clusters >16 voxels were considered significant. Anatomical location of significant clusters was described using Talairach coordinates after conversion from MNI space. Clusters of significant associations between ER density and menopause status were saved as binary masks (FESmask), and SUVR data were extracted from peak clusters using MarsBar 0.45 [https://marsbar-toolbox.github.io/download.html] for further analysis.
Prediction of menopause status: (a) Standardized ROI and VBA-derived cluster SUVR were examined for menopause status separation using Cohen’s d effect size. After adjustment by age, the standardized pairwise mean differences between any two levels were expressed as Cohen’s d coefficients, where d>0.8 reflects a large effect size. We used a conservative cut-off of 1.5 to identify the regions yielding the largest effect size in separating groups. (b) To gauge the degree to which ER density in these regions was predictive of menopause status, predictive models by means of multivariable logistic regressions were trained on a random 80% of the study sample, with 20% withheld as the testing set. Each model contained the binary outcome, pre- vs. post-menopause status. Our primary outcome was % accuracy in the testing set, defined as the proportion of correct predictions over total predictions. Global likelihood ratio tests were performed for each model at p<0.05.
Associations of ER density with brain biomarker measures
To characterize the relationship between ER density and additional imaging outcomes, correlation graphs are presented both for the overall study sample as well as by menopause status. The stratified analysis was performed to investigate hypothesized differences in the strength of the correlations throughout menopause. Stratification by menopause status also mitigates the effects of its confounding on the overall correlations.
ROI analysis. We used multiple linear regressions to test for associations of regional ER density with GMV, CBF, and PCr/ATP, adjusting by age and modality-specific confounders, at p<0.05.
Voxel-based analysis. We used multiple linear regressions to test for voxel-wise associations of ER density with GMV and CBF in regions impacted by menopause status, and within peak clusters identified in analysis of menopause status, adjusting for age and modality-specific confounders. The latter analysis was done via the inclusive masking option at the t-map generation step, which allowed us to test for associations of ER density with GMV and CBF within the FESmask image. For completeness, we used factorial analysis to test for voxel-wise associations of menopause status with GMV and CBF in the a priori masking image, thus independently of their associations with ER density. Results are reported in brain regions that survive the masking and the multiple comparisons adjustment.
Associations of ER density and menopause symptoms
We used a two-step approach to test for associations between ER density and menopause symptoms: (i) We computed voxel-wise linear regressionswith post-hoc t testsassessing associations between menopause symptom scores and ER density at p<0.05, cluster-level corrected within the masking image; (ii) we developed multivariable logistic regression models with menopause symptom occurrence as the binary outcome variable, and ER density as the exposure of interest, with age as a covariate. This analysis used both SUVR from peak clusters identified in (i) and target ROI. Analyses were performed across the entire cohort and within peri-menopausal and post-menopausal groups. Odds ratios (OR) were estimated, where a positive value denotes a positive association between ER density and presence of each menopause symptom. OR that exceeded 10 were capped at 10 to ensure visibility of positive associations < 10 in the figure.