Raman spectral analysis unveils metabolic differences in follicular fluid of PCOS
In this study, Raman spectra of 150 PCOS and 150 non-PCOS patients were obtained by analyzing follicular fluid samples. The fingerprint region from 600 cm− 1 to 1800 cm− 1 was calculated for analysis, which typically contained the most essential biological information [14]. In accordance with the previous study [20], the raw Raman spectra data of the FF samples from PCOS patients showed no significantly different standard deviation/ variation by compared to those of non-PCOS patients (Fig. 2A). Then, Raman spectra from all samples were averaged and subtracted from each sample at each wavenumber, and the mean values were assembled for each group to obtain mean-centered spectra. The results show that the mean-centered spectra are significantly different between the two groups (Fig. 2B). The corresponding Raman bands and their tentative assignments are summarized in Table 2. Some signatured bands can be assigned to phenylalanine (1,003 cm− 1), C-C, C-N stretching (protein) (1,156 cm− 1), β -carotene accumulation (C-C stretch mode) (1,516 cm− 1), and C = O (1668 cm− 1) [24].
Table 2 Peak assignments in the Raman spectra from follicular fluid of PCOS patients.
Peak (cm− 1)
|
Assignation
|
758
|
Tryptophan
Ethanolamine group
Phosphatidylethanolamine
|
852
|
Proline, hydroxyproline, tyrosine
Tyrosine ring breathing
Glycogen
|
936
|
C-C stretching mode of proline & valine & protein
backbone (α-helix conformation)/glycogen (protein assignment)
P(CH3) terminal, proline, valine + ν(CC) α -helix keratin (protein assignment)
|
1003
|
Phenylalanine, C-C skeletal
|
1156
|
C-C, C-N stretching (protein)
|
1516
|
β-carotene accumulation (C-C stretch mode)
|
1518
|
v (C = C), porphyrin
Carotenoid peaks due to C-C & conjugated C = C band stretch
|
1590
|
Carbon particles
|
1627
|
Cα=Cα stretch
Amide C = O stretching absorption for the β-form polypeptide films
|
1668
|
Carbonyl stretch (C = O)
Cholesterol ester
|
Unsupervised PCA was then applied to all of the spectra to reduce the high-dimensional Raman dataset and transform data into appropriate variables that confered biologic information. A spectral range of 600-1,800 cm− 1 was used to minimize the effects of uneven baseline. As shown in Fig. 2C, PCOS and non-PCOS have two clusters with some overlap. The overlap can be explained by the close similarity of follicular fluid between PCOS and non-PCOS patients. Herein, we sought to find biomarkers which can best separate the two groups and provide diagnostic advice for PCOS. Because the difference between groups was mostly significant along dimension 1, a loading plot was made to assign each Raman wavenumber to indicate significant Raman bands along dimension 1. Two Raman zones were identified for describing the largest variances between the two datasets: 993-1,165 cm− 1 and 1,439-1,678 cm− 1 (Fig. 2D). The quantification of metabolite concentration between PCOS and non-PCOS groups were presented at box plots by integrating individual Raman bands at the two Raman zones (993-1,165 cm− 1 and 1,439-1,678cm− 1). The results showed that the quantification of integration at 993-1,165 cm− 1 was found to be higher in PCOS FF samples (0.0286 ± 0.001) compared with non-PCOS FF samples (0.0274 ± 0.001; P < 0.001). And, the quantification of the integration at 1,439-1,678cm− 1, on the other hand, showed a higher content in non-PCOS FF samples (0.0337 ± 0.001) than in PCOS FF samples (0.0319 ± 0.001; P < 0.001) (Fig. 3). All the results suggest that there are real different metabolic patterns in follicular fluid of PCOS patients.
Special Raman spectral as biomarkers for predicting oocyte development and IVF outcome in PCOS patients
As the follicular fluid is the direct environment of oocyte, the changes of metabolism in FF will affect oocyte development. In order to investigate whether the Raman spectra were predictive to the oocyte development, the special Raman spectra were compared between two subgroups of the transferrable PCOS blastocysts (n = 150), according to their morphological scores: (1) high-quality blastocysts group (HQ, ≥ 4BB) (n = 75), and (2) low-quality blastocysts group (LQ, ≥3BC and ≤ 4BC) (n = 75). The results showed that there were significant differences in two Raman positions (993-1,165 cm− 1 and 1,439-1,678cm− 1) between HQ and LQ groups (P < 0.001) (Fig. 4). In details, the quantification of the integration at 993-1,165 cm− 1 was lower in HQ group (0.0316 ± 0.001) compared with LQ group (0.0328 ± 0.001; P = 1.07 ×10− 5). On the other hand, the quantification of the integration at 1,439-1,678cm− 1 showed a higher content in HQ group (0.0292 ± 0.001) than LQ group (0.0279 ± 0.001; P = 1.92 ×10− 6).
Embryo quality is a key factor affecting pregnancy outcome. In this study, we further analyzed the relationship of the special Raman spectra and IVF outcome in PCOS. By tracking the pregnancy outcome of each transferred blastocyst, the 150 Raman spectra of PCOS samples were divided into pregnancy success group (n = 85) and pregnancy failure group (n = 65). As shown in Fig. 5, the quantification of Raman spectra (993-1,165 cm− 1 and 1,439-1,678cm− 1) are significantly different between pregnancy success and pregnancy failure groups (P < 0.001). In line with the trend of the same two Raman zones correlating with blastocyst development, the quantification of the integration at 993-1,165 cm− 1 was lower in pregnancy success group (0.0316 ± 0.001) compared with pregnancy failure group (0.0335 ± 0.002; P = 1.36 × 10− 5), and the quantification of the integration at 1,439-1,678cm− 1 was higher in pregnancy success group (0.0289 ± 0.001) than that in pregnancy failure group (0.0278 ± 0.001; P = 1.98 × 10− 6).
Machine-learning models based on Raman spectra classify blastocysts development and pregnancy outcome with high performance in PCOS
In this study, based on the 150 Raman spectra of PCOS follicular fluid samples, the two fully connected ANN classification models were computed to predict the blastocyst development and clinical pregnancy, respectively. The spectra of different subgroups (HQ or LQ blastocyst; pregnancy success or pregnancy failure) were split into training set and the testing set in a ratio of 4:1. The training set was used to train a classification model and the testing set was used to evaluate the model performance. Specifically, for the classification model to predict blastocyst development, Raman spectra of 150 PCOS samples were divided into high-quality blastocysts (HQ) (n = 75) and low-quality blastocysts (LQ) (n = 75) groups. Of them, Raman spectra of 100 samples (50 high-quality blastocysts and 50 low-quality blastocysts) were used to train a classification model and the remaining 50 spectra (25 high-quality blastocysts and 25 low-quality blastocysts) were input as the testing set to evaluate the model performance. At the same time, to construct the classification model to predict the IVF outcome, the same Raman spectra of 150 PCOS samples were divided into pregnancy success (n = 85) and pregnancy failure (n = 65) groups. Of them, 100 spectra (50 pregnancy success and 50 pregnancy failure) were used to train a classification model and the remaining 50 spectra (35 pregnancy success and 15 pregnancy failure) were input as the testing set to evaluate the model performance.
The results of ANN models were presented in Table 3. As shown in Table 3A, 23 out of 25 high-quality blastocysts (F1 score 0.9020) and 22 out of 25 low-quality blastocysts (F1 score 0.8980) were assigned correctly. For classifying clinical pregnancy outcome (Table 3B), the ANN model was able to correctly assign 25 out of 35 pregnancy success spectra and 12 out 15 pregnancy failure spectra, with F1 scores of 0.7937 and 0.6486, respectively. Noticeably, the accuracy of the ANN model for predicting the blastocysts development is 90.00%, while 74.00% for predicting the IVF outcome. As shown in Fig. 6, ROC analysis resulted in AUCs of 0.88 for the blastocysts developmental ANN model (Fig. 6A) and 0.75 for the ANN model predicting the IVF outcome (Fig. 6B).
Table 3 Confusion matrix and performance evaluation of ANN classification models for an independent testing set of 50 Raman spectra.
A
|
Confusion matrix
|
Performance evaluation
|
Sample count
|
Predicted
HQ- blastocysts
|
Predicted
LQ- blastocysts
|
Precision
|
Sensitivity
|
F1 score
|
Accuracy
|
Actual HQ- blastocysts
|
23
|
2
|
88.46%
|
92.00%
|
0.9020
|
90.00%
|
Actual LQ- blastocysts
|
3
|
22
|
91.67%
|
88.00%
|
0.8980
|
B
|
Confusion matrix
|
Performance evaluation
|
Sample count
|
Predicted pregnancy success
|
Predicted pregnancy failure
|
Precision
|
Sensitivity
|
F1 score
|
Accuracy
|
Actual pregnancy success
|
25
|
10
|
89.29%
|
71.43%
|
0.7937
|
74.00%
|
Actual pregnancy failure
|
3
|
12
|
54.55%
|
80.00%
|
0.6486
|
Note: Models were trained from a training set of 100 Raman spectra.
ANN is artificial neural network.