DOI: https://doi.org/10.21203/rs.3.rs-762131/v1
Objective
To develop a model that uses hematological indexes and clinical characteristics to help estimate the probability of endometriosis in patients with ovarian cysts.
Methods
A retrospective study was conducted on 2242 patients who underwent surgery for benign ovarian cysts from January 2008 to November 2016. Variables included in the model were serum tumor markers, routine blood test results, age, BMI, reproductive history, history of hysteroscopy, and menstrual history. Logistic regression was used to construct a predictive model for endometriosis, and receiver operating characteristic (ROC) curves and the area under the curve (AUC) were used to assess the model's validity. Ten-fold cross-validation was used as internal validation to evaluate the prediction accuracy of the model. The normalized mean square error (NMSE) was calculated to compare the reliability of different models.
Results
A total of 978 (43.6%) patients with endometriosis were included in the study. Univariate analysis showed that age, BMI, delivery, dysmenorrhea, menstrual cycle, duration of menstrual flow, history of hysteroscopy, CA-125 and CA-19-9 were associated with endometriosis (P < 0.001). The area under the receiver operating characteristic curve for the model with CA-125 alone was 0.888, with a sensitivity of 81.6% and a specificity of 83.5%. After adjustment for multiple covariates, including age, mature delivery, irregular menstruation, dysmenorrhea, menstrual period, history of hysteroscopy, CEA, CA-19-9, monocyte count and platelet count, the resulting model had an AUC of 0.916, with a sensitivity of 84.9% and a specificity of 86.4%.
Conclusions
The diagnostic prediction model can be used as a framework for potential improvement in the diagnosis of endometriosis in patients with ovarian cysts.
Endometriosis is one of the most common benign gynecological disorders, occurring in 6 to 10% of the general female population [1]. It is defined as the development of endometrial tissue (gland and stroma) outside the uterus, such as in the ovaries, pelvic peritoneum, and rectovaginal space. Endometriosis is a recurring, persistent disease that causes non-menstrual pelvic pain, dyspareunia, infertility, dysmenorrhea, and menstrual irregularities [2]. It is regarded as a disease of complex multifactorial etiology; among the hypotheses, transplantation of endometrial tissue via retrograde menstruation is widely accepted [3]. The correlation between symptoms and lesions is poorly understood because the symptoms are nonspecific and not diagnostic [4].
The diagnosis of endometriosis is based on clinical manifestations and imaging techniques [5], but a confirmed diagnosis can only be obtained by invasive procedures, such as direct visualization of peritoneal and ovarian implants by laparoscopy or laparotomy followed by histological analysis [6]. To increase diagnostic accuracy, and especially to avoid invasive management, some investigators have begun to characterize the factors that contribute to the detection of endometriosis. There is evidence that family history and immunological, menstrual and reproductive factors are associated with endometriosis [7], and different combinations of biomarkers have been studied to increase the diagnostic accuracy for this disease [8–10]. Combining multiple factors to improve the accuracy of diagnosis of endometriosis is therefore necessary.
The novel model was built by combining patients' hematological indexes and clinical characteristics in a multivariate regression model, which could help us to recognize which ovarian cysts should be biopsied so that endometriosis can be diagnosed at an early stage.
Retrospective data were collected from the Second Affiliated Hospital of Wenzhou Medical University. A total of 2242 premenopausal women who underwent either laparoscopic or laparotomic surgery in the Gynecology Department of our hospital from January 2008 to November 2016 were included in this study. Patients who underwent surgery for ovarian endometriosis that was confirmed by histopathological examination of the surgical specimen were eligible for our study. The exclusion criteria were as follows: history of hormonal therapy for endometriosis, pregnancy, abnormal hepatic or renal function tests, pelvic inflammatory disease, pathologically confirmed or clinically diagnosed leiomyoma or adenomyosis, acute infection or history of chronic inflammatory disease, immune system disease, or malignancy.
The data were obtained by reviewing the patients' medical records. All patients underwent routine preoperative laboratory studies, including CA-125, CA-19-9, carcinoembryonic antigen (CEA) and a complete blood count. The patient-related factors retrieved for each patient included endometriosis-related symptoms, age, body mass index (BMI), reproductive history (mature delivery, premature delivery, abortion), menstrual history (menstrual cycle, duration of menstrual flow), surgical history (laparotomy, laparoscopy or hysteroscopy) and the histopathological diagnosis following surgery.
Most patients in the study had preoperative ultrasonographic evaluations. We did not include the ultrasonographic diagnosis of cysts in the study because the sonographic evaluations were performed by different sonographers using different transducers and types of ultrasound equipment.
Patient characteristics were compared using the t-test (for continuous variables), the chi-square test (for dichotomous variables) or the Kruskal-Wallis test (for continuous variables with a skewed distribution) [11]. Logistic regression was used for multivariate analysis, with forward selection of variables [12]. Ten-fold cross-validation was used for internal validation of the model's predictive accuracy. We randomly divided the data set into 10 parts; 9 of them served as the training set used to build the prediction model, and the remaining part served as the validation set on which predictions were made. This process was repeated 10 times, ultimately yielding a complete set of predictions. The predicted values were used to construct a cross-validated receiver operating characteristic (ROC) curve. The 10 fitted models also gave 10 normalized mean square errors (NMSE) on the validation sets, and the average NMSE was used to compare the reliability of the models: the smaller the NMSE, the more reliable the model. The DeLong method was used to compare ROC curves, with P < 0.05 indicating a statistically significant difference. The Hosmer-Lemeshow goodness-of-fit test was used to assess the calibration of the model [13]. The model was evaluated with several indicators, including sensitivity, specificity, area under the curve (AUC), and the Youden index, where Youden index = sensitivity − (1 − specificity). SPSS 13.0 was used for statistical analysis, and the pROC package in R 3.2.3 was used to draw and compare the ROC curves. A two-sided P < 0.05 was considered statistically significant.
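The cross-validation procedure described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the SPSS/R workflow used in the study: the NMSE is taken here to be the squared prediction error divided by the squared error of always predicting the mean outcome (the text does not give its formula), the folds are stratified by outcome so that every validation fold contains both classes (the text only says the split was random), and the model, data and all function names are hypothetical stand-ins.

```python
import random
from statistics import mean


def nmse(y_true, y_pred):
    """Squared error relative to always predicting the mean outcome (assumed definition)."""
    y_bar = mean(y_true)
    num = sum((y - p) ** 2 for y, p in zip(y_true, y_pred))
    den = sum((y - y_bar) ** 2 for y in y_true)
    return num / den


def stratified_folds(y, k=10, seed=0):
    """Shuffle each outcome class and deal its members round-robin into k folds."""
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for label in sorted(set(y)):
        members = [i for i, v in enumerate(y) if v == label]
        rng.shuffle(members)
        for pos, i in enumerate(members):
            folds[pos % k].append(i)
    return folds


def cross_validate(X, y, fit, predict, k=10, seed=0):
    """Return pooled out-of-fold predictions and the mean NMSE over the k folds."""
    pooled = [None] * len(y)
    fold_scores = []
    for held_out in stratified_folds(y, k, seed):
        held = set(held_out)
        train = [i for i in range(len(y)) if i not in held]
        model = fit([X[i] for i in train], [y[i] for i in train])
        preds = predict(model, [X[i] for i in held_out])
        for i, p in zip(held_out, preds):
            pooled[i] = p  # each case is predicted exactly once, by a model that never saw it
        fold_scores.append(nmse([y[i] for i in held_out], preds))
    return pooled, mean(fold_scores)


# Hypothetical stand-in model: predicted risk is the outcome rate within each level of one binary feature.
def fit_group_rate(X, y):
    return {g: mean([yi for xi, yi in zip(X, y) if xi == g]) for g in set(X)}


def predict_group_rate(model, X):
    return [model[x] for x in X]


# Synthetic data (not from the study): outcome rate 0.8 when x == 1 and 0.2 when x == 0.
X = [i % 2 for i in range(100)]
y = [int(i % 10 != 1) if i % 2 else int(i % 10 == 0) for i in range(100)]
pooled, mean_nmse = cross_validate(X, y, fit_group_rate, predict_group_rate)
print(f"mean NMSE over 10 folds: {mean_nmse:.3f}")
```

The pooled out-of-fold predictions are what would feed a cross-validated ROC curve, while the per-fold NMSE values are averaged to give the single figure used to compare models.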
A total of 2242 women were enrolled in the study; their ages ranged from 18 to 46 years, with a mean of 31.37 years. Of these, 978 patients had endometriosis; the remaining patients comprised 860 with mature teratoma, 145 with serous cystadenoma, 183 with mucinous cystadenoma, and 76 with other benign conditions. The baseline demographic, clinical, surgical, and laboratory features of patients with and without endometriosis are shown in Table 1.
Variables | Non-endometriosis (n = 1264) | Endometriosis (n = 978) | Statistic | P-value |
---|---|---|---|---|
Age (years) | 30.94 ± 6.816 | 31.92 ± 5.381 | t = -3.709 | < 0.001 |
BMI (kg/m²) | 21.48 ± 2.982 | 21.01 ± 2.902 | t = 3.701 | < 0.001 |
Mature delivery (n) | 1 (0–5) | 1 (0–5) | Z = -4.459 | < 0.001 |
Premature delivery (n) | 0 (0–2) | 0 (0–1) | Z = -1.306 | 0.192 |
Abortion (n) | 1 (0–12) | 1 (0–10) | Z = -0.222 | 0.824 |
Menstrual cycle (days) | 30.77 ± 5.553 | 29.54 ± 3.234 | t = 6.156 | < 0.001 |
Duration of menstrual flow (days) | 5.39 ± 1.395 | 5.81 ± 1.534 | t = -6.860 | < 0.001 |
Irregular menstruation | 128 (10.1%) | 35 (3.6%) | χ² = 35.056 | < 0.001 |
Dysmenorrhea | 254 (20.1%) | 358 (36.6%) | χ² = 75.735 | < 0.001 |
Hyperlipidemia | 87 (6.9%) | 55 (5.6%) | χ² = 1.474 | 0.256 |
History of caesarean | 204 (16.1%) | 181 (18.5%) | χ² = 2.174 | 0.143 |
History of laparotomy | 375 (29.7%) | 253 (25.9%) | χ² = 3.946 | 0.052 |
History of laparoscopy | 35 (2.8%) | 35 (3.6%) | χ² = 1.195 | 0.274 |
History of hysteroscopy | 4 (0.3%) | 25 (2.6%) | χ² = 21.665 | < 0.001 |
CEA (U/mL) | 1.28 (0–95.32) | 1.12 (0–4.28) | Z = -7.096 | < 0.001 |
CA-125 (U/mL) | 15.54 (3.10–469.10) | 45.35 (4.80–660.40) | Z = -31.622 | < 0.001 |
CA-19-9 (U/mL) | 19.13 (0–1147.23) | 31.84 (0–1128.52) | Z = -6.854 | < 0.001 |
Neutrophil count (×10³/mm³) | 3.96 (1.16–19.96) | 3.86 (1.25–14.40) | Z = -0.246 | 0.806 |
Lymphocyte count (×10³/mm³) | 1.83 (0.31–4.45) | 1.76 (0.25–4.69) | Z = -3.253 | 0.001 |
NLR | 2.17 (0.56–31.26) | 2.22 (0.65–21.98) | Z = -1.762 | 0.078 |
Monocyte count (×10³/mm³) | 0.42 (0.09–1.46) | 0.42 (0.11–1.50) | Z = -1.758 | 0.079 |
Platelet count (×10³/mm³) | 211.00 (72.00–516.00) | 213.00 (61.00–418.00) | Z = -0.785 | 0.443 |
MPV | 10.70 (6.90–14.50) | 10.70 (6.40–14.40) | Z = -0.812 | 0.417 |
Hemoglobin (g/L) | 120.00 (62.00–156.00) | 117.00 (65.00–149.00) | Z = -5.849 | < 0.001 |
MCV | 88.45 (56.80–103.40) | 88.30 (60.20–99.20) | Z = -1.146 | 0.252 |
MCH | 29.80 (18.00–35.50) | 29.60 (18.20–34.40) | Z = -3.337 | 0.001 |
MCHC | 335.00 (280.00–368.00) | 333.00 (288.00–363.00) | Z = -4.097 | < 0.001 |
Abbreviations: BMI, body mass index; MPV, mean platelet volume; NLR, neutrophil-to-lymphocyte ratio; MCV, mean corpuscular volume; MCH, mean corpuscular hemoglobin; MCHC, mean corpuscular hemoglobin concentration. P < 0.05 is considered statistically significant. |
Univariate analysis of clinicopathological features showed no statistically significant association between endometriosis and premature delivery (P = 0.192), abortion (P = 0.824), hyperlipidemia (P = 0.256), history of caesarean section (P = 0.143), history of laparotomy (P = 0.052), history of laparoscopy (P = 0.274), neutrophil count (P = 0.806), NLR (P = 0.078), monocyte count (P = 0.079), platelet count (P = 0.443), MPV (P = 0.417) or MCV (P = 0.252). In contrast, age, BMI, mature delivery, menstrual cycle, duration of menstrual flow, irregular menstruation, dysmenorrhea, history of hysteroscopy, CEA, CA-125, CA-19-9, hemoglobin and MCHC (all P < 0.001), as well as lymphocyte count (P = 0.001) and MCH (P = 0.001), were significantly associated with endometriosis (Table 1).
We entered 16 variables into the multivariate logistic regression analysis, and 11 variables were retained in the final logistic regression model (Table 2). Log(CA-125), dysmenorrhea, history of hysteroscopy, age and mature delivery remained significant variables associated with endometriosis. The Hosmer-Lemeshow test indicated an adequate goodness-of-fit for the regression model (P > 0.05). The association between patient characteristics and the diagnosis of endometriosis is shown in Table 3; all the models (model 1 to model 5) were significantly associated with endometriosis. A ROC curve was constructed for the model with CA-125 alone (Fig. 1); its area under the curve was 0.888 (P < 0.001), with a sensitivity of 81.6% and a specificity of 83.5%. These results indicate a moderate predictive performance, and adjustment for multiple other covariates gave a small increase in diagnostic accuracy. Model 5, which added other variables (clinical characteristics and hematological indexes) to the CA-125-based probabilistic model, showed an AUC of 0.916 (P < 0.001), a sensitivity of 0.849 and a specificity of 0.864 (Fig. 2). The Hosmer-Lemeshow test for the combined model 5 (P = 0.060) indicated a good fit. The variables in this model included clinical characteristics (dysmenorrhea, irregular menstruation) and laboratory characteristics (platelet count, monocyte count, CA-19-9, CA-125, CEA).
Variables | β | S.E. | Wald | OR | 95% CI | P-value |
---|---|---|---|---|---|---|
Age | 0.126 | 0.013 | 87.396 | 1.134 | 1.105–1.165 | < 0.001 |
Mature delivery | -0.647 | 0.110 | 34.303 | 0.524 | 0.422–0.650 | < 0.001 |
Irregular menstruation | 0.955 | 0.276 | 11.941 | 2.600 | 1.512–4.469 | < 0.001 |
Dysmenorrhea | 0.569 | 0.142 | 16.051 | 1.767 | 1.337–2.334 | < 0.001 |
Menstrual period | 0.259 | 0.047 | 30.897 | 1.296 | 1.183–1.420 | < 0.001 |
History of hysteroscopy | 2.017 | 0.696 | 8.395 | 7.518 | 1.921–29.430 | < 0.001 |
CEA | -0.878 | 0.104 | 71.874 | 0.415 | 0.339–0.509 | < 0.001 |
CA-19-9 | -0.003 | 0.001 | 12.340 | 0.997 | 0.996–0.999 | < 0.001 |
Monocyte count | -1.386 | 0.383 | 13.116 | 0.250 | 0.118–0.530 | < 0.001 |
Platelet count | -0.003 | 0.001 | 5.351 | 0.997 | 0.995–1.000 | < 0.001 |
Log(CA-125) | 7.059 | 0.307 | 528.6 | 1162.7 | 637–2122.4 | |
Constant | -13.619 | 0.803 | 287.9 | 0.000 | | |
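
The odds ratios and confidence intervals in the regression table are related to the coefficients by OR = exp(β) and 95% CI = exp(β ± 1.96 × S.E.). The short Python sketch below recomputes them for a few rows as a reading aid; it assumes a Wald-type interval with z = 1.96, which the text does not state explicitly, and the function and variable names are illustrative only. Small differences from the printed values are expected because the printed β and S.E. are rounded.

```python
import math

# (beta, standard error, OR as printed) for a few rows of the regression table
TABLE2_ROWS = {
    "Age": (0.126, 0.013, 1.134),
    "Mature delivery": (-0.647, 0.110, 0.524),
    "Dysmenorrhea": (0.569, 0.142, 1.767),
    "History of hysteroscopy": (2.017, 0.696, 7.518),
}


def odds_ratio_with_ci(beta, se, z=1.96):
    """Return (OR, lower, upper), assuming a Wald interval on the log-odds scale."""
    return math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se)


results = {
    name: odds_ratio_with_ci(beta, se) for name, (beta, se, _) in TABLE2_ROWS.items()
}

for name, (odds_ratio, lower, upper) in results.items():
    printed = TABLE2_ROWS[name][2]
    print(f"{name}: OR = {odds_ratio:.3f} (printed {printed}), 95% CI {lower:.3f}-{upper:.3f}")
```

An OR below 1 (for example, mature delivery) corresponds to a negative coefficient and lower odds of endometriosis per unit increase, while an OR above 1 (for example, dysmenorrhea) corresponds to higher odds.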
Models | H-L test | Z | P-value | AUC | Sensitivity | Specificity | Youden index | NMSE |
---|---|---|---|---|---|---|---|---|
Model 1a | < 0.001 | 7.167 | < 0.001 | 0.888 | 0.816 | 0.835 | 0.651 | 0.556 |
Model 2b | < 0.001 | 6.666 | < 0.001 | 0.891 | 0.843 | 0.816 | 0.659 | 0.545 |
Model 3c | < 0.001 | 6.320 | < 0.001 | 0.899 | 0.836 | 0.833 | 0.669 | 0.511 |
Model 4d | < 0.001 | 4.191 | < 0.001 | 0.911 | 0.885 | 0.813 | 0.698 | 0.477 |
Model 5e | 0.060 | Reference | Reference | 0.919 | 0.849 | 0.864 | 0.713 | 0.441 |
a: only CA-125 in the model; b: variables in model 1 plus age; c: variables in model 2 plus reproductive history; d: variables in model 3 plus mature delivery, history of hysteroscopy and clinical characteristics; e: variables in model 4, with CA-125 replaced by log(CA-125). |
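
The Youden index column of the model comparison table can be reproduced from the sensitivity and specificity columns, since Youden index = sensitivity − (1 − specificity) = sensitivity + specificity − 1. The Python snippet below is a small consistency check on the printed values; the dictionary and function names are illustrative only.

```python
# (sensitivity, specificity, Youden index as printed) for each model in the comparison table
TABLE3_ROWS = {
    "Model 1": (0.816, 0.835, 0.651),
    "Model 2": (0.843, 0.816, 0.659),
    "Model 3": (0.836, 0.833, 0.669),
    "Model 4": (0.885, 0.813, 0.698),
    "Model 5": (0.849, 0.864, 0.713),
}


def youden_index(sensitivity, specificity):
    """Youden index J = sensitivity - (1 - specificity)."""
    return sensitivity - (1 - specificity)


recomputed = {
    name: youden_index(sens, spec) for name, (sens, spec, _) in TABLE3_ROWS.items()
}

for name, j in recomputed.items():
    print(f"{name}: J = {j:.3f} (printed {TABLE3_ROWS[name][2]})")
```

A higher index means a better combined sensitivity and specificity at the chosen cut-off, which is why model 5 ranks first on this column.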
Endometriosis can only be definitively diagnosed by invasive procedures such as laparoscopic or laparotomic exploration. We constructed a non-invasive predictive model based on medical history and hematological indexes (routine blood tests and serum tumor markers) that can help predict endometriosis in patients with ovarian cysts. We found associations between endometriosis and CA-125, CA-19-9, age, mature delivery, menstrual history, history of hysteroscopy, dysmenorrhea and routine blood test results, but no single characteristic predicted endometriosis with high accuracy. Our study is consistent with the retrograde menstruation theory, because a history of hysteroscopy was associated with an increased risk of endometriosis.
Our study supports the view that an increased frequency and duration of menstruation are associated with endometriosis [14, 15]. Dysmenorrhea has been reported as the main symptom in infertile women with endometriosis (46.92%), and the mechanism of dysmenorrhea in endometriosis lies in increased production of prostaglandins (PGs) [16]. Moreover, BMI showed a negative correlation with the presence of endometriosis, as reported previously [17]. Obesity is often associated with long menstrual cycles, a factor that reduces the risk of endometriosis. The reduced frequency of menstrual episodes is thought to counterbalance the relative hyperestrogenism of these women [18].
Endometriosis is rare before menarche and tends to decrease after menopause. Studies conducted in women under 45 years of age suggest that the frequency of endometriosis increases with age until menopause [19]. Fuldeore, M. J., et al. [2] reported that the average age of women with endometriosis in their study was 37.8 years, compared with 33.8 years for women without endometriosis (P < 0.0001). It is possible that the incidence of endometriosis increases with age because of the hormonal changes that occur during perimenopause [20].
Screening of patients with clinical suspicion of endometriosis is often based on serum CA-125, whose value has been confirmed in many studies. Shen, A., et al. [21] reported that endometriosis is significantly associated with elevated serum CA-125 concentrations, confirming CA-125 as an auxiliary biological marker in the diagnosis of endometriosis. Some studies [22, 23] did not agree with this finding and showed that a diagnosis of endometriosis based on CA-125 alone is not accurate, mainly because of its sensitivity. Hirsch, M., et al. [23] reported that CA-125 with a cut-off of ≥ 30 U/mL has a sensitivity of 0.57, which does not meet the criteria for a triage test, and international guidelines do not recommend CA-125 testing in women with suspected endometriosis [24]. However, in our study, model 1, which consisted of CA-125 alone, predicted endometriosis with high sensitivity (81.6%) and predicted the absence of endometriosis with a specificity of 83.5%. Nevertheless, because of the retrospective design, the timing of blood collection for CA-125 was not controlled, and the phase of the menstrual cycle is known to affect this test [25].
The study shows an inverse association between the number of mature deliveries and endometriosis, but no association between the number of abortions and endometriosis. This has also been observed in other studies of endometriosis [14, 26]. Parazzini, F., et al. [26] reported that the risk of endometriosis decreased with increasing number of births; compared with nulliparous women, the OR of endometriosis at stage 1 was 0.1 (95% CI 0.1, 0.2) in women reporting two or more births was respectively 0.1 (95% CI 0.1, 0.3), 0.2 (95% CI 0.1, 0.4).
It has been reported [27] that reproductive history may influence the hormonal milieu: estradiol levels are higher among nulliparous women than among parous women, whereas androgen levels show the opposite pattern, and reproductive history may influence the volume of endometrial cells released into the peritoneal cavity. Other studies have shown that CA-19-9 can be used to discriminate between patients with and without endometriosis and that it correlates with the severity of the disease; their results showed that CA-19-9 was significantly associated with advanced-stage (stage III and IV) endometriosis [10]. Our results are in concordance with a former study in which the mean level of CA-19-9 was significantly elevated compared with the control group.
Endometriosis is associated with increased inflammatory activity, which is an important stimulant for platelets [28], suggesting that platelet indices are important and easily obtained hematological parameters that can be useful in the evaluation of endometriosis [29, 30]. Evsen, M. S., et al. reported that platelet counts in patients with peritoneal endometriosis were higher than in the control group (P = 0.038) [30], and that the difference was more apparent in advanced-stage peritoneal endometriosis. Monocytes have also been implicated as a prognostic factor of the inflammatory response, but there is no evidence that monocyte count is associated with endometriosis. Our study shows monocyte count to be a protective factor for endometriosis, and further studies are needed to confirm this finding. This model provides guidance for the confirmation of endometriosis. CA-125 can be useful in directing the diagnosis of the disease, and adding clinical history, tumor markers and routine blood tests makes the diagnosis of endometriosis more accurate. For instance, a perimenopausal woman with an extensive reproductive history and irregular menstruation has a higher chance of having endometriosis, and if her CA-125 is high, biopsy of the ovarian cysts would be appropriate to confirm the presence of the disease.
In this study, we found that CA-125, clinical history, tumor markers and routine blood tests are predictors of endometriosis. Our model can contribute to the diagnosis of endometriosis in patients with ovarian cysts.
Abbreviation | Full name |
---|---|
CEA | carcinoembryonic antigen |
BMI | body mass index |
NMSE | normalized mean square error |
AUC | area under the curve |
ROC | receiver operating characteristic |
Ethics approval and consent to participate
Ethics approval was obtained from Second Affiliated Hospital of Wenzhou Medical University Ethics Committee, and informed consent was obtained from individual participants prior to surgery.
Consent for publication
Not applicable
Availability of data and material
The datasets used or analysed during the current study are available from the corresponding author on reasonable request.
Competing interests
We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.
Funding
No external funding was used in the conduct of this study.
Authors' contributions
Junmiao Xiang contributed to the conception of the study.
Ping Duan contributed significantly to analysis and manuscript preparation.
Wei Shen performed the data analyses.
Zongwen Liang and Qiong Zhang helped perform the analysis with constructive discussions.
Acknowledgements
I would like to express my gratitude to all those who helped me during the writing of this paper. My deepest gratitude goes first and foremost to Professor Ping Duan, my supervisor, for her constant encouragement and guidance. I also gratefully acknowledge the support of the Second Affiliated Hospital of Wenzhou Medical University.