Objectives: Ovarian cancer ranks first among gynecological cancers in terms of the mortality rate. Accurately diagnosing ovarian benign tumors and malignant tumors is of immense important. The goal of this paper is to combine group LASSO/SCAD/MCP penalized logistic regression with machine learning procedure to further improve the prediction accuracy to ovarian benign tumors and malignant tumors prediction problem.
Methods: We combine group LASSO/SCAD/MCP penalty with logistic regression, and propose group LASSO/SCAD/MCP penalized logistic regression to predict the benign and malignant ovarian cancer. Firstly, we select 349 ovarian cancer patients data and divide them into two sets: one is the training set for learning, and the other is the testing set for checking, and then choose 46 explanatory variables and divide them into 11 different groups. Secondly, we apply the training set and group coordinate descent algorithm to obtain group LASSO/SCAD/MCP estimator, and apply the testing set to compute confusion matrix, accuracy, sensitivity and specificity. Finally, we compare the prediction performance for group LASSO/SCAD/MCP penalized logistic regression with that for artificial neural network (ANN) and support vector machine (SVM).
Results: Group LASSO/SCAD/MCP/ penalized logistic regression selects 6/4/1 groups. The prediction accuracy and AUC for group MCP/SCAD/LASSO penalized logistic regression/SVM/ANN is 93.33%/85.71%/82.26%/74.29%/72.38% and 0.892/0.852/0.823/0.639/0.789, respectively.
Conclusions: Group MCP/SCAD/LASSO penalized logistic regression performs than SVM and ANN in terms of prediction accuracy and AUC. In particular, group MCP penalized logistic regression predicts the best. Therefore, we suggest group MCP penalized logistic regression to predict ovarian tumors.