3.1 Baseline Characteristics
From 1 June 2016 to 31 December 2019, 97 patients were enrolled in our hospital as the training set. Two other hospitals enrolled 56 patients in the validation cohort. The detailed baseline characteristics included sex, age, ECOG score, Child–Pugh score, tumour number, maximum tumour diameter, total tumour diameter, tumour burden, type of PVTT, alpha-fetoprotein (AFP) level, and blood test results. There were no statistically significant differences in the baseline characteristics of patients in the training and validation sets (Table 1).
Table 1. Baseline characteristics of patients in training set and validation set
Variable
|
Level
|
Training set
|
Validation set
|
P-value
|
N=97
|
N=56
|
|
Sex
|
Male
|
85 (87.6)
|
49 (87.5)
|
1.000
|
Female
|
12 (12.4)
|
7 (12.5)
|
Age
|
|
57.21±10.99
|
58.96±9.76
|
0.323
|
ECOG
|
0
|
39 (40.2)
|
27 (48.2)
|
0.427
|
1
|
58 (59.8)
|
29 (51.8)
|
Child–Pugh Score
|
|
5.87±0.82
|
5.82±0.83
|
0.749
|
Number
|
1
|
45 (46.4)
|
22 (39.3)
|
0.317
|
2–4
|
40 (41.2)
|
22 (39.3)
|
≥5
|
12 (12.4)
|
12 (21.4)
|
Maximum tumour diameter
|
<60 mm
|
60 (61.9)
|
36 (64.3)
|
0.900
|
≥60 mm
|
37 (38.1)
|
20 (35.7)
|
Total tumour diameter
|
<100mm
|
69 (71.1)
|
45 (80.4)
|
0.285
|
≥100 mm
|
28 (28.9)
|
11 (19.6)
|
Tumour burden
|
|
7.71 (3.09)
|
8.29 (2.26)
|
0.226
|
Position
|
Right/Left
|
50 (51.5)
|
22 (39.3)
|
0.195
|
Biloba
|
47 (48.5)
|
34 (60.7)
|
PVTT
|
IIa
|
56 (57.7)
|
35 (62.5)
|
0.683
|
IIb
|
41 (42.3)
|
21 (37.5)
|
Preoperative AFP
|
< 400 ng/mL
|
58 (59.8)
|
35 (62.5)
|
0.874
|
≥400 ng/mL
|
39 (40.2)
|
21 (37.5)
|
PLT (×109/L)
|
|
146.97±39.86
|
145.79±34.97
|
0.854
|
PT (s)
|
|
13.12±1.56
|
13.19±1.36
|
0.779
|
TBL (umol/L)
|
|
23.69±15.96
|
20.33±13.27
|
0.186
|
ALB (g/L)
|
|
38.52±5.96
|
37.99±4.37
|
0.564
|
WBC (×109/L)
|
|
5.31±2.25
|
5.37±1.72
|
0.876
|
RBC (×109/L)
|
|
4.32±0.61
|
4.45±0.55
|
0.201
|
HGB (g/L)
|
|
132.34±18.79
|
130.12±14.38
|
0.447
|
Abbreviations: ECOG, Eastern Cooperative Oncology Group; PVTT, portal vein tumour thrombosis; AFP, alpha-fetoprotein; PLT, platelet; PT, prothrombin time; TBL, total bilirubin; ALB, albumin; WBC, white blood cell; RBC, red blood cell; HGB, haemoglobin.
3.2 Child–Pugh score changes and survival analysis
The overall survival of all patients was > 3 months. The Child–Pugh scores before and after treatment were 5.87 ± 0.83 and 5.61 ± 0.82, respectively, and the difference was statistically significant (t = 2.34, P = 0.02). As of 30 June, 2021, the mOS was 12.5 months (95% confidence interval [CI]: 11.78–13.22) in the training set patients (Fig. 1), and the survival rates at 6, 12, and 18 months were 93.8%, 61.9%, and 24.7%, respectively.
3.3 Preliminary screening of variables
As shown in Fig. 2, the LASSO regression (Fig. 2a) with cross-validation (Fig. 2b) and univariate Cox regression (Fig. 2c) were used as the preliminary screening variables. The results showed that the variables for univariate Cox regression screening were the Child–Pugh score, tumour burden, PVTT type, and platelet (PLT) level. LASSO regression with cross-validation showed that the screening variables were tumour burden, PVTT type, and PLT level.
3.4 Comparison of two different models
The multivariate Cox survival regression model and stepwise backward regression were used to determine the final model variables with the minimum AUC. The results showed that the variables from univariate Cox regression screening to be analysed by the multivariate Cox survival regression model were Child–Pugh score, tumour burden, PVTT type, and PLT level (Model 1). The variables selected by LASSO regression with cross-validation did not change after multivariate Cox survival regression model analysis (Model 2), including tumour burden, PVTT type, and PLT level.
As shown in Fig. 3a, the 12-month AUC of Model 2 (0.850, 95%CI: 0.781–0.933) was slightly larger than that of Model 1 (0.850, 95%CI: 0.770–0.931). However, no statistical difference was observed between Models 1 and 2 (delta.AUC = 0.70, P = 0.60). The time-dependent AUC curve showed that there was no significant difference between the two models for any period (Fig. 3b). Model 2 was selected to make the model more concise.
3.5 Development and validation of an individualized prediction model
A model that incorporated independent predictors (including tumour burden, PVTT type, and PLT level) was developed and presented as a nomogram (Fig. 4a). The calibration curve for 12-months probability showed a moderate level of consistency between the predicted and observed results. For internal validation (Fig. 4b), the C-index for the prediction nomogram was 0.770 (95% CI: 0.723–0.817) for the training set, which was confirmed to be 0.703 (95% CI: 0.628–0.778) via training set validation.
K-fold cross-validation (CV) was used for external validation, and in the K-fold CV procedure, the training and validation sets were randomly and evenly split into K parts. Model 1 was built based on K-1 parts of the training set. The prediction accuracy of this candidate model was then evaluated on K-1 parts of the validation set. By respectively using each of the K parts as the training set and repeating the model building and evaluation procedure, we choose the model with the smallest cross-validation score (the mean squared prediction error [MSPE]) as the “optimal” model. Our data were run 1000 times, where K was equal to 5. At last, the ultimate model was obtained with its parameter estimates as the average values across K candidate ‘optimal’ models. At the same time, we obtained an average of 1000 C-indices (0.766) (Fig. 5a).
3.6 Clinical usefulness
The DCA results for the nomogram are shown in Fig. 5b. The decision curve demonstrated that if the threshold probability of a patient or physician was 25%, using the developed nomogram to predict median survival was more beneficial than using the treat-all-patients or treat-none schemes.