3.1 Patient characteristics were stratified by tumor size
In this study, we included 6960 eligible cases (Fig. 1A). The median Ts and ELNs were 3.9 cm and 16, respectively. The median follow-up period of all patients was 27 months (range 1-179 months). Then, by selecting the highest χ2 value from the analysis of survival data with X-tile, the studied population was divided into three groups with small (≤ 2.5 cm), medium (2.6-5.2 cm), and large (≥ 5.3 cm) tumor size (Fig. 1B, Fig. S1A). Small tumor group was correlated with better OS than medium or large tumor group (the median OS was 126 months, 38 months, and 21 months for patients with small, medium, and large tumor group, respectively, P <.001) (Fig. S1B). In the graphical presentation, diffuse distribution of risk across the plot was discerned, suggesting a continuous indirect association of Ts with patient’s survival (Fig. 1B). Subsequently, the patient characteristics stratified by these three Ts categories are summarized in Table 1. Differences were detected for all variables except for sex (P = 0.203 for sex). Large tumors generally contained an undifferentiated type, showed deeper invasion and extended lymph node metastasis, and were tented to occupy the entire stomach.
3.2 Effects of tumor size on OS and GCSS
The Kaplan-Meier curves for OS and GCSS based on different Ts groups were presented (Fig. 1C, D). For tumors with size of ≤ 2.5 cm, 2.6-5.2 cm, or ≥ 5.3 cm, the five-year OS rates were 66.8%, 41.0%, 28.1% (P <.0001), respectively, and the corresponding five-year GCSS rates were 83.2%, 59.8%, 39.6% (P <.0001). Next, in the adjusted Cox models, continuous variables such as ELNs and Ts (OS for Ts: HR = 1.02, 95% CI = 1.01-1.04, GCSS for Ts: HR = 1.03, 95% CI = 1.02- 1.05) were found to be an independent prognostic factor for OS and GCSS, as were the following categorical variables: age, race, T stage, N stage, and location (Supplementary Table 1). When Ts was included as a categorical variable in the multivariate Cox regression analysis, the HRs for GCSS gradually raised with increasing Ts (≤ 2.5 cm vs. 2.6-5.2 cm, HR = 0.82, 95% CI = 0.71-0.94, P = 0.005, ≥ 5.3 cm vs. 2.6-5.2 cm, HR = 1.13, 95% CI = 1.03-1.24, P = 0.012) (Table 2). However, the HRs for OS did not show significant difference between the medium and large group (≥ 5.3 cm vs. 2.6-5.2 cm, HR = 1.02, 95% CI = 0.95-1.10, P = 0.564) (Table 2, Supplementary Table 2).
We also used RCS to visualize the association between Ts and prognosis. First, unadjusted Cox models with RCS showed a nonlinear positive correlation of Ts with the HRs for OS and GCSS (P for nonlinearity was < .0001 in OS, P for nonlinearity was < .0001 in GCSS) (Fig. S2A, B). After adjusting by covariates, the HRs for OS increased rapidly within the size of 3.9 cm, but then a U-shaped distribution was manifested and started to increase slowly (P for total was < .0001, P for nonlinearity was < .0001) (Fig. 1E). Meanwhile, the HRs for GCSS also raised quickly, reaching a plateau when Ts was approximately 4 cm (P for total was < .0001, P for nonlinearity was < .002) (Fig. 1F). However, when Ts exceeds the turning point at 3.9 cm, no statistical significance was achieved regarding the 95% CI of HR for both OS and GCSS (Fig. 1E, F). Therefore, despite the continuous nature of Ts, it should be treated as a categorical marker when assessing survival in GC, with statistically valid cut-points.
According to the results of multivariate analysis, we found the large group (≥ 5.3 cm) did not show a worse OS than the medium group as expected. Hence, we conducted the stratified analysis to determine the underlying effectors. Significant survival differences were demonstrated for all the subgroups (age, sex, race, grade, location, and N stage, P < 0.05 for OS) except for patients with stage T2 disease (P = 0.086 for OS) (Fig. 2A, B, Fig. S3A-E). Meanwhile, overlapping survival curves were also found for subgroups based on the T and N stage, reflecting the instability of this 3-way split of Ts. Regarding the performance of Ts on GCSS, survival differences were detected for all the subgroups (all P <.01) (Fig. 2C, Fig. S4A-F). However, in a manner similar to OS, the curves of GCSS also intersected under the subgroups of T2 stage, T4 stage, and N3 stage (Fig. 2C, Fig. S4A-F). Together, these results may suggest Ts is a more effective predictor of prognosis for patients with stage T1 and N0 disease.
3.3 The effect of tumor size on lymph node metastasis
Ts is recognized as an indicator of tumor growth, tumor invasion, and lymph node metastasis. However, the nonlinear association between Ts and LNM has not been assessed in GC. After adjusting by covariates, a nonlinear positive relationship was indicated, that the ORs for LNM steadily increased when Ts got larger (P for nonlinearity was < .0001) (Fig. 3A). As expected, extended lymph nodes dissection also showed an increased number of positive lymph nodes (P for nonlinearity was < .0001) (Fig. 3B). However, the risk of LNM began to flatten with ELNs higher than 15. It is reasonable because the number of positive lymph nodes is not truly determined by the number of harvested lymph nodes but by the biological behavior of the tumor.
3.4 Association of tumor size with prognosis based on examined lymph nodes
After adjusting by covariates, the RCS models showed a nonlinear negative correlation of ELNs with the HRs for OS and GCSS (P for total was < .0001, P for nonlinearity was < .0001 in OS, P for total was < .0001, P for nonlinearity was < .0001 in CSS) (Fig. S5A, B). The HRs rapidly declined but became sluggish when ELNs reached 15. However, a direct correlation of ELNs with a better prognosis could be concluded.
An insufficient number of harvested lymph nodes for diagnosis may result in the underestimation of nodal stages [20]. Considering the effect of Ts on LNM and prognosis, we then evaluated the value of Ts in prognostic prediction for patients with insufficient ELNs. Notably, as presented in Figures 3C and D, statistical significances of survival difference were obtained for all subgroups (group 1: ELNs < 16 with no LNM, group 2: ELNs < 16 with LNM, group 3: ELNs ≥ 16 with no LNM, group 4: ELNs ≥ 16 with LNM, all p-value < .0001). Specifically, this 3-way cut of Ts was more suitable for the evaluation of GCSS (Fig. 3D).