Participants characteristics
For the discovery cohort, the demographic and clinical characteristics of 130 ARDS cases, 33 healthy controls (HC), and 33 disease controls (DC) are presented in Table 1. At baseline, the mean age of the ARDS group was 72.5 (SD, 11.4) years, and 74.6% were male. Gender distribution and age were matched in DC and HC with ARDS (Table 1), while most of the lab tests were different between ARDS and DC groups. At the onset of ARDS, non-survivors showed decreased P/F ratio, PLT, and Alb levels, alongside increased SOFA scores, BUN, and AST, compared to the survivors. No significant differences were observed in other clinical indicators and comorbidities. We acquired serum proteome profiles of all participants (n =196) using a DIA strategy and global metabolome by the LC-MS method. An overview of proteomics and metabolomics workflow can be found in Fig. 1A. The independent prospective validation cohort incorporating 183 early ARDS patients with 85 deceased and 98 survived (Fig. 1A and Table S1). The differences in baseline characteristics between surviving and deceased ARDS patients were nearly consistent across the discovery and validation cohorts.
Altered serum proteomic profiling in the ARDS group and its biological features
After filtering the low abundant proteins, 2669 high-quality proteins were collected for the data analysis (Additional file 1). The median protein number for 196 samples was 2471 (Fig. 1B). The apolipoproteins, such as APOA1 and APOB were the most abundant proteins for each group (Fig. 1C). The clustering tree indicated the appropriate pre-processing method in this study (Fig. 1D).
Our preliminary analysis employed principal component analysis (PCA) to explore the clustering patterns among the groups (Fig. 2A and Fig. S1A-S1C), which successfully demonstrated a distinct separation between ARDS samples and those from HC or DC groups. A comprehensive differential expression analysis revealed that 1069, 319, and 511 proteins were uniquely altered in ARDS vs. HC, ARDS vs. DC, and DC vs. HC groups, respectively (Fig. 2B-2C and Fig. S1D, as detailed in Additional file 2). These findings demonstrated the progressive features of serum protein alterations correlating with the severity of the disease. A critical subset of 214 differentially abundant proteins emerged as consistently regulated across ARDS, with 16 proteins showing persistent downregulation and 198 showing upregulation in comparisons of both ARDS vs. DC and ARDS vs. HC (Fig. 2F, Additional file 3). This pattern highlights a distinctive protein signature characteristic of ARDS. Among the downregulated proteins, Fetuin B (FETUB) was an intriguing link between metabolism and bacteremia and can now be classified as biomarkers of SaB mortality[10]; Paraoxonase 1 (PON1), negatively associated with higher mortality of sepsis[18], underlines significance in ARDS pathology compared to DC and HC groups (Fig. 2G). Conversely, certain proteins, including Surfactant Protein D (SFTPD) and Signal Transducers and Activators of Transcription 3 (STAT3) et al., were found elevated (Additional file 3). Moreover, SFTPD, a circulating epithelial markers[19] and STAT3, an activator of macrophages and neutrophils[20], were implicated as potential key contributors to the pathogenesis of ARDS, offering novel insights into its molecular underpinnings.
Pathway analysis for distinguishing ARDS from non-ARDS controls at the protein level
Pathway analysis at the protein level identified nine distinct pathways that were specifically modulated in ARDS (Fig. 2D-2E and Fig. S2). Notably, the oxidative phosphorylation pathway emerged as significantly regulated in comparisons of ARDS with both DC (NES = 1.615, P = 0.001) and HC (NES = 1.582, P < 0.001), underscoring its pivotal role in the energetic metabolism associated with ARDS (comprehensive pathway listings are available in Additional file 4).
In addition to oxidative phosphorylation, key biological processes such as the spliceosome and proteasome pathways were upregulated in ARDS (Spliceosome: ARDS vs. DC, NES = 1.474, P = 0.007; ARDS vs. HC, NES = 1.500, P < 0.001. Proteasome: ARDS vs. DC, NES = 1.471, P = 0.011; ARDS vs. HC, NES = 1.534, P = 0.003). Signaling pathways, including mTOR, FoxO, VEGF, and sphingolipid signaling, also showed significant upregulation in ARDS relative to both DC and HC, indicating the complexity of molecular alterations in ARDS, including immune system, cell signaling and lipid mechanism. An exploration of unique pathways between ARDS and HC revealed an upregulation of processes related to amino acid degradation, autophagy, endocytosis, and the TNF signaling pathway, alongside restricted focal adhesions in ARDS (Fig. 2D and Fig. S2).
Fig. 2H illustrates the enriched proteins within these nine overlapping pathways, all of which exhibited increased serum levels in ARDS cases. Among the highlighted proteins, Succinate Dehydrogenase Complex (SDHB), involved in mitochondrial function, and various components of the vacuolar ATPase family (ATP6V1D, ATP5MG, ATP6V1H) were implicated in oxidative phosphorylation. Moreover, proteins such as Mitogen-Activated Protein Kinase Kinase 1 (MAP2K1), Phosphoinositide-3-Kinase Regulatory Subunit 1 (PIK3R1), and Non-Receptor Tyrosine Kinase (SRC), associated with the VEGF signaling pathway, were significantly upregulated in ARDS, illustrating a comprehensive network of molecular interactions contributing to the pathogenesis and progression of ARDS.
Metabolites LysoPCs were decreased in ARDS compared with non-ARDS controls
In our metabolomic investigation, we cataloged a comprehensive array of 3331 metabolites, encompassing amino acids, lipids, and other critical serum metabolites, as detailed in Additional file 5. The PCA plots, showcased in Fig. 3A-3C, revealed a pronounced metabolic differentiation of the ARDS in comparison to both DC and HC groups. This distinct separation revealed the role of metabolic dysregulation in the etiology of ARDS.
In our comparative metabolomics study, we identified 214, 113, and 206 differentially abundant metabolites (DAMs) in the ARDS vs. HC, ARDS vs. DC, and DC vs. HC, respectively (Additional file 6). Among these, lysophosphatidylcholine emerged as the most markedly altered metabolites in ARDS, exemplified by significant decreased in LysoPC (0:0/18:2(9Z,12Z)) (ARDS vs. HC: VIP = 22.5, P = 0.0003; ARDS vs. DC: VIP = 27.7, P = 0.017) and LysoPC (0:0/18:0) (ARDS vs. HC: VIP = 13.4, P < 0.0001; ARDS vs. DC: VIP = 9.5, P = 0.0003) (Additional file 6). Further exploration through the KEGG pathway analysis of these DAMs revealed the profound impact of ARDS on metabolic processes. This analysis notably highlighted the modulation of the sphingolipid signaling pathway and sphingolipid metabolism, which were distinctively observed in ARDS as compared to the non-ARDS groups (Fig. 3D).
Cross‑talk between proteomics and metabolomics implicated sphingolipid signaling pathway as a hub pathway in ARDS
To elucidate the pathogenetic mechanisms underlying ARDS, we undertook an integrated analysis of proteomics and metabolomics datasets. This approach enabled us to dissect the multifaceted changes characterizing dysregulated biological functions associated with ARDS. A significant finding from our study was the identification of the sphingolipid signaling pathway as a key regulatory axis, influenced at both the protein and metabolite levels, pinpointing it as a hub pathway in ARDS pathogenesis (Fig. 3D and Table S2). Further network analysis demonstrated the critical role of the sphingolipid signaling pathway, integrating it centrally within the ARDS-unique regulatory network (Fig. 3E and Fig. S3-S4). Specifically, critical regulatory proteins such as BCL2 Associated X (BAX), BH3 Interacting Domain Death Agonist (BID), PIK3R1, MAP2K1, and NF-κB Subunit (RELA) exhibited upregulation in ARDS (Fig. 3F). It can be seen that MAP2K1, the hub protein in network, interacts sphingolipid signaling pathway with several important pathways, including parathyroid hormone synthesis secretion and action, Apelin signaling pathway, choline metabolism, phospholipase D signaling pathway, Fc gamma R-mediated phagocytosis and apoptosis. BAX and BID mediate interactions linking the sphingolipid signaling pathway with necroptosis and apoptosis, and RELA associates with the pathway in apoptosis. PIK3R1 connects the sphingolipid signaling pathway to Fc gamma R-mediated phagocytosis, apoptosis, phospholipase D signaling pathway and choline metabolism (Fig. 3E).
Concurrently, metabolomic analysis revealed differential expression of key sphingolipids. S1P showed reduced levels, whereas sphingosine exhibited elevated levels in ARDS compared to both DC and HC groups (Fig. 3F). Importantly, S1P interacts sphingolipid signaling pathway with several signaling pathways, including Apelin, calcium and phospholipase D signaling pathways.
Dysregulation of immune response, energy metabolism, hematopoiesis, and signaling pathway in deceased patients at the ARDS onset
To further investigate the proteomic alterations associated with different prognoses in ARDS, we conducted a comparison analysis between deceased and survived ARDS. PCA elucidated that the first two principal components accounted for 14.2% and 5.8% of the variance between the deceased and survived ARDS (Fig. 4A). Differential expression analysis identified 40 proteins with significant variations (│fold change│≥ 1.5 and FDR < 0.05), suggesting distinctive protein profiles that may serve as early indicators of prognosis in ARDS (Table S3). The most significantly altered proteins between deceased and survived ARDS were Radixin (RDX) and Moesin (MSN) which are important in linking actin to the plasma membrane (Fig. 4B). Subsequent GSEA revealed several highly ranked molecular pathways markedly enriched across various biological themes such as energy metabolism, immune response, proteasome function, gap junction communication, calcium signaling, and hematopoietic cell lineage differentiation (Fig. 4C). Remarkably, pathways like the unfolded protein response (NES = 1.74, P = 0.04), proteasome (NES = 1.84, P = 0.02), glycolysis/gluconeogenesis (NES = 1.62, P = 0.1), and interferon-α response (NES = 1.63, P = 0.1) showed positive associations with the deceased ARDS group. Conversely, pathways associated with hematopoietic cell lineage (NES = -2.06, P = 0.01), gap junctions (NES = -1.85, P = 0.03), calcium signaling (NES = -1.78, P = 0.05), and CD4+ T cell activity (NES = -2.05, P = 0.04) were more prominent in patients who survived ARDS in the early phase.
Importantly, to identify proteins that may play crucial roles in deceased ARDS, we constructed the PPI network of a panel of 429 differentially abundant proteins (P < 0.05). A PPI network comprising 423 nodes and 11675 edges was obtained (Additional file 7). The network topology was further analyzed to identify hub proteins, with a particular focus on their degree of connectivity. To refine our selection of key protein candidates, we spotlighted the top 25 proteins exhibiting the highest connectivity degrees. Among these, Glyceraldehyde-3-Phosphate Dehydrogenase (GAPDH), Heat Shock Protein 90 Alpha Family Class A Member 1 (HSP90AA1), and Enolase 1 (ENO1) emerged as central hub proteins within the network (Fig. 4D-4E). This finding is in concordance with previous GSEA results, since GAPDH and ENO1 as critical enzymes in glycolysis; moreover, HSP90AA1 was the house-keeping protein that aids protein folding and has intrinsic ATPase activity.
Metabolites LysoPCs and S1P were negatively correlated with the prognosis of ARDS
We initially assessed the clustering of the metabolomics data using PCA (Fig. 4F). The PCA plot demonstrated, at the global metabolomic level, a slight separation of the ARDS-deceased and survived groups. Pathway analysis was performed at 45 DAMs with annotation by KEGG (Fig. 4G and Additional file 8). The DAMs were most enriched in signaling pathways, such as the sphingolipid signaling pathway, phospholipase D signaling pathway, calcium signaling pathway, and Apelin signaling pathway. Importantly, lysophosphatidylcholine LysoPC (O-18:0/0:0), LysoPC (15:0/0:0), and S1P were prominently featured within these pathways. Furthermore, the levels of LysoPC and S1P were significantly reduced in the ARDS-deceased group, highlighting their potential involvement in the fatal progression of ARDS (Fig. 4H). More importantly, a correlation analysis between these three metabolites and the severity of ARDS, as quantified by the SOFA score, revealed negative correlations (R = -0.47 for LysoPC (O-18:0/0:0); R = -0.50 for LysoPC (15:0/0:0); R = -0.36 for S1P) (Fig. 4I), suggesting the potential metabolome mechanism underlying the heightened mortality risk in ARDS non-survivors.
Biomarker panel for early prediction of ARDS prognosis
In order to early predict the prognosis of ARDS, we further construct prognosis model. LASSO (Fig. S5A-S5B) and Boruta methods were used to screen the candidate proteins in the discovery cohort, then combining with 40 DAPs, 36 candidate biomarkers with unique peptides ≥3 were proposed for further targeted proteomics analysis. Finally, 31 proteins with 174 peptides were targeted by PRM assay in the external validation cohort (Additional file 9), a total of 22 candidate proteins with peptides ≥2 were kept, in which 8 proteins maintaining consistent significance in both discovery and validation cohorts (Fig. 5A-5B). Among the eight proteins, six proteins were significantly upregulated in deceased patients in both discovery and validation cohorts, including Vascular Cell Adhesion Molecule 1 (VCAM1), Lactate Dehydrogenase B (LDHB), Moesin (MSN), Filaggrin 2 (FLG2), Lamin A/C (LMNA), and Lipopolysaccharide Binding Protein (LBP), while the other two proteins, Transgelin 2 (TAGLN2) and Mannose Binding Lectin 2 (MBL2) were consistently downregulated in the deceased ARDS group (Fig. 5B). We selected these markers as an eight-protein marker panel for early identification of deceased patients of ARDS.
In the discovery cohort, we first compared five state-of-the-art machine learning classifiers using the protein model, The Glm model was the final classifier due to its overall superior performance, evidenced by a 10-fold cross-validated ROC–AUC of 0.893 (95% CI 0.837-0.949) and a sensitivity of 0.920 (Fig. 5C). In comparison, the clinical risk model comprised of SOFA score, P/F ratio and PLT from LASSO feature selection (Fig. S5C-S5D) yielded in a ROC-AUC of 0.784 (95% CI 0.703-0.866) and a sensitivity of 0.885 (Fig. 5D). Combination of parameters in both models resulted in a ROC-AUC of 0.890 (95% CI 0.834-0.946) and a sensitivity of 0.931 (Fig. 5E). The protein model performed significantly better than the clinical risk model (Delong test, P < 0.001), whereas the combination of both models did not exhibit statistical superiority over the protein model alone (Delong test, P = 0.970). Subsequent validation of the Glm classifier using the protein model resulted in an ROC-AUC of 0.802 (95% CI 0.739-0.865) and a sensitivity of 0.835 (Fig. 5F) in the external cohort, whereas the clinical risk model resulted in an ROC-AUC of 0.738 (95% CI 0.655-0.811) and a sensitivity of 0.788 (Fig. 5G). The combination of both models led to an ROC-AUC of 0.844 (95% CI 0.786-0.902) (Fig. 5H). In the validation cohort, the protein model also outperformed the clinical risk model (Delong test, P = 0.008), a combination of both models was superior to the protein model alone (Delong test, P = 0.006).