PBMC Composition Changes with Patient Disease Severity and Outcome.
First, we analyzed the relative populations (fractions) of immune cell subsets in peripheral blood mononuclear cells (PBMCs) in relation to disease severity, outcome, and disease types. The population proportions of platelets, precursor cells, and erythroblasts in PBMCs increased with the disease and outcome severity (Fig. 1B, Supplementary Fig. 1A, B), while the T cell fraction decreased (Fig. 1B). B cell fractions were significantly increased in SLE patients, which is a known phenotype in SLE patients (Fig. 1C); monocyte and neutrophil fractions were significantly increased in convalescence patients (Fig. 1E, F); however, fractions of the dendritic cells decreased as outcome severity increased (Fig. 1G). We evaluated ratios of different cell types as a criterion of separating FT and S patients and found that the ratio of platelets to T cells, (Pla-T ratio) had the highest area under the curve (AUC) at 0.754, with the 0.063 ratio between FT and S patients (Fig. 1H).
XGBoost and Deep neural network modeling identifies biomarkers of survival and fatal platelets.
In our study, we employed two machine learning approaches—a Deep Neural Network (DNN) and an eXtreme Gradient Boosting (XGB) algorithm—to analyze and interpret with the objective of identifying key biomarkers. These biomarkers are indicative of patient survival and the presence of fatal platelets, which are critical in the prognosis and treatment of life-threatening conditions. When comparing the two models, the XGB model has a slightly higher accuracy, while the DNN model shows marginally better performance in F1 score, precision, and recall. This could suggest that while XGB is slightly better at correctly identifying both positive and negative classes, DNN is better at identifying positive cases when the outcome is positive and also better at not missing positive cases (Fig. 2A). We only obtained only the features that are within the top 5% of importance or gain from their respective models. We also looked into genes have the absolute log2fc > 1. The overlap of 21 biomarkers suggests that there is a consensus between the three models on these biomarkers' significance in relation to the outcome of interest (e.g., survival or fatal platelets) (Fig. 2B). In order to discern the specific features that could potentially serve as biomarkers indicative of either survival or fatal outcomes, we meticulously labeled each gene within a volcano plot, as presented in Fig. 2C of our study. The biomarkers identified in survival platelets include AIF1, FOS, CD74, JUN, JUNB, HLA-DRA, MNDA, RPL39, RPS21, RPS18, EEF1A1, RPS28, RPL34, S100A8, S100A11, S100A12. The biomarkers identified in fatal platelets include HBA2, HBB, HPSE SLC25A37 TMCC2. We conducted a gene ontology enrichment analysis on the identified genes. For genes associated with survival, the enrichment analysis revealed a strong connection to processes involved in cytoplasmic translation and the active immune response, as detailed in Fig. 2D. In contrast, genes related to fatal outcomes were predominantly associated with pathways involved in coagulation and reactive oxygen species (ROS) metabolic pathways, as depicted in Fig. 2E.
Platelets Amplify Endotheliopathy and Disseminated Intravascular Coagulation in Fatal Patients
Platelets have been shown to play a role in the development and progression of endotheliopathy and disseminated intravascular coagulation (DIC) [52, 53]. Platelets can bind to and activate endothelial cells, releasing pro-inflammatory and pro-coagulant molecules that can contribute to the development of DIC. Additionally, platelets can also contribute to thrombus formation and further damage to the endothelium [54].
We looked into the expression of the Integrin Subunit Alpha 2b (ITGA2B) gene, which is involved in platelet-endothelial cell interactions by binding to the Integrin Subunit Alpha V (ITGAV) [55]. The patients who ultimately passed away had the highest levels of ITGA2B expression, followed by the patients who were in severe disease state (Fig. 3A). On the pathway level, the reorganization of actin cytoskeleton (GO:0007010), also followed the same severity trend (Fig. 3B). DIC is a complex condition characterized by abnormal clotting and bleeding due to the activation of the coagulation cascade and the depletion of clotting factors and platelets [56]. Several other GO terms associated with DIC, such as blood coagulation (GO:0007596), inflammatory response (GO:0006954), extracellular matrix disassembly (GO:0022617), and platelet activation (GO:0030168) expression had highest expression in FT/SV patients. All modules had lower expression in CV and SLE patients (Fig. 3C). Fig. 3. Differential expression of platelets affecting endotheliopathy across disease severity states
Platelet Subpopulations Associated with Disease Severity
The standard analysis of scRNA-seq focuses on identifying clusters that overlap with the standard cell types. Here we carried the clustering further to understand the details of platelet population changes in diseases. By trial and error, we identified similarity thresholds that resulted in stable clusters with the best separation and the minimal signature pathway overlap, resulting in thirteen clusters, designated C0 through C12 (Fig. 4A, B). We computed the composition of each cluster in terms of the contributions from different outcome groups. The C11 cluster was strongly associated with the FT group, with 78% of all C11 cells found in this group. Clusters C3, C5, and C9 had the highest proportion of HC (17%), while clusters C6 and C11 had the lowest (1%). Cluster C3 had the highest proportion of mild group at 25%. In clusters C6, C7, C10, and C12, the severe group comprised over fifty percent of total platelets (Supplementary Table 1). Over 70% of platelets in clusters C6, C8, and C10 come from the samples from the survivors (Supplementary Table 2). Based on these results, we designate clusters C4, C9, and C11 as “fatal”, C8 as “convalescent”, and C6 and C10 as “survival” (Fig. 4C, D).
Characteristics of Platelet Subpopulations
We used DEG and Gene Set Enrichment Analysis (GSEA) to identify molecular factors and pathways that could help us differentiate between the clusters (subpopulations) platelet identified in the previous step. The sets of genes enriched in the fatal platelet clusters C4 and C11 were significantly different from that in other platelet clusters (Fig. 5A) and in some cases show opposite trends from that in the convalescent and survival clusters. The fatal cluster C4, whose signature module comprises HPSE, WFDC1, and PF4 genes, and the convalescence cluster C8, whose signature module consists of RPLP0, RPS6, and RPS23, have several pathways that trend in the opposite direction. Fatal cluster C11, whose signature module includes TPT1, POLR2L, and CSRP1, had the highest energy consumption pathway scores, including oxidative phosphorylation (OXPHOS) and glycolysis (Supplementary Fig. 2A, B), but the lowest inflammatory response score (Supplementary Fig. 2C). The other fatal cluster C9, whose signature modules include HBB, HBA2, and HBA1, showed the weakest interferon response, including alpha and gamma interferons (Supplementary Fig. 2D, E). Coagulation, epithelial-mesenchymal transition (EMT), and the apical junction are all at their highest in C4 but at their lowest in C8 (Fig. 5B-D). At the same time, MYC targets v1 and v2, which contain nuclear-encoded genes involved in mitochondrial biogenesis [59], are the lowest in C4 but highest in C8 (Supplementary Fig. 2F, G). Based on these findings, we can conclude that platelets from the fatal cluster C4 are highly active in angiogenesis, coagulation, and endotheliopathy, while having the lowest RNA processing, cell division, and mitochondrial biogenesis, while those from the convalescence cluster C8 had the opposite trend in all these pathways.
Cluster C6, associated with survival, was characterized by elevated expression of genes CRBN, CD58, and SRSF3, and exhibited the most significant activity in the Notch signaling pathway alongside minimal IL6/JAK/STAT3 signaling pathway activity, as shown in Supplementary Figs. 2H and I. Conversely, survival cluster C10, distinguished by a gene module including CD74, HLA-DRA, and CD79A related to antigen presentation, demonstrated higher activity in allograft rejection and MYC targets v2 pathways (Supplementary Figs. 2J and G). Both clusters C6 and C10 showed reduced activity in angiogenesis, coagulation, epithelial-mesenchymal transition (EMT), and apical junction pathways, which were notably upregulated in the fatal cluster C4, as detailed in Fig. 5B-D.
Then, we examined the GO terms up-regulated in the fatal clusters C4, C9, and C11. The key pathways up-regulated in fatal cluster C4 include wound healing, platelet activation, hemostasis, and coagulation, which is consistent with the known observations that thrombotic problems are a major cause of morbidity and mortality in COVID-19 patients [60]. Oxygen transport, hydrogen peroxide metabolism, gas transport, and erythrocyte development are enriched terms for C9, reflecting the hypoxic environment of C9 platelets (Fig. 5F). ATP metabolic activities such as oxidative phosphorylation, cellular respiration, and aerobic respiration are among the enriched terms in C11; this is consistent with the C11 GSEA results, which suggested that C11 platelets are inactive (Supplementary Fig. 2L). Platelets in quiescence are known for requiring ATP for their basic function. According to a study, glycolysis produces up to 65% of the required ATP in inactivated platelets, with mitochondria providing the remainder [61]. C4 is the most active cluster in FT groups. Therefore, we examined C4-enriched genes associated with diseases, such as arteriosclerosis, bacterial endocarditis, thrombosis, and frontotemporal lobar degeneration, a neurodegenerative disorder (Fig. 5G). Among the mechanisms shared by survivor clusters C6, C8, and C10 are platelet translation regulation, mRNA processing, and ribosomal RNA biogenesis (Fig. 5H, Supplementary Fig. 2M, N). Additionally, C10 was enriched for the GO terms homeostasis lymphocyte activation cells, and leukocyte antigen cell-cell adhesion (Supplementary Fig. 2N).
Pseudotime Trajectory Analysis Identifies Platelet Signature Dynamics in Survival or Fatal Disease Outcomes
The fatal cluster C4 appears to be critical for the negative outcome groups, so we evaluated possible events leading to its emergence. Pseudotime analysis, although unable to recover real time dynamics of cell populations, provides hints as to their order along the developmental trajectories [60]. The possible precursor of C4 is the C0 cluster, which is also close to the C1 and C6 clusters in the trajectory map (Supplementary Fig. 3A). C0 cluster has two possible developmental routes, fatal C4 or survival C1and C6 (Fig. 6A, B), indicating that the platelets in the C0 could be targeted by early intervention to inhibit their development into the C6 phenotype.
Then, we examined DEGs between C4 and C0 and C0 and C1. AKR7A2, CALD1, CALR, CD36, CSRP1, CYBA, ENSA, FCGR2A, HBG2, HCST, HMGN1, HPSE, MMRN1, NDUFA4, PF4, RPLP1, RPS9, SAMD14, and TPT1 genes are the consistently up-regulated, with a log2Fold change greater than 0.4 and an adjusted p value less than 0.05, from C0 to C4, and C1 to C0 (Fig. 6C, D). For the further analysis we defined a fatal platelet module consisting of these genes. Among the fatal module enriched GO terms are regulation of body fluid levels, coagulation, phagocytosis, and tumor necrosis factor (TNF) production (Supplementary Fig. 3B). The DEGs consistently down-regulated between C4 vs. C0 and C0 vs. C1 were ADIPOR1, CDKN1A, MAP3K7CL, MMD, NEAT1, NUTF2, PTMA, RAB31, SLC40A1, TMEM140, TSC22D3, they were used to define the platelet survival module. However, no GO, KEGG, or Reactome pathways were enriched in the survival module. The scores for the fatal and survival modules were then calculated for all disease clusters and disease severity levels. As expected, cluster C2 had the lowest fatal score, followed by C1 and then survivor cluster C6. C11 had the greatest fatal score, followed by C4 and C9 (Fig. 6E). C2 had the greatest survival module score, followed by C1 and survivor cluster C6, while C11 and C4 had the lowest scores for the fatal modules (Fig. 6F).
The fatal and survival module scores could also serve as indicators of the disease outcome. The patient group with the highest fatal scores was the fatal group, followed by the severe group, while the mild group received the lowest score on the fatal module, followed by the moderate group. The HC group was positioned in the center of the fatal module (Fig. 6G). The groups with the lowest survival module ratings were fatal and severe groups. However, the highest scores were in the SLE and mild groups (Fig. 6H). When platelets encounter an immunological disorder response, their expression shifts away from HC, as demonstrated by the findings. In addition, we investigated the composition of platelet subclusters and discovered that the fraction of fatal cluster C4 was the best indicator among clusters for distinguishing between S and FT patients (Fig. 6I), with an AUC of 0.749. When C4 platelets exceeded 3.36 percent of the PBMC platelet composition, patients were at risk of death. In this case, detecting the presence of platelets C4 in a timely manner could help in the patient's survival.
These findings show the complexity of COVID-19 and sepsis in relation to gene and pathway signatures in platelets. This is especially true for patients with severe active disease, as well as those recovering from it and surviving it. This research could lead to a better understanding of platelet processes in COVID-19 and sepsis patients, which could help guide therapeutic options for both patient subgroups.
Unique and Shared Gene Expression Changes in Platelets from COVID-19, SSH, Sepsis, and SLE Samples
Comparing COVID-19, SSH, sepsis, and SLE platelets to HC platelets, we identified differentially expressed genes (DEGs) and related pathways (whose up- and down-regulations were analyzed using the GO and KEGG databases). There were 45 DEGs that were up-regulated in all four diseases studied here, including IFI27L2, IFITM2, IFITM3, and S100 family genes (S100A8 and S100A9) and genes for ATP synthase, such as ATP5E and C9orf16. BEX3, LCN2, RHEB, and TMEM219 are genes associated with apoptosis (Supplementary Table 4) (Supplementary Fig. 4A). 271 DEGs were downregulated in COVID-19, 515 DEGs were downregulated in SSH, 901 DEGs were downregulated in Sepsis, and 747 DEGs were downregulated in SLE. 129 genes were downregulated in all four diseases (Supplementary Fig. 4B). The 129 genes included 68 ribosome-related genes. CD52 is a therapeutic target and predictive biomarker for sepsis (14). CD3E, CD48, CD7, LCK, LEF1, PTPRC, and TCF7 are required for the activation of T cells (Supplementary Table 4).
There were 294 pathways up-regulated in COVID-19, 346 in SSH, 394 in sepsis, and 139 in SLE as compared to HC (Fig. 7A). There were 47 pathways that were consistently up-regulated in the four considered diseases. There were 120 down-regulated pathways in COVID-19, 381 in SSH, 710 in sepsis, and 657 in SLE (Fig. 7B). Among them, 81 pathways were consistently down-regulated by all four considered diseases.
To better interpret the functional meaning of the consistently up/down-regulated pathways among the four diseases, we used Revigo [58] to summarize the 47 up-regulated pathways and 81-down-regulated pathways from COVID-19 vs. HC, SSH vs. HC, Sepsis vs. HC, and SLE vs. HC. Pathways up-regulated in platelets consistently among the four considered diseases include GO: neutrophil mediated immunity, KEGG: Parkinson disease, and GO: ATP metabolic process (Fig. 7D). All translation processes are down-regulated, including the GO: nuclear-transcribed mRNA catabolic process, nonsense-mediated decay, GO: regulation of translational initiation, GO: protein localization in the endoplasmic reticulum, KEGG: Ribosome, and GO: response to interleukin-4 (Fig. 7D). Interleukin-4 has several biological activities, including the stimulation of activated B cells and T cell proliferation, mediated by pathways such as T cell activation regulation and response [59].
We then focused on the pathways that were up-regulated in both COVID-19 and sepsis relative to HC. KEGG: Parkinson disease, KEGG: Huntington disease, KEGG: Prion disease, KEGG: Alzheimer disease, KEGG: Pathways of neurodegeneration - multiple diseases, and KEGG: Amyotrophic lateral sclerosis was consistently up-regulated in platelets from COVID-19 and sepsis (Supplementary Fig. 4C). COVID-19 has been recently associated with neurological diseases [60], even though the molecular mechanism of this association is not clear. As with sepsis, COVID-19 may act as a significant inflammatory insult, increasing the brain's susceptibility to neurodegenerative illnesses, cognitive decline, and the likelihood of acquiring dementia later in life [61]. However, up-regulation of the pathways seen in neurons in neurogenerative diseases in platelets may suggest an effect of common regulatory mechanisms rather than a direct effect of abnormal platelets on the brain.
The KEGG pathways related to Endocytosis and Fc gamma R-mediated phagocytosis were found to be up-regulated, indicating an active engagement of platelets in the internalization of virions. This internalization is facilitated by Toll-like receptors binding to virion-released lysosomal ligands such as single-stranded RNA, double-stranded RNA, and CpG DNA, which trigger platelet activation and the release of granules, leading to the exposure of P-selectin and the formation of platelet-leukocyte aggregates. Additionally, pathways linked to protein targeting, translation, and ribosomes were identified, alongside a noted down-regulation in the context of sepsis for KEGG: Coronavirus disease - COVID-19. This down-regulation is primarily attributed to the diminished expression of ribosomal proteins, a pattern observed in both COVID-19 and sepsis conditions, as elaborated in Supplementary Fig. 4C.
Then, we investigated the pathways that are enriched in COVID-19 and sepsis fatal patients (FT) in comparison to survivors (S) (Fig. 7C). GO: neutrophil mediated immunity and GO: ATP metabolic process were up-regulated in FT patients and were likewise up-regulated in disease vs. HC pathways. GO: response to endoplasmic reticulum stress, GO: response to hypoxia, and GO: intrinsic apoptotic signaling pathway in response to oxidative stress are up-regulated stress pathways. While the KEGG: Bacterial invasion of epithelial cells was also among the up-regulated pathways, we concluded that only the part of this pathway involved in cytoskeleton rearrangement is upregulated (see the discussion section). GO: antigen processing and presentation of peptide antigen via MHC Class I were also up-regulated in fatal patients of COVID-19 and sepsis. Platelet MHC Class I mediates CD8 + T-cell suppression during sepsis, according to a previous study [62]. Pathways such as GO: protein localization to the endoplasmic reticulum, GO: regulation of translational start, and GO: regulation of RNA stability were down-regulated. In addition, lymphocyte activation pathways, such as GO: regulation of T cell proliferation, GO: interferon-gamma-mediated signaling pathway, and GO: regulation of interleukin-12 production, were down-regulated in FT patients. Interferon-gamma is mostly secreted by activated lymphocytes, including CD4 T helper type 1 cells and CD8 cytotoxic T cells [63], whereas interleukin-12 is known as T cell-stimulating factor [64] (Fig. 7C). These data indicate that platelets from the disease cohorts exhibited less immunological activation, fewer translational activities, and more neurodegenerative tendencies, such as the KEGG enrichment for Parkinson's disease (Fig. 7D). Compared to survivor platelets, the aforementioned tendencies became more pronounced in FT platelets, which began to exhibit the ability to invade epithelial cells (Fig. 7C).
Pathway Enrichment Related to Disease Severity in Platelets
Given the dynamic gene expression changes in platelets in multiple diseases, we evaluate what gene expression modules were significantly changed depending on disease severity. Two modules that diminish with disease severity are MHC class II genes (Fig. 8A) and translation initiation (Fig. 8B). MHC Class II scores are much higher in convalescent patients than in healthy controls, indicating that MHC Class II can be utilized as an indicator of patient recovery. High case fatality rates of COVID-19 reported in some countries have been linked to inadequate MHC class II presentation and, consequently, a weak adaptive immune response against these viral envelope proteins, according to studies [65]. The lowest score for translation initiation modules (GO:0006413) was found in the platelets of patients who did not survive the diseases, indicating a halt in protein translation due to fatal illness. An interesting observation is the presence of pathways implicated in neurodegeneration (GO:0070843) in severely sick COVID-19 patients (Supplementary Figs. 5A-5H). Except for axonal transport modules, platelets from severe and fatal COVID-19 patients show all the trends observed in neurodegeneration diseases' major biological processes. In both sepsis and COVID-19 patients, neurodegeneration-related pathways became more severe as the disorders advanced. The scores for blood coagulation (GO:0007596), platelet activation (GO:0030168) (Fig. 3D), oxidative phosphorylation (OXPHOS) (Fig. 8C), and glycolysis (Fig. 8D) (35) modules confirmed the hypothesis that platelet coagulation and energy consumption are functionally linked to the severity of sepsis disease and the progression of COVID-19 disease [66, 67]. Moreover, platelets in convalescent patients had higher glycolysis scores, which corresponded to module scores in response to oxygen radicals (Supplementary Fig. 5D), indicating that platelets in convalescent patients were also hypoxic. Hypoxia induces oxidative damage to neural cells and causes widespread neurodegeneration [68].
Interferon response modules, such as response to type I IFN (GO:0034340) (Fig. 8E), IFN- β (GO:0035456) (Fig. 8F), and IFN-γ (GO:0003341) (Fig. 8G), had the highest scores in moderate patients and the lowest scores in HC. The interferon (IFN) protein family is crucial for the immune response against viruses and other infections. IFNs have been demonstrated to play a significant role in preventing SARS-CoV-2 infection in the context of COVID-19, but they have also been linked to severe symptoms [69]. The findings may explain the contradictory reports of COVID-19 patients with impaired and robust type I IFN responses. Although robust type I IFN responses have been reported in patients with severe COVID-19 [70], studies have demonstrated that type I IFN responses are severely impaired in the peripheral blood of patients with severe or critical COVID-19, as indicated by low levels of type I IFNs and interferon-stimulated genes [71]. Our data indicate that severe and fatal patients had IFN levels much lower than moderate patients, but greater than mild patients and healthy controls. In contrast to individuals with other disorders, COVID-19 patients have continuously elevated IFN levels. The above data conclude that the higher expression of MHC Class II and translational initiation expression in platelets mean better outcomes in patients. Coagulation and higher ATP synthesis from platelets, means worse outcomes for patients. As for interferon response, with both protective and deleterious effects being reported, we confirmed the theory that severe COVID-19 is associated with decreased IFN signaling [72, 73].
The Platelets’ Crosstalk with Monocytes and Lymphocytes
An important role of platelets is communication with other cell types during formation of the thrombi, both regular ones forming during the hemostasis or potential abnormal ones, forming during sepsis and COVID-19. Using the ligand and receptor database from iTalk [49], we evaluated these interactions by computing the product of average ligand and receptor expressions in the corresponding cell types from peripheral blood mononuclear cells (PBMC) (see Materials and Methods). Platelet-monocyte interaction was evaluated and found to have the highest score in fatal patients relative to other outcomes (Fig. 9A). Consistently with our previous sepsis study [12], platelet-monocyte interaction scores were significantly elevated in FT sepsis patients. This phenomenon was also noticed in COVID-19-severe patients. Compared to control participants and mildly infected individuals, ICU-admitted COVID-19 patients had higher platelet-monocyte aggregate levels [67]. Thus, we postulate that the aggregation of platelets and monocytes is linked with the severity and mortality of sepsis and COVID-19.
Chemokine receptor 2 (CCR2), CCR5, and their selective ligands, chemokine ligand 2 (CCL2), and CCL3, have been found to promote the trafficking of leukocytes to sites of inflammation and regulate their activation [73]. Also, the CCL2-CCR2 and CCL3-CCR5 ligand-receptor systems in differentiating T cells have been identified [74]. CCL2-CCR2 and CCL3-CCR5 interactions between platelets and T cells revealed that SLE had the highest expression of the CCL3-CCR5 system between platelets and T cells, a characteristic of SLE [75]. Compared to other outcome groups, the CCL3-CCR5 system’s expression was the lowest among patients who did not survive the diseases (Fig. 9B). The results are associated with T cell differentiation module scores (GO:0030217) (Fig. 9F). The CCL2-CCR2 and CCL3-CCR5 ligand-receptor systems are more prevalent in surviving patients than in HC (Fig. 9C).
CD40 is recognized to play a crucial role in B lymphocyte proliferation and differentiation [76]. And CD40 ligand CD40LG expression is low or undetectable on the surface of resting platelets but is highly expressed upon platelet activation [77]. Also, platelets are noted to directly influence adaptive immune responses via the secretion of CD40 and CD40L molecules [2]. Platelets and B cell interaction analysis revealed that healthy controls had the highest CD40 induction relative to other groups, whereas patients who did not survive had the lowest CD40 induction from platelets – B cell interaction (Fig. 9E). The module score for B cell proliferation (GO:0042100) (Fig. 9G) also supported this conclusion. SLE also had the highest expression of CD40LG and CD40 interaction between platelets and B cells compared to other diseases (Fig. 9D). These data implicate that in SLE, platelets may induce B cell activity, consistent with previous studies reporting the putative effects of activated platelets in SLE pathogenesis [13].