Excel Template for Identifying Mouse Myeloid Cell-Types in the Central Nervous System Based on Single-Cell RNA Sequencing Data

doi:10.21203/rs.3.rs-1071141/v1

Download PDF

Research

Excel Template for Identifying Mouse Myeloid Cell-Types in the Central Nervous System Based on Single-Cell RNA Sequencing Data

https://doi.org/10.21203/rs.3.rs-1071141/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Background

The myeloid cells play a vital role in health and disease of central nervous system (CNS). However, how to clearly distinguish them is still a knotty problem. At present, single-cell RNA Sequencing (scRNA-Seq) technology can sequence thousands of cells at the single-cell level, and then divide the cells into different clusters according to the similarity of gene expression, but it is still difficult to further identity these cell clusters. Generally, there are some specific marker genes for cell-type identities. However, it is difficult to distinguish a variety of myeloid cells in the CNS, because these cells often have the same or cross gene markers, and some markers will change significantly in different pathological states. Therefore, establishing a simple and practical method to distinguish these cell populations is of great significance for the analysis of scRNA-Seq data.

Methods

Referring to CellMarker (http://biocc.hrbmu.edu.cn/CellMarker/), PanglaoDB (https://panglaodb.se/) and Mouse Cell Atlas (http://bis.zju.edu.cn/MCA/gallery.html), combining with the recent literatures, a simple Excel template was designed, in which a panel of gene makers corresponding to the myeloid cells were included. The 83 cell clusters from several recently reported single-cell data were used to verify the accuracy of this template.

Results

This template could easily distinguish myeloid cell-subtypes and non-myeloid cells. Comparing with literatures, the overall consistency rate was 93.98%. There was no statistically significant difference between the two groups (Bowker’s test, P >0.05). Kappa symmetric measures showed that the Kappa value = 0.642 (P < 0.01).

Conclusions

The cell identities of scRNA-Seq cluster data could be performed using our simple Excel formulae, a panel of gene markers and ideal cell clustering data are the basis for accurate identification of CNS myeloid cell-subtypes.

Immunology

Neurology

Cellular & Molecular Neuroscience

Excel template

mouse

myeloid cell-types

central nervous system

single-cell RNA Sequencing

clusters

The cell composition of central nervous system (CNS) includes not only neurons, oligodendrocytes and astrocytes, but also a variety of myeloid cells, such as microglia, monocytes, macrophages, dendritic cells and granulocytes [1]. These myeloid cells play a vital role in health and disease of CNS [2, 3]. Although there have been many studies on these cells, how to clearly distinguish them is still a difficult problem. The morphology, immunohistochemistry and flow cytometry are frequently used to identify these cells [4–6]. However, none of the above methods is perfect. They all have their own defects, such as poor objectivity, lack of specific markers, easy to produce cell recognition errors, etc.

Recently, single-cell RNA Sequencing (scRNA-Seq) technology can sequence thousands of cells at the single-cell level, and then divide the cells into different clusters according to the similarity of gene expression [7], but it is still difficult to further define these cell clusters because it is difficult for researchers to collect cell markers for interested cells [8]. At present, there are three main methods for cell-type identification based on single-cell transcriptome data: 1. Comparing the difference genes of a cluster with the marker genes of which cell-type in the database, and identify the cell-type in combination with their expressions. Common marker gene databases include CellMarker (http://biocc.hrbmu.edu.cn/CellMarker/) [8], PanglaoDB (https://panglaodb.se/) [9], Mouse Cell Atlas (http://bis.zju.edu.cn/MCA/gallery.html) [10], etc. In addition, we can also collect marker genes of certain cell-types in the literatures; 2. The expression profiles of genes in unknown cell clusters and known cell-types are used for similarity analysis. If the correlation was high, it would be identified as this kind of cells [11, 12]. For example, the R package (SingleR) can complete this analysis [13]; 3. Using the expression profiles of known cell-types to construct classifiers as the training sets, and the gene expression profiles of unknown cell clusters are input for classification and identification [11, 12]. For example, the R package (Garnett) can be used for this analysis [14]. Although, more and more automatic cell-type annotation tools have been developed, it is difficult to ensure that an automatic cell-type identification tool is suitable for all cell-types [15]. Therefore, researchers should select one of the defined results as a reference, and name the corresponding cell clusters in combination with manual annotation and relevant knowledge background. In any case, the specific marker genes are still the basis for defining cell cluster [11, 12]. Generally, the specific marker genes are selected according to the discipline background knowledges, literatures and databases. However, it is difficult to distinguish a variety of myeloid cells in the CNS, because these cells often have the same or cross gene markers, and some markers will change significantly in different pathological states [16]. For example, adgre1 (F4/80), the established marker for macrophages [17, 18], also expresses in monocytes, microglia and dendritic cells [19]. Gene expressions of microglial specific markers (P2RY12 and TMEM119) in microglia are often down regulated or even negative under the conditions of CNS injury, inflammation and degeneration [20–22]. Therefore, establishing a simple and practical method to distinguish these cell populations is of great significance for the analysis of scRNA-Seq data.

Excel template design for cell-type definition

Referring to CellMarker (http://biocc.hrbmu.edu.cn/CellMarker/) [8], PanglaoDB (https://panglaodb.se/) [9], Mouse Cell Atlas (http://bis.zju.edu.cn/MCA/gallery.html) [10], combining with the recent literatures [1, 3, 4, 6, 16, 18, 22-30], a simple excel template for cell definition was designed, in which a panel of gene makers corresponding to the myeloid cells, lymphocytes, common CNS cells, and proliferative cells were included (Fig. 1 and Table S1). Here, myeloid cells include monocytes (MNC), macrophages (MAC), microglia (MG), granulocytes (mainly neutrophils, NEUT), and dendritic cells (DC). In order to minimize the effects of lymphocytes on myeloid cell identifies, T, B, and NK cell specific gene markers were also listed in the table.

Excel template design for gene markers and expression extraction

To perform the cell identification of a cluster, we need four Excel tables: cell definition (Fig.1, Fig.2E and Table S1), cluster data (Fig2A and Table S1), avg_logFC extraction (Fig2B, D and Table S1), and gene extraction (Fig2C and Table S1). In cluster data table, column A is the genes in a cluster and column B is avg_logFC, which means average Log2 Fold Change, it is the ratio of the normalized mean gene counts in each cluster relative to all other clusters for comparison. In some literatures, the average value of gene expression is also used. In avg_logFC extraction table, the data in columns A and B should come from the corresponding columns of cluster data table, column C is extracted genes from column C of gene extraction table, column D is extracted values from column C using Excel command: VLOOKUP(Cn,A:B,2,0). In gene extraction table, the data in columns A is the gene markers from column B of cell definition table, column B is the genes from column A of avg_logFC extraction table, column C is extracted values from column A using Excel command: IF(COUNTIF(B:B,An)>0,An,"").

Cell-type identity workflow

The cell-type identity workflow included the follow steps (Fig2): 1. Copy lines A and B form cluster data table, and paste them to the corresponding columns A and B of avg_logFC extraction table, 2. Copy line A form avg_logFC extraction table, and paste it to the column B of gene extraction table, then we will get the extracted genes form gene markers (column A), 3. Copy column C form gene extraction table, and PasteSpecial it to the column C of avg_logFC extraction table, then we will get the extracted values in column D, 4. Copy column D form avg_logFC extraction table, and PasteSpecial it to any blank column we like in the cell definition table, 5. In cell definition table, we can perform cell identities by comparing the extracted values (upregulated and downregulated genes are shown as red and green, respectively) to the cell-types (column A) and gene markers (column B).

Data

The sources of gene expression data used in this paper are shown in Table 1 [10, 31-33]. The data in each literature are displayed in the form of Excel (Fig. 2A).

Consistency test of cell-type identity methods

In order to test the consistency of our cell identity method with the literatures, the identification results were divided into three grades: excellent, satisfactory and poor, based on Table 2. Bowker’s test and Kappa symmetric measures were used to test the difference and consistency of the paired data between the two groups, respectively. For Bowker’s test, P < 0.05 was considered to be a statistically significant difference. For Kappa symmetric measures, Kappa ≥ 0.75 indicates good consistency, 0.4 ≤ kappa < 0.75 indicates general consistency, and kappa < 0.4 indicates poor consistency. The data were analyzed using SPSS software v.26 (SPSS Inc., Chicago, IL, USA).

Descriptive comparison of our method with the literatures in CNS myeloid cells

Using our cell-type identification method, we identified CNS myeloid cells in the four data reported in the literatures (Table 1).

In the Supplementary Table 3 of Ximerakis, et al. [31], they listed the most discriminating genes per cell-type. From this table, we chose monocytes (MNC), macrophages (MAC), microglia (MG), neutrophils (NEUT), dendritic cells (DC), neuronal-restricted precursors (NRP), immature neurons (ImmN), mature neurons (mNEUR), astrocyte-restricted precursors (ARP), astrocytes (AST), oligodendrocyte precursor cells (OPC), oligodendrocytes (OL), ependymocytes (EPC), and hypendymal cells (HypEPC) as “gold standard” to test our method. As shown in Fig. 3, Table 3 and Table S2, among the 14 cell clusters being compared, we identified MNC as MNC (mixed with a few NEUT and DC), and NRP as proliferative cells. The other 12 cell clusters were completely consistent.

The 15 clusters of adult mouse brain from the Table S3 of Han, et al. [10] were also identified, the results were shown in Table 4 and Table S3. We found that among the 15 cell clusters being compared, pan-GABAergic and Schwann cell were not within the scope of our evaluation, the reported cluster 4 (Macrophage_Klf2 high) was mixed with a few MG, the other 12 cell clusters were completely consistent.

The CD11b⁺CD45⁺CD3^-B220^-Ly6G^-cells isolated using fluorescence-activated cell sorting (FACS) from adult mouse brain parenchyma, choroid plexus, leptomeninges, and perivascular space (embj2021108605-sup-0008-datasetev1) by Sankowski et al. [32] were also compared. As shown in Table 5 and Table S4, we found that among the 17 cell clusters compared, there 14 were completely consistent. The non-consistent clusters included stromal cells (cluster 15) which was not within the scope of our evaluation, the reported cluster 6 (CNS-associated macrophages, CAMs) which expressed MG specific markers, and cluster 9 (CAMs) which the typical genes of MAC were not elevated.

Of course, our cell identification process was not smooth sailing. When we analyzed another data (Table S2 of Mimouna et al.) [33], we encounter thorny problems. In this report, Louvain graph-based community clustering was used to divide the cells into different clusters, and PanglaoDB was used to identify putative cell and/or activation state for each individual Louvain cluster. We still identified the cell-types using our method based the author’s data. As shown in Table 6 and Table S5, although the cell-type identification was basically consistent, in both reported and our results, the cell-types in each of the nine clusters were mixed, which indicates that the cell clustering in this data is not ideal.

Descriptive comparison of our method with the literatures in peripheral blood and bone marrow myeloid cells

In order to test whether our method was suitable for the identification of non-CNS myeloid cells, the 21 peripheral blood cell clusters and 17 bone marrow cell clusters of adult mice from the Table S3 of Han, et al. [10] were also identified.

The peripheral blood results were shown in Table 7 and Table S6. We found that among the 21 cell clusters being compared, cluster 14 (Erythroblast_Car2 high), cluster 20 (B cell_Igha high), and cluster 21 (Erythroblast_Hba-a2 high) were not within the scope of our evaluation, the reported cluster 18 (Macrophage_Pf4 high) was mixed with a few NEUT, the other 17 cell clusters were completely consistent. The bone marrow results were shown in Table 8 and Table S7. We found that among the 17 cell clusters being compared, cluster 3 (Neutrophil progenitor), cluster 8 (Hematopoietic stem progenitor cell), cluster 9 (Erythroblast), and cluster 15 (Mast cell) were not within the scope of our evaluation, the other 14 cell clusters were completely consistent.

Statistical comparison of our method with the literatures

According to the grading evaluation method in Table 2, we graded the results of all data analysis (Table 3-8). Excluding those clusters (N/A) that are not within the scope of our analysis, we obtained a total of 83 valid cases. As shown in Fig. 4, the excellent, satisfactory and poor results in literatures were 74, 3 and 6, respectively, and they were 77, 1, and 5 in our results. The overall consistency rate was 93.98% (78/83). The Bowker’s test showed that there was no statistically significant difference between the two groups (P >0.05). Kappa symmetric measures showed that the Kappa value = 0.642 (P < 0.01), indicated general consistency.

For the last few decades, although advanced techniques, such as flow cytometry, can be used to identify CNS myeloid cell-subtypes, it is still difficult to be very accurate due to the lack of absolutely specific markers and the instability of marker expression under different pathophysiological conditions [16]. Although, scRNA-Seq is a promising new technology to solve this problem (Cembrowski, 2019), for ordinary researchers, various programming language analysis packages for scRNA-Seq data are really not an easy task, and for bioinformatics experts, they do not necessarily know the specific markers for CNS myeloid cell-subtype identifies. Therefore, building a bridge to connect the knowledge gap between ordinary researchers and bioinformatics experts is the key to solve this problem.

In this report, a simple excel template was designed, in which a panel of gene makers corresponding to the myeloid cells, lymphocytes, common CNS cells, and proliferative cells were included. For users, as long as the gene expression data of cell clusters are obtained, the clusters can be named directly using this excel template. It should be emphasized that this template is mainly suitable for determining the major categories of myeloid cells. If researchers need to further distinguish the subtypes of certain cells, it is necessary to add corresponding gene markers. Therefore, this Excel template is open, and researchers can modify or add new genes based on their need. In addition, in the selection of gene markers, we consider not only their relative specificity, but also the crossover and commonality of different cells. Therefore, in the Excel template, we defined the positive gene marker as “P”, negative as “N”, and if the marker could be positive or negative, we defined it “P/N” (Fig. 1 and Table S1). For example, Ptprc (the gene of CD45) was the common marker of myeloid cells and lymphocytes [34–36]. Therefore, we used it as a common marker of myeloid cells and lymphocytes to distinguish CNS non-myeloid cells (such as astrocytes, oligodendrocytes, neurons, etc.). In addition, in theory, the protein molecule CD45 expressed by Ptprc gene is positive in many leukocytes, but in the process of collecting gene markers and drawing the Excel template, we found that Ptprc gene is not expressed in every cell cluster, so we defined it as P/N. In addition to Ptprc, there are many similar examples. We will not list them one by one. Please see Fig. 1 and Table S1 for details. For a certain cell, although there are some relatively specific gene markers, we do not use a single or a small number of markers to identify it. We use a panel of gene markers to comprehensively evaluate it and then define it. This can effectively distinguish the cell-types with similar or cross gene expression and ensure the accuracy of cell cluster identification. In this Excel template, there are 73 gene markers (excluding non-myeloid CNS cells) in each panel can be used to distinguish myeloid cell-subtypes and lymphocytes (Fig. 1 and Table S1). For example, MNC could express Ptprc (P/N), Cd14 (P/N), Itgam (P/N), Itgax (P/N), Csf3r (P/N), Adgre1(P/N), Ly6c1 (P/N), S100a4 (P/N), Cd68 (P), Ly86 (P/N), Ctsb (P/N), Ccr2 (P/N), Ly6c2 (P), Plac8 (P), Pf4 (P/N), Lyz1 (P), Hmox1 (P/N), F13a1(P), Lyst (P/N), Prtn3 (P/N), Elane (P/N), and Pilra (P/N). Although, several molecules (Cd68, Ly6c2, Plac8 and Lyz1) are positive (P) in MNC, they are also expressed in other cells. Therefore, there is no absolute specific marker of MNC in this template. Nevertheless, we can still determine its cell type using comparative analysis. The typical examples can be found in table S4 (C8 and 11). For those cell-types with their own specific gene markers, it is easy to identify cell clusters using comparative analysis. Typical examples are Ms4a7, Lyve1, Cbr2, Mrc1 and CD163 for MAC; Hexb, Olfml3, Sparc, Tgfbr1, P2ry12 and Tmem119 for MG; Ltf, Ly6g, Mmp8, Camp, Ngp, Fcnb, Cebpe, Retnlg, S100a8, S100a9, Lcn2, G0s2, Wfdc21 for NEUT. Of course, due to the limitations of knowledge background and research level, this Excel template still has some defects. For example, for DC, the expressions of H2-Ab1, H2-Eb1, H2-Aa, Cd74 and Cd209a should be positive, but these markers can also be expressed in MAC and B cells, especially B cells do not belong to myeloid cells, which is easy to cause misjudgment. Therefore, in this template, we also added B cell markers to facilitate distinguish B cells from DC.

In order to verify the accuracy of this Excel template, the 83 cell clusters from several recently reported single-cell data were used (Table 1). The results showed that comparing with literatures, the overall consistency rate was 93.98%. The Bowker’s test showed that there was no statistically significant difference between the two groups (P >0.05). Kappa symmetric measures showed that the Kappa value = 0.642 (P < 0.01). These indicate that our method is general consistency with the literatures. Next, we will analyze the possible causes of inconsistency.

Comparing with the report of Ximerakis, et al. [31], only one cluster is inconsistent (Table 3). Our results showed that there were a few NEUT and DC mixed with their MNC. The possible reason is that they take Plac8 as a specific marker of MNC. In fact, Plac8 is also expressed in NEUT and DC [10]. Comparing with the cell-type identifies in adult brain of Han, et al. [10], the cluster 4 is inconsistent (Table 4). The reason may be that the reported cluster 4 was mixed with a few MG, because we can find the typical microglia markers (Hexb, Olfml3, Sparc, Tgfbr1, P2ry12 and Tmem119) in Table S3. Comparing with the report of Sankowski, et al.[32], the clusters 6 and 9 are inconsistent (Table 5). Both clusters were identified as CAMs, however, the expression of typical genes of MACs (Mrc1, Cd163, Lyve1, Pf4, Ms4a7, Stab1, and Cbr2) were not elevated in both clusters. In contrast, MG specific markers (Hexb, Olfml3, and Sparc) were significantly elevated in cluster 6, while the other genes in cluster 9 were not within the scope of our evaluation. Comparing with the cell-type identifies in peripheral blood and bone marrow of Han, et al. [10], excepting cluster 18 of peripheral blood was mixed with a few NEUT, the others were completely consistent. These indicate that our Excel template is also very effective for the analysis of non-CNS myeloid cells.

From the above analysis, we can deduce that the appropriate gene markers and ideal scRNA-Seq data clustering are key factors for the accuracy of cell definition. We can understand the importance of cell clustering through the following example. When we analyzed another data (Table S2 of Mimouna et al.) [33], both the reported and our results were not ideal. Analyzing the reasons, we find that their data clustering methods are different from the other literatures mentioned above. The cell clustering method in this literature is Louvain graph-based community clustering, which may be the reason why clustering is not ideal. Although, our Excel template still can be used to identify the cell-types based on the author’s data, the cell-types in each of the nine clusters were mixed (Table 6). Therefore, the data used in this Excel template should be processed through the standard scRNA-Seq analysis process, including quality control, standardization, data correction, feature selection and data dimensionality reduction, finally the cells were divided into different clusters according to the similarity of gene expression.

In conclusion, the cell identities of the scRNA-Seq data could be performed using our simple Excel formulae, a panel of gene markers must be compared to obtain accurate analysis of CNS myeloid cell-subtypes. For data with better cell clustering, this template could effectively distinguish myeloid cell-subtypes, various lymphocytes and other CNS cells. For data with poor clustering, this template could also identify various cell-types, but it would need to be further subdivided.

ARP: Astrocyte-restricted precursors

AST: Astrocytes

CAMs: CNS-associated macrophages

CNS: central nervous system

DC: Dendritic cells

EPC: Ependymocytes

FACS: fluorescence-activated cell sorting

HypEPC: Hypendymal cells

ImmN: Immature neurons

MAC: Macrophages

MG: Microglia

MNC: Monocytes

mNEUR: mature neurons

NEUT: Neutrophils

NRP: Neuronal-restricted precursors

OL: Oligodendrocytes

OPC: Oligodendrocyte precursor cells

scRNA-Seq: single-cell RNA Sequencing

Ethical Approval and Consent to participate

Not applicable

Consent for publication

Not applicable.

Availability of supporting data

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Competing interests

The authors declare that they have no competing interests.

Funding

This study was supported by grants from the National Natural Science Foundation of China (82072416 and 81772321).

Author contributions

HZL and JGH participated in study design, data interpretation and writing. XYL, JLL and SQD participated in literature search, data collection, data analysis tables and figures. All authors read and approved the final manuscript.

Acknowledgement

Not applicable.

Author disclosure statement

The authors declare no competing financial interest.

Herz J, Filiano AJ, Smith A, Yogev N, Kipnis J: Myeloid Cells in the Central Nervous System. Immunity 2017, 46:943-956.
Croese T, Castellani G, Schwartz M: Immune cell compartmentalization for brain surveillance and protection. Nat Immunol 2021, 22:1083-1092.
Prinz M, Erny D, Hagemeyer N: Ontogeny and homeostasis of CNS myeloid cells. Nat Immunol 2017, 18:385-392.
Ajami B, Samusik N, Wieghofer P, Ho PP, Crotti A, Bjornson Z, Prinz M, Fantl WJ, Nolan GP, Steinman L: Single-cell mass cytometry reveals distinct populations of brain myeloid cells in mouse neuroinflammation and neurodegeneration models. Nat Neurosci 2018, 21:541-551.
Manouchehri N, Hussain RZ, Cravens PD, Esaulova E, Artyomov MN, Edelson BT, Wu GF, Cross AH, Doelger R, Loof N, et al: CD11c(+)CD88(+)CD317(+) myeloid cells are critical mediators of persistent CNS autoimmunity. Proc Natl Acad Sci U S A 2021, 118.
Schwabenland M, Bruck W, Priller J, Stadelmann C, Lassmann H, Prinz M: Analyzing microglial phenotypes across neuropathologies: a practical guide. Acta Neuropathol 2021, 142:923-936.
Cembrowski MS: Single-cell transcriptomics as a framework and roadmap for understanding the brain. J Neurosci Methods 2019, 326:108353.
Zhang X, Lan Y, Xu J, Quan F, Zhao E, Deng C, Luo T, Xu L, Liao G, Yan M, et al: CellMarker: a manually curated resource of cell markers in human and mouse. Nucleic Acids Res 2019, 47:D721-D728.
Franzen O, Gan LM, Bjorkegren JLM: PanglaoDB: a web server for exploration of mouse and human single-cell RNA sequencing data. Database (Oxford) 2019, 2019.
Han X, Wang R, Zhou Y, Fei L, Sun H, Lai S, Saadatpour A, Zhou Z, Chen H, Ye F, et al: Mapping the Mouse Cell Atlas by Microwell-Seq. Cell 2018, 172:1091-1107 e1017.
Zhao X, Wu S, Fang N, Sun X, Fan J: Evaluation of single-cell classifiers for single-cell RNA sequencing data sets. Brief Bioinform 2020, 21:1581-1595.
Huang Q, Liu Y, Du Y, Garmire LX: Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data. Genomics Proteomics Bioinformatics 2020.
Aran D, Looney AP, Liu L, Wu E, Fong V, Hsu A, Chak S, Naikawadi RP, Wolters PJ, Abate AR, et al: Reference-based analysis of lung single-cell sequencing reveals a transitional profibrotic macrophage. Nat Immunol 2019, 20:163-172.
Pliner HA, Shendure J, Trapnell C: Supervised classification enables rapid annotation of cell atlases. Nat Methods 2019, 16:983-986.
Zhang AW, O'Flanagan C, Chavez EA, Lim JLP, Ceglia N, McPherson A, Wiens M, Walters P, Chan T, Hewitson B, et al: Probabilistic cell-type assignment of single-cell RNA-seq for tumor microenvironment profiling. Nat Methods 2019, 16:1007-1015.
Quintana FJ: Myeloid cells in the central nervous system: So similar, yet so different. Sci Immunol 2019, 4.
Lee MN, Lee Y, Wu D, Pae M: Luteolin inhibits NLRP3 inflammasome activation via blocking ASC oligomerization. J Nutr Biochem 2021, 92:108614.
Schulz C, Gomez Perdiguero E, Chorro L, Szabo-Rogers H, Cagnard N, Kierdorf K, Prinz M, Wu B, Jacobsen SE, Pollard JW, et al: A lineage of myeloid cells independent of Myb and hematopoietic stem cells. Science 2012, 336:86-90.
Summers KM, Bush SJ, Hume DA: Network analysis of transcriptomic diversity amongst resident tissue macrophages and dendritic cells in the mouse mononuclear phagocyte system. PLoS Biol 2020, 18:e3000859.
Kenkhuis B, Somarakis A, de Haan L, Dzyubachyk O, ME IJ, de Miranda N, Lelieveldt BPF, Dijkstra J, van Roon-Mom WMC, Hollt T, van der Weerd L: Iron loading is a prominent feature of activated microglia in Alzheimer's disease patients. Acta Neuropathol Commun 2021, 9:27.
Zrzavy T, Hametner S, Wimmer I, Butovsky O, Weiner HL, Lassmann H: Loss of 'homeostatic' microglia and patterns of their activation in active multiple sclerosis. Brain 2017, 140:1900-1913.
Milich LM, Choi JS, Ryan C, Cerqueira SR, Benavides S, Yahn SL, Tsoulfas P, Lee JK: Single-cell analysis of the cellular heterogeneity and interactions in the injured mouse spinal cord. J Exp Med 2021, 218.
Niehaus JK, Taylor-Blake B, Loo L, Simon JM, Zylka MJ: Spinal macrophages resolve nociceptive hypersensitivity after peripheral injury. Neuron 2021, 109:1274-1282 e1276.
Abe N, Nishihara T, Yorozuya T, Tanaka J: Microglia and Macrophages in the Pathological Central and Peripheral Nervous Systems. Cells 2020, 9.
Plemel JR, Stratton JA, Michaels NJ, Rawji KS, Zhang E, Sinha S, Baaklini CS, Dong Y, Ho M, Thorburn K, et al: Microglia response following acute demyelination is heterogeneous and limits infiltrating macrophage dispersion. Sci Adv 2020, 6:eaay6324.
Xiao Y, Hu X, Fan S, Zhong J, Mo X, Liu X, Hu Y: Single-Cell Transcriptome Profiling Reveals the Suppressive Role of Retinal Neurons in Microglia Activation Under Diabetes Mellitus. Front Cell Dev Biol 2021, 9:680947.
Mrdjen D, Pavlovic A, Hartmann FJ, Schreiner B, Utz SG, Leung BP, Lelios I, Heppner FL, Kipnis J, Merkler D, et al: High-Dimensional Single-Cell Mapping of Central Nervous System Immune Cells Reveals Distinct Myeloid Subsets in Health, Aging, and Disease. Immunity 2018, 48:380-395 e386.
Somebang K, Rudolph J, Imhof I, Li L, Niemi EC, Shigenaga J, Tran H, Gill TM, Lo I, Zabel BA, et al: CCR2 deficiency alters activation of microglia subsets in traumatic brain injury. Cell Rep 2021, 36:109727.
Wahane S, Zhou X, Zhou X, Guo L, Friedl MS, Kluge M, Ramakrishnan A, Shen L, Friedel CC, Zhang B, et al: Diversified transcriptional responses of myeloid and glial cells in spinal cord injury shaped by HDAC3 activity. Sci Adv 2021, 7.
David S, Kroner A, Greenhalgh AD, Zarruk JG, Lopez-Vales R: Myeloid cell responses after spinal cord injury. J Neuroimmunol 2018, 321:97-108.
Ximerakis M, Lipnick SL, Innes BT, Simmons SK, Adiconis X, Dionne D, Mayweather BA, Nguyen L, Niziolek Z, Ozek C, et al: Single-cell transcriptomic profiling of the aging mouse brain. Nat Neurosci 2019, 22:1696-1708.
Sankowski R, Ahmari J, Mezo C, Hrabe de Angelis AL, Fuchs V, Utermohlen O, Buch T, Blank T, Gomez de Aguero M, Macpherson AJ, Erny D: Commensal microbiota divergently affect myeloid subsets in the mammalian central nervous system during homeostasis and disease. EMBO J 2021:e108605.
Mimouna S, Rollins DA, Shibu G, Tharmalingam B, Deochand DK, Chen X, Oliver D, Chinenov Y, Rogatsky I: Transcription cofactor GRIP1 differentially affects myeloid cell-driven neuroinflammation and response to IFN-beta therapy. J Exp Med 2021, 218.
Hermiston ML, Xu Z, Weiss A: CD45: a critical regulator of signaling thresholds in immune cells. Annu Rev Immunol 2003, 21:107-137.
Thomas ML: The leukocyte common antigen family. Annu Rev Immunol 1989, 7:339-369.
Rosenberg AB, Roco CM, Muscat RA, Kuchina A, Sample P, Yao Z, Graybuck LT, Peeler DJ, Mukherjee S, Chen W, et al: Single-cell profiling of the developing mouse brain and spinal cord with split-pool barcoding. Science 2018, 360:176-182.

Table 1 The sources of gene expression data used in this paper

Data	Mice	Tissue	single cell	scRNA-seq	Clustering	Cluster annotation
Ximerakis, et al. Nat Neurosci. 2019; 22(10):1696-1708. Table S3	C57BL/6J mice (male, 2-3 months of age, and 21-22 months of age).	8 young and 8 old brains	Dissociated brain	Chromium Single Cell 3′ Chip (10x Genomics), the sequencing was performed on NextSeq 500 instrument (Illumina)	Seurat package (v.2.3) in R (v.3.3.4)	Using multiple cell-type-specific/enriched marker genes that have been previously described in the literature (Plac8 for MNC).
Han, et al. Cell. 2018; 172(5):1091-1107.e17. Table S4	Wild-type C57BL/6J mice (SPF, female, 6-10 week-old).	Brain, blood and bone marrow	Brain was dissociated using accutase; bone marrow was treated red blood cell lysis buffer; blood was treated red blood cell lysis buffer or Ficoll separation	Microwell-seq, the 3' ends of the transcripts are then enriched during library generation using PCR and sequenced using the Illumina Hiseq platform	Seurat was used for dimension reduction, clustering and differential gene expression analysis.	Single cell MCA (scMCA) analysis built by authors (Fig 7a)
Sankowski, et al. EMBO J. 2021; e108605. embj2021108605-sup-0008-datasetev1	SPF and GF C57BL/6J mice (mixed sex, 6-10 weeks old)	The brain parenchyma, choroid plexus, leptomeninges, and perivascular space (20 mice per group).	Parenchyma and perivascular space cells were isolated using Percoll gradient. The choroid plexuses and leptomeninges were treated by mechanical dissociation through a 70 micron cell strainer. Viable CD11b⁺CD45⁺CD3^-B220^-Ly6G^-cells were FACS-isolated.	High-throughput scRNA-seq using the high-sensitivity method mCEL-Seq2, the sequencing was performed on Illumina HiSeq 3000 sequencing system (pair-end multiplexing run) at a depth of 130,000–200,000 reads per cell.	Seurat version 3	Generating maps for the myeloid cell populations based on published signature genes (Jordao et al, 2019). Fig 1B
J Exp Med. 2021;218(1):e20192386. Table S2	C57BL/6 mice (mixed sex, 6–10 weeks old)	EAE mouse spinal cord	CNS-infiltrating cells were isolated using Percoll density gradient. F4/80⁺CD11b⁺CD45⁺ cells were sorted using FACS.	Chromium Single Cell 3′ Chip (10x Genomics), The sequencing was performed on the Illumina NovaSeq system using a 28-8-98 paired-end cycle.	R version 4.0.1 software (R Core Team, 2019), fastMNN implementation, Louvain graph–based community clustering.	Cluster-specific markers were searched using the Wilcoxon rank-sum test. An automated cell type assignment was performed with singleR using training sets derived from the Immunological Genome Project database. PanglaoDB was used to identify putative cell and/or activation state for each individual Louvain cluster. The cell type and cell activation state transitions were identified by performing trajectory analysis with slingshot.

Table 2 The grade evaluation criterion of cell identity

Consistency	Accuracy	Grade
Consistent	Both completely accurate	Both excellent (A)
	Both partially accurate	Both satisfactory (B)
	Neither is accurate	Both poor (C)
Non-consistent	One is completely accurate	Excellent (A)
	One is partially accurate	Satisfactory (B)
	One is not accurate	Poor (C)

Table 3 Comparison of the cell-type identifies with Ximerakis, et al.

Clusters	Reported cell types	Our cell types	Consistency	Reason
MNC	MNC	MNC (mixed with a few NEUT and DC)	Part	Plac8 is also expressed in NEUT and DC
MAC	MAC	MAC	Yes
MG	MG	MG	Yes
NEUT	NEUT	NEUT	Yes
DC	DC	DC	Yes
NRP	NRP	Proliferative cells	N/A	not within the scope of our evaluation
ImmN	ImmN	neuron	Yes
mNEUR	mNEUR	neuron	Yes
ARP	ARP	AST	Yes
AST	AST	AST	Yes
OPC	OPC	OPC	Yes
OL	OL	OL	Yes
EPC	EPC	Ependymal	Yes
HypEPC	HypEPC	Ependymal	Yes

Abbreviations:

ARP: Astrocyte-restricted precursors

AST: Astrocytes

DC: Dendritic cells

EPC: Ependymocytes (a kind of ependymal cells)

HypEPC: Hypendymal cells (a kind of ependymal cells)

ImmN: Immature neurons

MAC: Macrophages

MG: Microglia

MNC: Monocytes

mNEUR: mature neurons

NEUT: Neutrophils

NRP: Neuronal-restricted precursors

OL: Oligodendrocytes

OPC : Oligodendrocyte precursor cells

Table 4 Comparison of the cell-type identifies in adult brain with Han, et al.

Clusters	Reported cell types	Our cell types	Consistency	Reason
1	Myelinating oligodendrocyte	OL	Yes
2	Microglia	MG	Yes
3	Astrocyte_Mfe8 high	AST	Yes
4	Macrophage_Klf2 high	MAC/MG	Part	The reported cluster 4 was mixed with a few MG
5	Astrocyte_Atp1b2 high	AST	Yes
6	Oligodendrocyte precursor cell	OPC	Yes
7	Neuron	Neuron	Yes
8	Macrophage_Lyz2 high	MAC	Yes
9	Astroglial cell (Bergman glia)	AST	Yes
10	Pan-GABAergic	Proliferative cells	N/A	not within the scope of our evaluation
11	Astrocyte_Pla2g7 high	AST	Yes
12	Schwann cell	Unkonw	N/A	not within the scope of our evaluation
13	Granulocyte_Il33 high	NEUT	Yes
14	Hypothalamic ependymal cell	Ependymal cells	Yes
15	Granulocyte_Ngp high	NEUT	Yes

Abbreviations:

AST: Astrocytes

DC: Dendritic cells

MAC: Macrophages

MG: Microglia

MNC: Monocytes

NEUT: Neutrophils

OL: Oligodendrocytes

OPC : Oligodendrocyte precursor cells

Table 5 Comparison of the cell-type identifies with Sankowski, et al.

Clusters	Reported cell types	Our cell types	Consistency	Reason
C0	MG	MG	Yes
C1	CAMs	MAC	Yes
C2	MG	MG	Yes
C3	CAMs	MAC	Yes
C4	CAMs	MAC	Yes
C5	MG	MG	Yes
C6	CAMs	MG	No	The expression of typical genes of MAC including Mrc1, Cd163, Lyve1, Pf4, Ms4a7, Stab1, and Cbr2 were not elevated. In contrast, MG specific markers Hexb, Olfml3 and Sparc were significantly elevated.
C7	CAMs	MAC	Yes
C8	Ly6c^low monocytes	MNC	Yes
C9	CAMs	Unknow	N/A	The expression of typical genes of MAC including Mrc1, Cd163, Lyve1, Pf4, Ms4a7, Stab1, and Cbr2 were not elevated. The other genes were not within the scope of our evaluation.
C10	MG	MG	Yes
C11	Ly6c^hi monocytes	MNC	Yes
C12	DCs	DC	Yes
C13	CAMs	MAC	Yes
C14	Prolif. Cells	Prolif. Cells	Yes
C15	Stromal cells	Unknow	N/A	not within the scope of our evaluation
C16	Lymphocytes	NK	Yes

Abbreviations:

CAMs: central nervous system (CNS)-associated macrophages

DC: Dendritic cells

MAC: Macrophages

MG: Microglia

MNC: Monocytes

NEUT: Neutrophils

NK: Natural killer cells

Table 6 Comparison of the cell-type identifies with Mimouna et al.

Clusters	Reported cell types	Our cell types	Consistency	Reason
C1	MAC/MG/others	MAC/MG/others	Yes	Cell clustering was not ideal.
C2	MAC/MG/NEUT	MAC/MG/NEUT	Yes
C3	MNC/MAC/MG	MAC/MG/NEUT	Part
C4	MAC/MG/NEUT	MAC/MG/NEUT	Yes
C5	MNC/MAC	MAC/MG/NEUT	Part
C6	NEUT	MAC/MG/NEUT	Part
C7	MAC/MG/others	MAC/MG/NEUT	Yes
C8	T/others	MAC/MG/NEUT	Part
C9	MNC/MAC	MAC/MG/NEUT	Part

Abbreviations:

MAC: Macrophages

MG: Microglia

MNC: Monocytes

NEUT: Neutrophils

Table 7 Comparison of the cell-type identifies in peripheral blood with Han, et al.

Clusters	Reported cell types	Our cell types	Consistency	Reason
1	T cell_Trbc2 high	T	Yes
2	B cell_Ly6d high	B	Yes
3	Macrophage_S100a4 high	MAC	Yes
4	Neutrophil_Retnlg high	NEUT	Yes
5	Neutrophil_Ltf high	NEUT	Yes
6	Neutrophil_Camp high	NEUT	Yes
7	Neutrophil_Il1b high	NEUT	Yes
8	NK cell_Gzma high	NK	Yes
9	Macrophage_Ace high	MAC	Yes
10	Monocyte_Elane high	MNC	Yes
11	B cell_Vpreb3 high	B	Yes
12	Monocyte_F13a1 high	MNC	Yes
13	T cell_Gm14303 high	T	Yes
14	Erythroblast_Car2 high	Proliferative cells	N/A	not within the scope of our evaluation
15	B cell_Rps27rt high	B	Yes
16	Dendritic cell_Siglech high	DC	Yes
17	Basophil_Prss34 high	Unknow	N/A
18	Macrophage_Pf4 high	MAC/NEUT	Part	The reported cluster 18 was mixed with a few NEUT
19	B cell_Igha high	Unknow	N/A	not within the scope of our evaluation
20	Macrophage_Flt-ps1 high	MAC	Yes
21	Erythroblast_Hba-a2 high	Unknow	N/A	not within the scope of our evaluation

Abbreviations:

B: B cells

DC: Dendritic cells

MAC: Macrophages

MG: Microglia

MNC: Monocytes

NEUT: Neutrophils

NK: NK cells

T: T cells

Table 8 Comparison of the cell-type identifies in bone marrow with Han, et al.

Clusters	Reported cell types	Our cell types	Consistency	Reason
1	Neutrophil_Cebpe high	NEUT	Yes
2	Neutrophil_Mmp8 high	NEUT	Yes
3	Neutrophil progenitor	MNC/MAC/NEUT	N/A	not within the scope of our evaluation
4	Monocyte_Prtn3 high	MNC	Yes
5	Macrophage_Ms4a6c high	MAC	Yes
6	Neutrophil_Ngp high	NEUT	Yes
7	Pre-pro B cell	B	Yes
8	Hematopoietic stem progenitor cell	Unknow	N/A	not within the scope of our evaluation
9	Erythroblast	Proliferative unknow cell	N/A	not within the scope of our evaluation
10	Neutrophil_Fcnb high	NEUT	Yes
11	B cell_Igkc high	B	Yes
12	Macrophage_S100a4 high	MAC	Yes
13	T cell_Ms4a4b high	T	Yes
14	Dendritic cell_Siglech high	DC	Yes
15	Mast cell	Unknow	N/A	not within the scope of our evaluation
16	Dendritic cell_H2-Eb1 high	DC	Yes
17	Monocyte_Mif high	MNC	Yes

Abbreviations:

B: B cells

DC: Dendritic cells

MAC: Macrophages

MG: Microglia

MNC: Monocytes

NEUT: Neutrophils

NK: NK cells

T: T cells

Download PDF

Version 1

posted

You are reading this latest preprint version

Excel Template for Identifying Mouse Myeloid Cell-Types in the Central Nervous System Based on Single-Cell RNA Sequencing Data

Status:

Version 1

Abstract

Background

Methods

Results

Conclusions

Figures

Background

Methods

Results

Discussion

Conclusions

Abbreviations

Declarations

References

Tables

Supplementary Files

Status:

Version 1