Single-cell and transcriptome analysis reveals TAL cells in diabetic nephropathy

doi:10.21203/rs.3.rs-3030338/v1


Research Article


https://doi.org/10.21203/rs.3.rs-3030338/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 08 Sep, 2023

Read the published version in Functional & Integrative Genomics →


Diabetic nephropathy is a global public health concern with multifaceted pathogenesis, primarily involving hypertension. Excessive activation of AT1R has been strongly associated with hypertension onset and progression in diabetic nephropathy. This study aimed to conduct single-cell and transcriptomic analysis of thick ascending limb (TAL) cells in diabetic nephropathy, including biomarker screening, cell communication, and immune infiltration, to identify potential biomarkers and effective means of prevention and treatment. Using high-dimensional weighted gene co-expression network analysis (hdWGCNA), Least Absolute Shrinkage and Selection Operator (LASSO) regression, machine learning, a convolutional neural network, pseudotime analysis, Non-negative Matrix Factorization (NMF) clustering, and Microenvironment Cell Populations-counter (MCP-counter), we identified 7 potential TAL cell biomarkers for diabetic nephropathy and elucidated the Bone Morphogenetic Protein (BMP) pathway's regulation of TAL cells through parietal epithelial cells and podocytes. The study also highlighted the role of COBL, PPARGC1A, and THSD7A in NMF clustering and their relationship with TAL cell immunity in diabetic nephropathy. Our findings provide new insights and avenues for managing diabetic nephropathy, ultimately alleviating the burden on patients and society.

Biological Markers

Quasi-timing Analysis

TAL cells

Diabetic Nephropathy

Single-cell Analysis

Diabetic nephropathy is a prevalent complication of diabetes and the primary cause of end-stage renal disease worldwide. It afflicts up to 40% of individuals with type 1 diabetes and up to 30% of those with type 2 diabetes[1]. The financial burden of diabetic nephropathy is staggering, and early detection and intervention are crucial to forestall or slow its progression. Diabetic nephropathy is characterized by hypertension, progressive albuminuria, glomerulosclerosis, and diminished glomerular filtration rate (GFR). The primary causes of hypertension in type 1 and type 2 diabetes mellitus include volume expansion due to augmented renal sodium reabsorption and peripheral vasoconstriction due to dysregulation of peripheral vascular resistance[2]. Research has demonstrated that, for patients with diabetic nephropathy, stringent blood pressure management can effectively slow the deterioration of renal function, reduce the risk of cardiovascular events, and improve quality of life[3]. Angiotensin-converting enzyme inhibitors and angiotensin II receptor blockers, which lower blood pressure and mitigate renal damage by inhibiting the renin-angiotensin-aldosterone system (RAAS), play a pivotal role in controlling hypertension in diabetic nephropathy[4]. In diabetic nephropathy, activation of the RAAS, upregulation of endothelin 1 (ET-1), increased reactive oxygen species, and downregulation of nitric oxide (NO) combine to induce hypertension[5]. The angiotensin II type 1 receptor (AT1R), a G protein-coupled receptor primarily responsible for mediating the biological effects of the RAAS, is present on renal thick ascending limb (TAL) cells. In TAL cells, AT1R senses angiotensin II (Ang II), thereby regulating the reabsorption of sodium, potassium, chloride, and other ions[6]. Studies have shown that excessive activation of AT1R is strongly associated with the onset and progression of hypertension[7]. Further research is warranted to explore the significance of TAL cells in controlling hypertension in diabetic nephropathy and delaying its progression[8].

The thick ascending limb (TAL) of the loop of Henle plays a pivotal role in regulating the salt and water balance of the kidneys[9]. AT1 receptors are present on TAL cells, and Ang II binds to renal AT1 receptors, augmenting sodium reabsorption in the proximal tubules of the kidneys and stimulating the release of aldosterone from the adrenal cortex[10]. These effects, in conjunction with aldosterone-stimulated sodium reabsorption in the collecting ducts, result in vasoconstriction, sodium retention, and heightened blood pressure[8].

Several previous studies have screened biomarkers for diabetic nephropathy (DN). In one study, PRKAR2B and TGFBI were identified as potential biomarkers of DN using Weighted Gene Co-expression Network Analysis, Least Absolute Shrinkage and Selection Operator, Support Vector Machine-Recursive Feature Elimination, and Random Forest algorithms. A diagnostic model combining PRKAR2B and TGFBI was established to assess the risk of diabetic glomerular injury with high sensitivity and accuracy, and a potential association with infiltrating immune cells was demonstrated, providing a novel perspective on their roles in DN[12]. Another study identified 92 immune score-related DEGs (ISRDEGs) that were enriched in inflammation- and immune-associated pathways; correlation analysis indicated that the expression of LCK and HCK was positively correlated with aDC, CD4+ Tem, CD8+ T cells, CD8+ Tem, and mast cells[13]. These findings suggest that immune cells may play a significant role in the development and progression of DN. In a further study, four candidate genes (FN1, C1QA, C1QB, and CD44) were identified as potentially involved in the development of DN. Investigation of the biological functions of FN1 revealed that it was positively correlated with THBS2, COL1A2, COL6A3, and CD44 and involved in the development of DN through the ECM-receptor interaction pathway, suggesting that FN1, THBS2, COL1A2, COL6A3, and CD44 may be novel biomarkers and therapeutic targets for DN. IDO1 has also been identified as a diagnostic and prognostic biomarker for DN and was shown, for the first time using microarray data and Cell-type Identification By Estimating Relative Subsets Of RNA Transcripts (CIBERSORT), to play a vital role in immune cell infiltration in DN[11]. IDO1 may therefore be a potential therapeutic target for DN, and further research is needed to explore its role in the disease.
Another study identified seven candidate BM-related genes as potential biomarkers for diabetic complications. Enrichment analysis revealed the involvement of extracellular matrix organization, the PI3K-Akt signaling pathway, and the AGE-RAGE signaling pathway in diabetic complications[14]. It has also been reported that PP2 may ameliorate renal fibrosis in diabetic mice by decreasing the expression of PPARγ and UCP2 and attenuating epithelial-mesenchymal transition. These findings suggest potential targets for the development of new therapies for diabetic complications[15].

The present study identified seven potential TAL biomarkers for diabetic nephropathy: ESRRG, IGF1R, PTGER3, LAMB1, CYFIP2, EPN1, and MAST4. Of these biomarkers, IGF1R was expressed early and may dock favorably with the diabetes drug Empagliflozin. The study also found that TAL cells in diabetic nephropathy are regulated by parietal epithelial cells (PEC) and podocytes (PODO) through the bone morphogenetic protein (BMP) pathway, which may lead to Papillary Collecting Duct and Proximal Convoluted Tubule cytopathies. Further investigation of the BMP pathway of TAL cells could be important in understanding the lesions of PC cells in diabetic nephropathy. The NMF clustering results showed that genes related to the immunity of TAL cells in diabetic nephropathy may include COBL, PPARGC1A, and THSD7A. These findings provide new insights into the prevention and treatment of DN.

1. Transcriptome data acquisition and processing

Transcriptome datasets for diabetic nephropathy were searched in the GEO database, and two datasets, GSE30529 and GSE104954, were selected. GSE30529 comprised 10 diabetic nephropathy kidney samples and 12 control samples, while GSE104954 included 7 diabetic nephropathy samples and 18 control samples. For GSE30529, the expression matrix of the chip data was obtained using geoChina, and the clinical information was extracted using the pData function. The normalizeBetweenArrays function was used to normalize the arrays and reduce batch effects, and the annotation file of platform GPL571 was read using the fread function to convert probe IDs to gene names. For GSE104954, the high-throughput sequencing expression matrix was read using the fread function and normalized in the same way, and the GPL22945 platform annotation was used to convert probe IDs to gene names. Supplementary picture 1 shows the flowchart of this study.
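As a rough illustration of the between-array normalization step, the sketch below implements quantile normalization, which is the default method of limma's normalizeBetweenArrays for single-channel data. It is written in Python purely for illustration (the study itself used R), and the toy matrix and the function name quantile_normalize are assumptions, not part of the original analysis.

```python
def quantile_normalize(matrix):
    """Quantile-normalize the columns (samples) of a genes x samples matrix."""
    n_genes = len(matrix)
    n_samples = len(matrix[0])
    # Sort the values of each sample independently.
    sorted_cols = [sorted(row[j] for row in matrix) for j in range(n_samples)]
    # Reference distribution: mean of the k-th smallest value across samples.
    reference = [sum(col[k] for col in sorted_cols) / n_samples
                 for k in range(n_genes)]
    result = [[0.0] * n_samples for _ in range(n_genes)]
    for j in range(n_samples):
        # Rank genes within sample j. Ties are broken by row order here;
        # production tools typically average tied ranks instead.
        order = sorted(range(n_genes), key=lambda i: matrix[i][j])
        for rank, i in enumerate(order):
            result[i][j] = reference[rank]
    return result


# Toy expression matrix: 4 genes (rows) x 3 samples (columns).
toy = [[5.0, 4.0, 3.0],
       [2.0, 1.0, 4.0],
       [3.0, 4.0, 6.0],
       [4.0, 2.0, 8.0]]
normalized = quantile_normalize(toy)
```

After normalization every sample shares the same set of values, so differences in overall intensity between arrays no longer dominate downstream comparisons.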

2. Single-Cell Analysis of Diabetic Nephropathy Datasets

The GEO database was searched for a single-cell dataset of diabetic nephropathy, and the 10× data were downloaded. The Read10X function was used to read the data, and a Seurat object was created using the CreateSeuratObject function. The samples were merged using the merge function, and the number of cells was checked. Quality control was performed on the single-cell data: the mitochondrial ratio was calculated using the PercentageFeatureSet function, the UMI count and the number of detected genes per cell were calculated, and an appropriate nFeature range was selected. The data were normalized using LogNormalize, and highly variable genes were selected using FindVariableFeatures. PCA dimensionality reduction was performed using RunPCA, and the PCA results were visualized using the VizDimReduction, DimPlot, and DimHeatmap functions. Before cell clustering, the RunHarmony function was used to integrate the samples and correct batch effects[16]. The FindClusters function was used for cell clustering, and the DimPlot function was used to display the UMAP plot. After clustering, the cells were annotated with reference to the marker genes reported in the published article for the GSE131882 dataset.
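The LogNormalize step can be sketched as follows: each cell's counts are divided by that cell's total count, multiplied by a scale factor, and transformed with the natural log of one plus the value. The Python function below is an illustrative assumption (the study used Seurat in R); the scale factor of 10000 is Seurat's documented default, not a value stated in the text.

```python
import math


def log_normalize(counts, scale_factor=10000):
    """Library-size normalization followed by log1p.

    counts: list of cells, each a list of raw UMI counts per gene.
    """
    normalized = []
    for cell in counts:
        total = sum(cell)
        if total == 0:
            # A cell with no counts stays at zero rather than dividing by zero.
            normalized.append([0.0] * len(cell))
            continue
        normalized.append(
            [math.log1p(c / total * scale_factor) for c in cell]
        )
    return normalized


# Two toy cells with three genes each (hypothetical counts).
cells = [[0, 2, 8], [5, 5, 0]]
norm = log_normalize(cells)
```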

3. Single-cell pseudotime analysis and cell communication analysis

Pseudotime analysis is performed with the Monocle 2 package from Bioconductor[17]. First, TAL cells are extracted from the single-cell data, and the negative binomial distribution is specified as the expressionFamily parameter of newCellDataSet. The estimateSizeFactors and estimateDispersions functions are then used to estimate size factors and dispersions. Highly variable genes are selected for analysis, and the reduceDimension function is used for dimensionality reduction. The orderCells function is used to order cells along the trajectory, and the state trajectory and pseudotime trajectory plots are displayed using the plot_cell_trajectory function. To cluster genes according to their pseudotime expression patterns, the plot_pseudotime_heatmap function takes a CellDataSet object and generates smooth expression curves[18,19]. These genes are then clustered and plotted using the pheatmap package to visualize how gene modules change together over pseudotime.

For cell communication analysis, CellChat is used. First, TAL cells are extracted, and the TAL group is divided into diabetic nephropathy-related TAL cells and other TAL cells using conditional (if) logic. The createCellChat function is used to create the CellChat object. CellChatDB.human, a human ligand-receptor database, is selected for ligand-receptor analysis. The subsetData function is used to extract the relevant data subset, and the identifyOverExpressedGenes function is used to identify overexpressed genes. The identifyOverExpressedInteractions function is used to identify ligand-receptor pairs, and the projectData function is used to project ligands and receptors onto the PPI network. Finally, the computeCommunProb function is used to calculate the communication network between cells, and the mynetVisual_aggregate function draws the classic ligand-receptor circle plot. The netAnalysis_signalingRole function identifies primary senders, receivers, mediators, and influencers. The netVisual_bubble function is used to display the communication of signal-receiving and signal-sending cells, respectively.

4. hdWGCNA (Single Cell Co-Expression Network Analysis)

To begin the analysis, we filtered genes that were expressed in at least 5% of cells in the entire dataset. We then utilized the K-nearest neighbors (KNN) algorithm to identify groups of similar cells and calculate the average or total expression of these cells, resulting in a metacell gene expression matrix with K = 20 in the KNN algorithm. We grouped the information by tissue type and cell type and normalized the metacell expression matrix.

Next, we performed co-expression network analysis. We set up the expression matrix using the SetDatExpr function with the name parameter to extract the cell matrix of interest for analysis. We then used the TestSoftPowers function to evaluate, across a range of soft power thresholds, how closely the co-expression network approximates a scale-free topology. We visualized the results of the parameter sweep using the PlotSoftPowers function and inspected them further using the GetPowerTable function. We built a co-expression network using the ConstructNetwork function and inspected the topological overlap matrix using the GetTOM function. We then computed harmonized module eigengenes (hMEs) using the ModuleEigengenes function.
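To make the role of the soft power concrete, the sketch below raises pairwise gene correlations to a power to obtain a weighted adjacency, which is the general WGCNA idea behind the soft power threshold. It is a minimal Python illustration, not the hdWGCNA implementation: the toy data, the function names, and the choice of a signed network are assumptions, and only the power of 5 comes from the Results.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length lists."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def soft_threshold_adjacency(expr, power=5, signed=True):
    """Weighted adjacency from correlations raised to a soft power.

    expr: dict mapping gene name -> list of metacell expression values.
    signed=True uses (0.5 + 0.5 * r) ** power; otherwise |r| ** power.
    """
    genes = list(expr)
    adjacency = {}
    for g1 in genes:
        for g2 in genes:
            r = pearson(expr[g1], expr[g2])
            base = 0.5 + 0.5 * r if signed else abs(r)
            adjacency[(g1, g2)] = base ** power
    return adjacency


# Hypothetical metacell expression for three genes.
toy_expr = {
    "geneA": [1.0, 2.0, 3.0, 4.0],
    "geneB": [2.0, 4.1, 5.9, 8.0],
    "geneC": [4.0, 3.0, 2.0, 1.0],
}
adj = soft_threshold_adjacency(toy_expr, power=5)
```

Raising correlations to a power suppresses weak correlations much more than strong ones, which is what pushes the network toward a scale-free topology as the power increases.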

To identify intra-module hub genes, we calculated kME values in single-cell datasets using the ModuleConnectivity function and used the GetHubGenes function. We also computed gene scoring for the top 25 hub genes by kME for each module using Seurat's AddModuleScore function. We constructed FeaturePlots for each coexpression module colored by each module's uniquely assigned color using the ModuleFeaturePlot function.

To visualize the correlation between each module based on their hMEs or hub gene scores, we used the corrplot R package and the ModuleCorrelogram function. Finally, we flipped the x/y axes, rotated the axis labels, and changed the color scheme for improved visualization[20].

5. Screening genes

For univariate logistic regression, the glm() function is used with family = binomial(link='logit'), and the OR, 95% CI, and P value are extracted manually. The lrm function of the rms package directly outputs the 95% CI and P value, while the epiDisplay package outputs the OR, 95% CI, and P value. The gtsummary package is used to output the three-line table.
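The manual extraction of the OR and 95% CI amounts to exponentiating the logistic regression coefficient and its confidence limits. The Python sketch below uses the Wald approximation (coefficient ± 1.96 standard errors); this is an assumption for illustration, since the text does not say which interval method was used, and the coefficient and standard error shown are hypothetical.

```python
import math


def odds_ratio_with_ci(beta, se, z=1.96):
    """Odds ratio and Wald 95% CI from a logistic coefficient and its SE."""
    odds_ratio = math.exp(beta)
    lower = math.exp(beta - z * se)
    upper = math.exp(beta + z * se)
    return odds_ratio, lower, upper


# Hypothetical coefficient and standard error, for illustration only.
or_value, ci_lower, ci_upper = odds_ratio_with_ci(beta=-0.8, se=0.3)
```

An OR below 1 with an upper confidence limit below 1 would indicate that higher expression of the gene is associated with lower odds of disease.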

For Lasso regression dimensionality reduction, the glmnet function is used to build a model, where x is the matrix of predictor variables, y is the outcome, and family="binomial" is set. The coefficient paths are plotted with the plot function, setting xvar="lambda". Cross-validation is performed with glmnet's built-in cv.glmnet function: cvfit$lambda.min gives the λ with the minimum cross-validated error, while cvfit$lambda.1se gives the largest λ whose error is within one standard error of that minimum[21]. These two values are substituted into the model to view the results and obtain the OR and 95% CI.
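The difference between the two λ choices can be sketched as follows. Given a cross-validation curve, lambda.min is the λ with the lowest mean error, and lambda.1se is the largest λ whose mean error does not exceed that minimum plus its standard error. The Python function and the numbers below are hypothetical and only mirror the selection rule, not glmnet's internals.

```python
def select_lambdas(lambdas, cv_mean, cv_se):
    """Return (lambda_min, lambda_1se) from a cross-validation curve."""
    # Index of the lambda with the smallest mean cross-validated error.
    i_min = min(range(len(lambdas)), key=lambda i: cv_mean[i])
    threshold = cv_mean[i_min] + cv_se[i_min]
    lambda_min = lambdas[i_min]
    # Largest lambda (most regularized model) still within one SE.
    lambda_1se = max(
        lam for lam, err in zip(lambdas, cv_mean) if err <= threshold
    )
    return lambda_min, lambda_1se


# Hypothetical cross-validation results.
lambdas = [0.50, 0.20, 0.10, 0.05, 0.01]
cv_mean = [1.30, 1.05, 0.98, 0.95, 0.97]
cv_se = [0.06, 0.05, 0.05, 0.04, 0.05]
lam_min, lam_1se = select_lambdas(lambdas, cv_mean, cv_se)
```

Choosing lambda.1se trades a small amount of cross-validated accuracy for a sparser model, which usually retains fewer genes.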

For machine learning, the mlr3verse package is used[22]. The dataset is divided into a training set and a validation set, and the as_task_classif function creates a classification task. A learner is then defined, and the regr.rpart learner, which is applicable to regression tasks, is selected. The learner's train method is used to fit the training-set task, and the trained model is stored in the learner's model attribute. The KNN algorithm is used for machine learning, and the rsmp function is used for repeated K-fold cross-validation with folds = 5 and repeats = 10. The AUC and the sensitivity and specificity are then calculated, and the result figures are produced using ggplot2 and ggpubr. In addition, an SVM (support vector machine) model is used for further modeling. The second dataset is used for independent external testing, and the plot.roc function is used to draw the ROC curves.
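Repeated K-fold cross-validation with folds = 5 and repeats = 10 yields 50 train/test splits. The Python sketch below shows how such splits could be generated for a 22-sample training set; it is an illustrative assumption rather than mlr3's implementation, and it does not stratify by class.

```python
import random


def repeated_kfold(n_samples, folds=5, repeats=10, seed=0):
    """Generate (train_indices, test_indices) pairs for repeated K-fold CV."""
    rng = random.Random(seed)
    splits = []
    for _ in range(repeats):
        # Reshuffle the sample order at the start of every repeat.
        indices = list(range(n_samples))
        rng.shuffle(indices)
        for k in range(folds):
            # Every folds-th sample, starting at offset k, forms the test fold.
            test = indices[k::folds]
            test_set = set(test)
            train = [i for i in indices if i not in test_set]
            splits.append((train, test))
    return splits


# 22 samples, matching the size of the GSE30529 training set.
splits = repeated_kfold(n_samples=22)
```

Each sample appears in exactly one test fold per repeat, so performance estimates are averaged over 50 held-out evaluations.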

6. MCP-counter immune infiltration analysis

Immune infiltration analysis with the MCP-counter package quantifies, from transcriptome data, the abundance of 8 immune cell populations and 2 stromal cell populations in heterogeneous tissues. The eight immune populations are CD3+ T cells, CD8+ T cells, cytotoxic lymphocytes, NK cells, B lymphocytes, cells originating from monocytes (monocytic lineage), myeloid dendritic cells, and neutrophils. The two stromal populations are endothelial cells and fibroblasts. To estimate the abundance of infiltrating cells, the expression matrix of the potential biomarker genes and the probesets file are required. The MCPcounter.estimate function is then used, and its output is an abundance score for each immune and stromal cell population in each sample. NMF clustering analysis, which decomposes a non-negative matrix into the product of two non-negative matrices, is described in the next section. In cancer research, where MCP-counter is widely used, this analysis provides insights into the immune microenvironment of tumors and its association with clinical outcomes[23].
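Conceptually, a marker-based abundance score of this kind averages the log-scale expression of a population's marker genes within each sample. The Python sketch below illustrates that idea only; the gene names are placeholders, not the actual MCP-counter signatures, and the function name is an assumption.

```python
def population_scores(expression, markers):
    """Per-sample mean marker expression for each cell population.

    expression: dict gene -> list of log2 expression values (one per sample).
    markers: dict population -> list of marker gene names.
    """
    scores = {}
    for population, genes in markers.items():
        # Use only the markers that are present in the expression matrix.
        present = [g for g in genes if g in expression]
        if not present:
            continue
        n_samples = len(expression[present[0]])
        scores[population] = [
            sum(expression[g][s] for g in present) / len(present)
            for s in range(n_samples)
        ]
    return scores


# Placeholder genes and values for two samples (hypothetical).
expression = {
    "GENE_T1": [8.0, 6.0],
    "GENE_T2": [10.0, 4.0],
    "GENE_F1": [3.0, 9.0],
}
markers = {
    "T cells": ["GENE_T1", "GENE_T2"],
    "Fibroblasts": ["GENE_F1", "GENE_MISSING"],
}
scores = population_scores(expression, markers)
```

Scores of this type are comparable between samples for the same population, but not between different populations within a sample.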

7. NMF (Non-negative matrix factorization)

The NMF package analyzes high-dimensional data through non-negative matrix factorization. The nmfModel function is used to determine the optimal rank, which is then used for factorization with the nmf function. The consensusmap function evaluates the connectivity matrix derived from multiple independent NMF runs and outputs a heat map of the consensus differences between NMF cluster types. To analyze immune infiltration between different NMF subtypes, the gsva_matrix function is used to perform gene set variation analysis (GSVA) on the gene expression matrix[24].
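For readers unfamiliar with the factorization itself, the sketch below approximates a non-negative matrix V by the product of two non-negative matrices W and H using the classical Lee-Seung multiplicative updates for squared error. This is a generic Python illustration under stated assumptions: the toy matrix is hypothetical, and the algorithm may differ from the one used by the R NMF package in the study.

```python
import random


def matmul(a, b):
    """Plain-Python matrix product of two lists of lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def transpose(a):
    return [list(col) for col in zip(*a)]


def frobenius_error(v, w, h):
    """Squared reconstruction error ||V - WH||^2."""
    wh = matmul(w, h)
    return sum((v[i][j] - wh[i][j]) ** 2
               for i in range(len(v)) for j in range(len(v[0])))


def nmf(v, rank, n_iter=200, seed=0, eps=1e-9):
    """Factorize V (n x m) into W (n x rank) and H (rank x m), all >= 0."""
    rng = random.Random(seed)
    n, m = len(v), len(v[0])
    w = [[rng.random() + eps for _ in range(rank)] for _ in range(n)]
    h = [[rng.random() + eps for _ in range(m)] for _ in range(rank)]
    for _ in range(n_iter):
        # H <- H * (W^T V) / (W^T W H)
        wt = transpose(w)
        num, den = matmul(wt, v), matmul(matmul(wt, w), h)
        h = [[h[a][j] * num[a][j] / (den[a][j] + eps) for j in range(m)]
             for a in range(rank)]
        # W <- W * (V H^T) / (W H H^T)
        ht = transpose(h)
        num, den = matmul(v, ht), matmul(w, matmul(h, ht))
        w = [[w[i][a] * num[i][a] / (den[i][a] + eps) for a in range(rank)]
             for i in range(n)]
    return w, h


# Toy genes x samples matrix with two obvious sample groups.
v = [[5.0, 5.0, 0.0, 0.0],
     [4.0, 4.0, 0.0, 0.0],
     [0.0, 0.0, 3.0, 3.0],
     [0.0, 0.0, 6.0, 6.0]]
w, h = nmf(v, rank=2)
```

In NMF-based subtyping, each sample is typically assigned to the factor with the largest coefficient in its column of H, and the rank corresponds to the number of subtypes.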

8. Convolutional neural networks

Based on the gene expression and immune infiltration proportions, a gene-immune convolutional neural network deep learning model was constructed and evaluated on the training and validation sets[25]. The x inputs and y outputs of the training and validation sets were separated and converted to matrices. The input dimension was extracted, and the Keras model was defined by adding a one-dimensional convolutional layer followed by Flatten and Dense layers. The model was compiled with the "Adam" optimizer and fitted to the training data. The trained model was then used to predict the test data, and the accuracy of the predictions was evaluated using the root mean square error (RMSE).
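Two of the building blocks named above can be sketched without a deep learning framework: the sliding dot product computed by a one-dimensional convolutional layer, and the RMSE used to score predictions. The Python functions below are illustrative assumptions with hypothetical inputs; they are not the Keras model used in the study.

```python
import math


def conv1d_valid(signal, kernel, bias=0.0):
    """Sliding dot product without padding, as computed by a 1D conv layer
    (the kernel is not flipped, i.e. this is cross-correlation)."""
    k = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(k)) + bias
        for i in range(len(signal) - k + 1)
    ]


def rmse(y_true, y_pred):
    """Root mean square error between observed and predicted values."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    squared = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    return math.sqrt(squared / len(y_true))


# Hypothetical feature vector and filter weights.
features = conv1d_valid([1.0, 2.0, 3.0, 4.0], [1.0, 0.0, -1.0])
# Hypothetical observed and predicted values.
error = rmse([0.0, 0.0], [3.0, 4.0])
```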

9. Molecular docking

The 3D structure of the small molecule Empagliflozin was obtained from the PubChem database (https://pubchem.ncbi.nlm.nih.gov/). The 3D structure was then prepared, and its coordinates were generated, with Schrödinger's LigPrep module using the OPLS_3e force field. All possible stereoisomers and associated protonation states were identified using the Epik module.

Protein preparation for docking requires a crystal structure. The crystal structures of the proteins encoded by the potential biomarker genes were retrieved from the Protein Data Bank (PDB); all were human proteins and were saved in PDB format. Schrödinger's Protein Preparation Wizard module was used for protein preparation: bond orders were assigned and hydrogens added, water molecules and cofactors were removed, the hydrogen bond network was optimized, and finally the protein energy was minimized with the OPLS_3e force field.

Molecular docking was performed using the Glide module in Schrödinger software, with the ligand of each structure defined as the center of the docking grid. The size of the outer docking box was set to 20 Å, and the size of the inner box was set to 10 Å. Docking used standard precision (SP), with Schrödinger's built-in GlideScore as the scoring function. Ten conformations were generated for each small molecule and energy-minimized after docking. For each set of poses generated by docking, the best-scoring conformation was selected for subsequent analysis.

10. Statistical analysis

R version 4.1.3 was used for statistical analysis, while Student’s t-test was performed to assess significant differences among distinct groups. In addition, the glmnet R package was used for the LASSO and Cox regression analyses. p-values < 0.05 indicated statistical significance (*p < 0.05; **p < 0.01; ***p < 0.001; ****p < 0.0001).
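The significance notation above maps p-value ranges to asterisks. A minimal Python sketch of that mapping follows; the function name and the "ns" label for non-significant results are assumptions for illustration.

```python
def significance_stars(p_value):
    """Map a p-value to the asterisk notation used in the figures."""
    if p_value < 0.0001:
        return "****"
    if p_value < 0.001:
        return "***"
    if p_value < 0.01:
        return "**"
    if p_value < 0.05:
        return "*"
    # "ns" (not significant) is an assumed label, not stated in the text.
    return "ns"


labels = [significance_stars(p) for p in (0.2, 0.03, 0.004, 0.0005, 0.00001)]
```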

1. Single-cell data processing and cell annotation

Applying the filter conditions nFeature_RNA > 200 & nFeature_RNA < 5000 & percent.mt < 10 yielded 29462 cells. Here, nFeature_RNA represents the number of genes detected per cell, nCount is the sum of the expression of all genes measured for each cell, and percent.mt is the proportion of mitochondrial genes (Fig. 1A). The PCA plot after Harmony batch correction demonstrated a good correction effect (Fig. 1B). UMAP dimensionality reduction and clustering yielded a total of 24 subpopulations, which were annotated using the marker genes published with the original data (Fig. 1C). This diabetic nephropathy single-cell dataset comprises 13 cell types, including proximal tubule (PT), parietal epithelial cells (PEC), thick ascending limb (TAL), distal tubule (DCT), connecting tubule (CNT), collecting duct (PC), type A intercalated cells (ICA), type B intercalated cells (ICB), podocytes (PODO), mesangial cells (MES), fibroblasts (FIB), endothelial cells (ENDO), and leukocytes (LEUK). Different cell types correspond to different colors in the UMAP plot (Fig. 1D). Fig. 1E shows the single-cell UMAP of the kidney in the diabetic nephropathy group and the control group. Figure 2A shows the proportion of the 13 kidney cell types in the diabetic nephropathy and control groups, where TAL is significantly increased in diabetic nephropathy. The histogram in Fig. 2B shows that PT cells accounted for the largest proportion of the 13 cell types in the control group, followed by TAL cells. The histogram in Fig. 2C shows the numbers of the 13 cell types in the diabetic nephropathy group, of which PT cells account for the largest proportion, followed by TAL cells. Taken together, the three panels in Fig. 2 show that the diabetic nephropathy group has significantly more TAL cells than the control group.
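The quality-control filter stated above can be expressed as a simple predicate on per-cell metrics. The Python sketch below applies the same three thresholds to hypothetical cells; the dictionary layout and function name are assumptions, since the study applied the filter to a Seurat object in R.

```python
def passes_qc(cell, min_features=200, max_features=5000, max_percent_mt=10.0):
    """Apply nFeature_RNA > 200 & nFeature_RNA < 5000 & percent.mt < 10."""
    return (
        min_features < cell["nFeature_RNA"] < max_features
        and cell["percent_mt"] < max_percent_mt
    )


# Hypothetical per-cell QC metrics.
cells = [
    {"id": "cell_1", "nFeature_RNA": 1500, "percent_mt": 2.5},   # kept
    {"id": "cell_2", "nFeature_RNA": 150, "percent_mt": 1.0},    # too few genes
    {"id": "cell_3", "nFeature_RNA": 6200, "percent_mt": 3.0},   # possible doublet
    {"id": "cell_4", "nFeature_RNA": 2100, "percent_mt": 18.0},  # high mito ratio
]
kept = [c["id"] for c in cells if passes_qc(c)]
```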

2. Single-cell co-expression network analysis of TAL in diabetic nephropathy

The parameter sweep results were visualized using PlotSoftPowers, and a soft power of 5 was selected as the most suitable (Fig. 3A). The co-expression network of TAL cells in diabetic nephropathy, which comprises a total of 9 modules, is visualized in Fig. 3B. Based on the eigengene-based connectivity (kME) score for each module, highly connected genes within each module were derived (Fig. 3C). Figure 3D shows the correlation between the modules: the genes in the turquoise module are strongly positively correlated with the genes in the brown and yellow modules, the genes in the brown and pink modules are strongly positively correlated, and the genes in the yellow and blue modules are strongly negatively correlated. Dot plots in Fig. 3E and F suggest that genes in the turquoise, brown, and pink modules are most associated with subclass 1.2.12. Please refer to Supplementary Material 1 for the genes in the hdWGCNA turquoise, brown, and pink modules.

3. Pseudotime analysis and cell communication of TAL cells in diabetic nephropathy

We analyzed the developmental trajectory of TAL cells in diabetic nephropathy using pseudotime analysis. The trajectory starts from node 2 and is divided into two branches, developing from cluster 1 and cluster 2, respectively, and gradually developing into subgroups 3, 4, and 5 after subgroup 6 (Fig. 4A and B). Figure 4C shows the heat maps of representative genes in the different clusters, where genes in clusters 1, 2, and 7 are expressed by early cells and genes in clusters 3 and 4 are expressed by late cells. Early expressed genes include LINC01320, TENT2, SOX6, VWA8, UBE2E3, FBXL4, PPARGC1A, HIBADH, NEDD4L, OSBPL3, KLF12, TRPS1, DENND1B, CADPS2, ARID1B, MITF, NCOA2, EXOC4, MEF2F, VAV3, GRAMD2B, ARHGAP24, SGIP1, PPM1L, ESRRG, IGF1R, TMEM161B-AS1, PANTR1, and NTRK2. The potential TAL biomarkers ESRRG and IGF1R identified in our screen are expressed at an early stage. The genes expressed in late stages are TTC14, PAX2, LAMB1, TNS3, SELENBP1, DENND2A, GOLM1, KIFC3, CCDC7, PRODH, PRDM16, MCF2L, EPN1, EHBP1, GRB14, HPN, and C20orf194. We also investigated the cell communication networks between TAL cells and other cell types in the kidney. Our results suggest that TAL cells in diabetic nephropathy send BMP signals to many cells, which are mediated and regulated by PEC and PODO cells, ultimately leading to an altered BMP pathway in PC cells and contributing to the development of the disease (Fig. 4D, E).

4. Immune infiltration analysis of TAL cell-related genes in diabetic nephropathy

The TAL cells of diabetic nephropathy were subjected to non-negative matrix factorization (NMF); the rank immediately preceding the point of fastest decline was selected, yielding three distinct cell types (Fig. 5A). The heat map in Fig. 5B illustrates the differences between the three NMF types of TAL cells in diabetic nephropathy. The heat map in Fig. 5C reveals that the first NMF type is predominantly associated with immunity, particularly neutrophils, NK cells, cytotoxic lymphocytes, and myeloid dendritic cells. The box plot in Fig. 5D indicates that immune infiltration differs among the three NMF types, with the first type being most closely linked to immune infiltration. When the three NMF cell types were compared, T cells, cytotoxic lymphocytes, NK cells, myeloid dendritic cells, and fibroblasts showed statistically significant differences, with the first NMF group showing high levels of these five cell types (Fig. 5E). Furthermore, the first NMF subset of TAL cells in diabetic nephropathy exhibited high expression levels of three genes, namely COBL, PPARGC1A, and THSD7A (Fig. 5F).

5. Machine learning screens for TAL biological markers of diabetic nephropathy

The transcriptome datasets GSE30529 and GSE104954 were merged, and batch effects were eliminated to obtain an expression matrix (Supplementary Material 2). GSE30529 served as the training set, comprising a total of 22 samples, including 10 diabetic nephropathy samples and 12 control samples. GSE104954 was used as the validation set, consisting of 25 samples, including 7 diabetic nephropathy samples and 18 control samples. Univariate logistic regression was performed on the training set, followed by lasso regression, which yielded 7 genes (Supplementary picture 2A, B). These seven genes, namely ESRRG, IGF1R, PTGER3, LAMB1, CYFIP2, EPN1, and MAST4, were primarily expressed in the NMF type II cell population of TAL cells in diabetic nephropathy (Fig. 5F). Further verification was conducted using mlr3 machine learning, with the training set evaluated using folds = 5 and repeats = 10 (50 internal cross-validations in total), while the external validation set was used for verification. The AUC and ROC graphs demonstrated that the SVM and lda algorithms exhibited high sensitivity and specificity. In this study, SVM was selected for external dataset testing (Supplementary picture 2C, D). The area under the ROC curve (AUC) on the external dataset was 0.905 (Fig. 7E). Correlation analysis between the potential TAL cell biomarkers of diabetic nephropathy and immune cell infiltration revealed that the IGF1R gene was most closely associated with NK cell infiltration (Supplementary picture 2F). Figures 6A-G show the expression of the seven screened biomarkers, which differed significantly between the control group and the diabetic nephropathy group, with reduced expression in the diabetic nephropathy group.

6. Convolutional neural network deep learning validation of biomarkers immune infiltration

GSE30529 was the training set, and GSE104954 was the validation set. During deep learning training, the loss gradually decreased and the accuracy gradually increased (Supplementary picture 3A). After deep learning on the training set GSE30529, at a threshold of 0.487, the sensitivity and specificity were both 1 (Supplementary picture 3B). On the validation set GSE104954, at a threshold of 0.300, the sensitivity was 0.833 and the specificity was 1 (Supplementary picture 3C).
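Sensitivity and specificity at a given threshold follow directly from the predicted scores and true labels. The Python sketch below uses hypothetical scores chosen only so that the result resembles the reported pattern; they are not the study's predictions, and the function name is an assumption.

```python
def sensitivity_specificity(labels, scores, threshold):
    """Classify score >= threshold as positive and compute both metrics.

    labels: 1 for disease, 0 for control.
    """
    pairs = list(zip(labels, scores))
    tp = sum(1 for y, s in pairs if y == 1 and s >= threshold)
    fn = sum(1 for y, s in pairs if y == 1 and s < threshold)
    tn = sum(1 for y, s in pairs if y == 0 and s < threshold)
    fp = sum(1 for y, s in pairs if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical labels and model outputs (not data from the study).
labels = [1, 1, 1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.91, 0.75, 0.62, 0.48, 0.35, 0.12, 0.25, 0.18, 0.09, 0.05]
sens, spec = sensitivity_specificity(labels, scores, threshold=0.300)
```

Lowering the threshold generally raises sensitivity at the cost of specificity, which is why the reported operating points differ between the training and validation sets.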

7. Molecular docking

The proteins encoded by the 7 genes were docked with Empagliflozin, with the following results. For all docking scores reported below, a more negative value indicates more stable predicted binding. Empagliflozin binds to the active cavity of CYFIP2, establishes hydrophobic interactions with amino acids near the active site, and forms Pi-Pi stacking with TYR700 near the active cavity (Fig. 7A,B, Supplementary picture 4A). The hydrophobic amino acids present in the active cavity include MET37, PRO32, TYR700, LEU642, and LEU636. These interactions stabilized the binding of Empagliflozin to CYFIP2, with a score of -9.273. Empagliflozin binds to the active cavity of ESRRG, establishes hydrophobic interactions with amino acids near the active site, forms Pi-Pi stacking with PHE435 near the active cavity, and forms an H bond with GLU275 near the active cavity (Fig. 7C,D, Supplementary picture 4B). The hydrophobic amino acids present in the active cavity include TRP305, MET306, LEU309, ILE310, VAL313, LEU276, ALA272, LEU271, CYS269, LEU268, and LEU265. These interactions stabilized the binding of Empagliflozin to ESRRG, with a score of -8.253. Empagliflozin binds to the active cavity of PTGER3 and establishes hydrophobic interactions with amino acids near the active site (Fig. 7E,F, Supplementary picture 4C). The hydrophobic amino acids present in the active cavity include LEU329, VAL332, ALA335, ILE340, VAL110, LEU59, MET58, and PRO55. These interactions stabilized the binding of Empagliflozin to PTGER3, with a score of -8.253. Empagliflozin binds to the active cavity of IGF1R, establishes hydrophobic interactions with amino acids near the active site, and forms H bonds with ASP150 and ASN1137 near the active cavity (Fig. 7G,H, Supplementary picture 4D). The hydrophobic amino acids present in the active cavity include ALA1028, LEU1078, MET1079, MET1139, and LEU1002. These interactions stabilized the binding of Empagliflozin to IGF1R, with a score of -7.073.

Empagliflozin binds to the active cavity of MAST4, establishes hydrophobic interactions with amino acids near the active site, and forms H bonds with GLU73, ASN114, ASN156, ASP169, GLN37, and ARG69 near the active cavity (Fig. 8A,B, Supplementary picture 4E). The hydrophobic amino acids present in the active cavity include LEU57, ALA53, ALA113, MET111, MET108, ILE32, and VAL40. These interactions stabilized the binding of Empagliflozin to MAST4, with a score of -7.577; this strongly negative score indicates stable binding.

Empagliflozin binds to the active cavity of EPN1, forms Pi-cation interactions with ARG7 and ARG25, and forms an H bond with ARG8 near the active cavity (Fig. 8C,D, Supplementary picture 4F). These interactions stabilized the binding of Empagliflozin to EPN1, with a score of -4.642.

Empagliflozin binds to the active cavity of LAMB1, forms a Pi-cation interaction with ARG152, and forms H bonds with ASP155, SER154, and ASN226 near the active cavity (Fig. 8E,F, Supplementary picture 4G). The hydrophobic amino acids present in the active cavity include ALA159, TRP160, VAL162, and TYR163. These interactions stabilized the binding of Empagliflozin to LAMB1, with a score of -4.571.
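
To compare the seven complexes at a glance, the docking scores reported in this section can be sorted from most to least negative. The short Python sketch below only rearranges the values stated in the text. It assumes the common docking convention that a more negative score corresponds to more favorable predicted binding, and the variable names are illustrative.

```python
# Docking scores for Empagliflozin as reported in the text (protein -> score).
docking_scores = {
    "CYFIP": -9.273,
    "ESRRG": -8.253,
    "PTGER3": -8.253,
    "IGF1": -7.073,
    "MAST4": -7.577,
    "EPN1": -4.642,
    "LAMB1": -4.571,
}

# Sort ascending so the most negative (assumed most favorable) score comes first.
# Python's sort is stable, so tied scores keep their order of appearance.
ranked = sorted(docking_scores.items(), key=lambda item: item[1])

for rank, (protein, score) in enumerate(ranked, start=1):
    print(f"{rank}. {protein}: {score:.3f}")
```

Under this assumption, CYFIP ranks first and EPN1 and LAMB1 rank last, which matches the order in which the text emphasizes stable binding.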

Discussion

CYFIP is a key component of the WAVE regulatory complex (WRC), which regulates actin dynamics, a process essential for cell motility, intercellular adhesion, and epithelial differentiation[26]. De novo mutations in the CYFIP2 subunit of the WRC cause intellectual disability (ID) in humans[27]. CYFIP regulates synaptic development and endocytosis by inhibiting actin assembly[28]. CYFIP2 acts as a tumor suppressor gene in ccRCC, and enhancing its expression may provide a new treatment strategy for that disease[26]. In our study, CYFIP gene expression was decreased in TAL cells in diabetic nephropathy.

ESRRG encodes estrogen-related receptor gamma (ERRγ), an orphan nuclear receptor that is expressed in tissues with high metabolic activity, such as the heart and kidney, and is involved in lipid metabolism[29]. ESRRG functions in early branch generation of the ureteric bud and is essential for normal development of the renal papilla[30]. ERRγ and hepatocyte nuclear factor 1 beta (HNF1β) act in a coordinated manner to regulate renal mitochondrial and reabsorptive functions through epigenomic programs. When ERRγ is deleted in renal epithelial cells, which express the gene highly and specifically, severe renal energetic and reabsorptive dysfunction occurs, leading to progressive renal failure[31]. ESRRG expression showed a 1.4-fold decrease with age, and ESRRG has been linked to kidney aging via cardiovascular disease, lipid metabolism, or kidney function itself[32]. CRIP1 and ESRRG have also been proposed as potential novel biomarkers and therapeutic targets in hypertension-related RCC[33]. Overall, ESRRG is a critical regulator of metabolism, and its dysregulation has been linked to various metabolic disorders. In our study, ESRRG gene expression was decreased in TAL cells in diabetic nephropathy, and the gene was expressed early in the cell cycle.

The PTGER3 gene, also known as the prostaglandin E2 receptor 3 gene, encodes the EP3 receptor, which binds prostaglandin E2 (PGE2) with high affinity. EP3 is a G protein-coupled receptor that plays a key role in modulating the immune response and inflammation. EP3 contributes to diabetic polyuria by inhibiting the expression of aquaporins, and it promotes renal injury during diabetes. EP3 may therefore prove to be a promising target for more selective management of diabetic kidney disease[34].

EP3 receptors mediate vasoconstriction in the kidney of male mice, and their actions are tonically active in the basal state. Furthermore, EP3 receptors are capable of buffering PGE2-mediated renal vasodilation[35]. The vasoconstriction is mediated by an EP3 receptor coupled to Gαi. In our study, the expression of the PTGER3 gene was decreased in TAL cells in diabetic nephropathy.

Studies of IGF1 expression during mouse kidney development revealed IGF1 mRNA expression in all renal cells at embryonic day 15, with a drastic decrease after birth[36]. During early embryogenesis, IGF1R mRNA is expressed in the rat mesonephros and is detected in all nephron segments through adulthood. In the human kidney, IGF1R is strongly expressed in glomeruli and the tubular epithelium[37,38]. GH and IGFs play a significant role in the early development of diabetic renal disease[39]. Mesangial cells isolated from experimental models of diabetic nephropathy exhibit altered IGF1 synthesis, IGF1 pathway activation, and higher IGF1R expression and activation compared with controls[40]. Hyperglycemia reduces IGFBP-2 expression in mesangial cells, exacerbating the effects of IGF1 on these cells, and increases the expression of IGFBP-3, which mediates mesangial cell apoptosis[41,42]. In our study, the expression of the IGF1R gene was decreased in TAL cells in diabetic nephropathy. It is expressed early in the cell cycle, which is consistent with previous studies, and immune infiltration analysis further shows that IGF1R expression correlates most strongly with NK cell infiltration. IGF1 may be an important early biomarker in TAL cells in diabetic nephropathy.

The Microtubule Associated Serine/Threonine Kinase 4 (MAST4) gene encodes a multifunctional serine/threonine kinase involved in neuronal functions. MAST4 is predominantly expressed in the brain, particularly in the hippocampus, amygdala, and thalamus. Variants of the MAST4 gene may lead to neurodevelopmental disorders with developmental delay and infantile spasms[43]. In our study, the expression of the MAST4 gene was decreased in TAL cells in diabetic nephropathy.

The EPN1 gene, also known as EPN or epsin 1, is located on human chromosome 19 and is involved in the regulation of clathrin-mediated endocytosis. The encoded protein, epsin 1, is a member of a small family of proteins that contain binding domains for both clathrin and phospholipids. Epsin 1 plays a crucial role in controlling the dynamics of clathrin-coated pits, which are the initial sites of vesicle formation within cells[44]. Epsin 1 also interacts with a variety of other proteins involved in endocytosis, such as clathrin adaptors and receptors[45]. In our study, the expression of the EPN1 gene was decreased in TAL cells in diabetic nephropathy, and the gene was expressed late in the cell cycle of TAL cells.

LAMB1 encodes the laminin beta-1 subunit, a component of laminins, a group of extracellular matrix glycoproteins that play vital roles in various biological processes, including cell adhesion, migration, differentiation, and signaling[46,47]. Laminin beta-1 is a major component of basement membranes in the kidney. The basement membrane is a specialized extracellular matrix that lines the surface of the renal glomerulus and participates in filtration and maintenance of glomerular structure[48]. The LAMB1 protein plays a crucial role in maintaining the structural integrity of the glomerular basement membrane and in regulating signals that control renal development and repair. Mutations or abnormalities in the LAMB1 gene have been linked to several kidney diseases, including congenital nephrotic syndrome and focal segmental glomerulosclerosis, which disrupt the kidney's normal filtration process. Understanding the molecular mechanisms underlying LAMB1 regulation and function may lead to novel strategies for the early detection and treatment of chronic kidney diseases.

LAMB1 has been reported among the basement membrane (BM)-related genes that may serve as diagnostic and therapeutic biomarkers for DN[14]. In our study, LAMB1 gene expression was increased in TAL cells in diabetic nephropathy, and the gene was expressed late in the cell cycle of TAL cells.

Bone morphogenetic protein (BMP) signaling and its modifiers are involved in renal development and physiology. BMP7 is a crucial BMP for kidney development, and its absence can lead to defects in the kidneys and eyes, renal dysplasia, and a reduced number of nephrons. BMP7 is also protective against chronic kidney disease (CKD) and hypertensive nephrosclerosis. BMP signaling is important for cell growth, apoptosis, and differentiation in early development. BMP4 mutations have been identified in patients with renal hypodysplasia. The balance of TGF-β1 and BMP7 signaling, together with the extracellular and intracellular modifiers of these cascades, is an important part of podocyte physiology and pathophysiology. BMP7 can restore the SnoN protein level via the Smad1/5 pathway in diabetic kidney disease, ameliorating partial epithelial-mesenchymal transition. Our study shows that TAL cells in diabetic nephropathy regulate PEC and PODO via the BMP pathway, ultimately leading to PC cytopathies.

Conclusion

This study examined TAL cells in diabetic nephropathy at the single-cell level. We identified 7 potential biological markers of TAL cells in diabetic nephropathy and explored their molecular docking with a candidate drug for diabetic nephropathy control. The analysis also revealed cellular communication patterns and genes associated with TAL cell immune infiltration in diabetic nephropathy, providing guidance for studying cell-cell interactions and immune infiltration in the disease. These findings expand the understanding of diabetic nephropathy and may inform future research and clinical practice in this area.

Abbreviations

hdWGCNA: high-dimensional weighted gene co-expression network analysis

LASSO: Least Absolute Shrinkage and Selection Operator

TAL cell: thick ascending limb cell

NMF: Non-negative Matrix Factorization clustering

MCPcounter: Microenvironment Cell Populations-counter

BMP: Bone Morphogenetic Protein

PEC: Podocyte epithelial cells

PODO: Podocyte cells

RAAS: Renin-angiotensin-aldosterone system

AT1R: Angiotensin II type 1 receptor

SVM-RFE: Support Vector Machine - Recursive Feature Elimination

RF: Random Forest

DN: Diabetic nephropathy

DEGs: Differentially Expressed Genes

CIBERSORTx: Cell-type Identification By Estimating Relative Subsets Of RNA Transcripts

EMT: Epithelial-mesenchymal transition

PC: Papillary Collecting Duct and Proximal Convoluted Tubule

KNN: K-nearest neighbors

GSVA: Gene set variation analysis

PT: Proximal tubule

PEC: Parietal epithelial cells

TAL: Thick ascending limb

DCT: Distal convoluted tubule

CNT: Connecting tubule

PC: Collecting duct

ICA: Type A Intercalated cells

ICB: Type B intercalated cells

FIB: Fibroblasts

ENDO: Endothelial cells

LEUK: Leukocytes

PDB: Protein Data Bank

Ethics approval and informed consent

GEO is a public database, and ethical approval was obtained for the studies that collected the various data in the database. Users can freely download relevant data for use in research and publication of relevant articles. Our study was based on open-source data; therefore, there are no ethical issues or other conflicts of interest.

Data availability statement

Publicly available datasets were analyzed in this study. The data can be found in the GEO database under accessions GSE30529, GSE104954, and GSE131882:

https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE30529

https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE104954

https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE131882

Ethics statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author contributions

Chengyu Zhang: Performed the study; Analyzed and interpreted the data; Wrote the paper. Han Li: Conceived and designed the study; Performed the study; Analyzed and interpreted the data; Wrote the paper. Shixiang Wang: Conceived and designed the study; Analyzed and interpreted the data; Contributed analysis tools or data. All authors read and approved the final manuscript.

Acknowledgments

We would like to thank the authors of the studies that generated the GSE30529, GSE104954, and GSE131882 datasets for their contributions.

Funding

This research did not receive any specific grant from funding agencies in the public, commercial, or not-for-profit sectors.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Gross, J.L., de Azevedo, M.J., Silveiro, S.P., Canani, L.H., Caramori, M.L., Zelmanovitz, T., 2005. Diabetic nephropathy: diagnosis, prevention, and treatment. Diabetes Care 28(1): 164–76, Doi: 10.2337/diacare.28.1.164.
Van Buren, P.N., Toto, R., 2011. Hypertension in Diabetic Nephropathy: Epidemiology, Mechanisms, and Management. Advances in Chronic Kidney Disease 18(1): 28–41, Doi: 10.1053/j.ackd.2010.10.003.
Grassi, G., Mancia, G., Nilsson, P.M., 2016. Specific Blood Pressure Targets for Patients With Diabetic Nephropathy? Diabetes Care 39(Supplement_2): S228–33, Doi: 10.2337/dcS15-3020.
Momoniat, T., Ilyas, D., Bhandari, S., 2019. ACE inhibitors and ARBs: Managing potassium and renal function. Cleveland Clinic Journal of Medicine 86(9): 601–7, Doi: 10.3949/ccjm.86a.18024.
Van Beusecum, J.P., Inscho, E.W., 2015. Regulation of Renal Function and Blood Pressure Control by P2 Purinoceptors in the Kidney. Current Opinion in Pharmacology 21: 82–8, Doi: 10.1016/j.coph.2015.01.003.
Polidoro, J.Z., Rebouças, N.A., Girardi, A.C.C., 2021. The Angiotensin II Type 1 Receptor-Associated Protein Attenuates Angiotensin II-Mediated Inhibition of the Renal Outer Medullary Potassium Channel in Collecting Duct Cells. Frontiers in Physiology 12, Doi: 10.3389/fphys.2021.642409.
Wakui, H., 2020. The pathophysiological role of angiotensin receptor-binding protein in hypertension and kidney diseases: Oshima Award Address 2019. Clinical and Experimental Nephrology 24(4): 289–94, Doi: 10.1007/s10157-020-01861-4.
Eladari, D., Hübner, C.A., 2011. Novel mechanisms for NaCl reabsorption in the collecting duct. Current Opinion in Nephrology and Hypertension 20(5): 506–11, Doi: 10.1097/MNH.0b013e3283486c4a.
Bankir, L., Figueres, L., Prot-Bertoye, C., Bouby, N., Crambert, G., Pratt, J.H., et al., 2020. Medullary and cortical thick ascending limb: similarities and differences. American Journal of Physiology. Renal Physiology 318(2): F422–42, Doi: 10.1152/ajprenal.00261.2019.
El-Arif, G., Khazaal, S., Farhat, A., Harb, J., Annweiler, C., Wu, Y., et al., 2022. Angiotensin II Type I Receptor (AT1R): The Gate towards COVID-19-Associated Diseases. Molecules (Basel, Switzerland) 27(7): 2048, Doi: 10.3390/molecules27072048.
Yu, K., Li, D., Xu, F., Guo, H., Feng, F., Ding, Y., et al., 2021. IDO1 as a new immune biomarker for diabetic nephropathy and its correlation with immune cell infiltration. International Immunopharmacology 94: 107446, Doi: 10.1016/j.intimp.2021.107446.
Han, H., Chen, Y., Yang, H., Cheng, W., Zhang, S., Liu, Y., et al., 2022. Identification and Verification of Diagnostic Biomarkers for Glomerular Injury in Diabetic Nephropathy Based on Machine Learning Algorithms. Frontiers in Endocrinology 13: 876960, Doi: 10.3389/fendo.2022.876960.
Lu, K., Wang, L., Fu, Y., Li, G., Zhang, X., Cao, M., 2022. Bioinformatics analysis identifies immune-related gene signatures and subtypes in diabetic nephropathy. Frontiers in Endocrinology 13: 1048139, Doi: 10.3389/fendo.2022.1048139.
Gui, H., Chen, X., Ye, L., Ma, H., 2023. Seven basement membrane-specific expressed genes are considered potential biomarkers for the diagnosis and treatment of diabetic nephropathy. Acta Diabetologica 60(4): 493–505, Doi: 10.1007/s00592-022-02027-2.
Wei, J., Deng, X., Li, Y., Li, R., Yang, Z., Li, X., et al., 2021. PP2 Ameliorates Renal Fibrosis by Regulating the NF-κB/COX-2 and PPARγ/UCP2 Pathway in Diabetic Mice. Oxidative Medicine and Cellular Longevity 2021: 7394344, Doi: 10.1155/2021/7394344.
Korsunsky, I., Millard, N., Fan, J., Slowikowski, K., Zhang, F., Wei, K., et al., 2019. Fast, sensitive and accurate integration of single-cell data with Harmony. Nature Methods 16(12): 1289–96, Doi: 10.1038/s41592-019-0619-0.
Qiu, X., Mao, Q., Tang, Y., Wang, L., Chawla, R., Pliner, H.A., et al., 2017. Reversed graph embedding resolves complex single-cell trajectories. Nature Methods 14(10): 979–82, Doi: 10.1038/nmeth.4402.
Jin, S., Guerrero-Juarez, C.F., Zhang, L., Chang, I., Ramos, R., Kuan, C.-H., et al., 2021. Inference and analysis of cell-cell communication using CellChat. Nature Communications 12(1): 1088, Doi: 10.1038/s41467-021-21246-9.
Vu, R., Jin, S., Sun, P., Haensel, D., Nguyen, Q.H., Dragan, M., et al., 2022. Wound healing in aged skin exhibits systems-level alterations in cellular composition and cell-cell communication. Cell Reports 40(5): 111155, Doi: 10.1016/j.celrep.2022.111155.
Morabito, S., Reese, F., Rahimzadeh, N., Miyoshi, E., Swarup, V., 2022. High dimensional co-expression networks enable discovery of transcriptomic drivers in complex biological system.
Tibshirani, R., 1996. Regression Shrinkage and Selection Via the Lasso. Journal of the Royal Statistical Society: Series B (Methodological) 58(1): 267–88, Doi: 10.1111/j.2517-6161.1996.tb02080.x.
Lang, M., Binder, M., Richter, J., Schratz, P., Pfisterer, F., Coors, S., et al., 2019. mlr3: A modern object-oriented machine learning framework in R. Journal of Open Source Software 4(44): 1903, Doi: 10.21105/joss.01903.
Zheng, X., Ma, Y., Bai, Y., Huang, T., Lv, X., Deng, J., et al., 2022. Identification and validation of immunotherapy for four novel clusters of colorectal cancer based on the tumor microenvironment. Frontiers in Immunology 13: 984480, Doi: 10.3389/fimmu.2022.984480.
Esposito, F., Gillis, N., Del Buono, N., 2019. Orthogonal joint sparse NMF for microarray data analysis. Journal of Mathematical Biology 79(1): 223–47, Doi: 10.1007/s00285-019-01355-2.
Wu, Z., Pan, S., Chen, F., Long, G., Zhang, C., Yu, P.S., 2021. A Comprehensive Survey on Graph Neural Networks. IEEE Transactions on Neural Networks and Learning Systems 32(1): 4–24, Doi: 10.1109/TNNLS.2020.2978386.
Tong, J., Meng, X., Lv, Q., Yuan, H., Li, W., Xiao, W., et al., 2021. The Downregulation of Prognosis- and Immune Infiltration-Related Gene CYFIP2 Serves as a Novel Target in ccRCC. International Journal of General Medicine 14: 6587–99, Doi: 10.2147/IJGM.S335713.
Kaplan, E., Stone, R., Hume, P.J., Greene, N.P., Koronakis, V., 2020. Structure of CYRI-B (FAM49B), a key regulator of cellular actin assembly. Acta Crystallographica. Section D, Structural Biology 76(Pt 10): 1015–24, Doi: 10.1107/S2059798320010906.
Anitei, M., Stange, C., Parshina, I., Baust, T., Schenck, A., Raposo, G., et al., 2010. Protein complexes containing CYFIP/Sra/PIR121 coordinate Arf1 and Rac1 signalling during clathrin-AP-1-coated carrier biogenesis at the TGN. Nature Cell Biology 12(4): 330–40, Doi: 10.1038/ncb2034.
Giguère, V., 2008. Transcriptional control of energy homeostasis by the estrogen-related receptors. Endocrine Reviews 29(6): 677–96.
Berry, R., Harewood, L., Pei, L., Fisher, M., Brownstein, D., Ross, A., et al., 2011. Esrrg functions in early branch generation of the ureteric bud and is essential for normal development of the renal papilla. Human Molecular Genetics 20(5): 917–26, Doi: 10.1093/hmg/ddq530.
Zhao, J., Lupino, K., Wilkins, B.J., Qiu, C., Liu, J., Omura, Y., et al., 2018. Genomic integration of ERRγ-HNF1β regulates renal bioenergetics and prevents chronic kidney disease. Proceedings of the National Academy of Sciences of the United States of America 115(21): E4910–9, Doi: 10.1073/pnas.1804965115.
Rodwell, G.E.J., Sonu, R., Zahn, J.M., Lund, J., Wilhelmy, J., Wang, L., et al., 2004. A transcriptional profile of aging in the human kidney. PLoS Biology 2(12): e427, Doi: 10.1371/journal.pbio.0020427.
Huang, W., Wu, K., Wu, R., Chen, Z., Zhai, W., Zheng, J., 2020. Bioinformatic gene analysis for possible biomarkers and therapeutic targets of hypertension-related renal cell carcinoma. Translational Andrology and Urology 9(6): 2675–87, Doi: 10.21037/tau-20-817.
Hassouneh, R., Nasrallah, R., Zimpelmann, J., Gutsol, A., Eckert, D., Ghossein, J., et al., 2016. PGE2 receptor EP3 inhibits water reabsorption and contributes to polyuria and kidney injury in a streptozotocin-induced mouse model of diabetes. Diabetologia 59(6): 1318–28, Doi: 10.1007/s00125-016-3916-5.
Audoly, L.P., Ruan, X., Wagner, V.A., Goulet, J.L., Tilley, S.L., Koller, B.H., et al., 2001. Role of EP(2) and EP(3) PGE(2) receptors in control of murine renal hemodynamics. American Journal of Physiology. Heart and Circulatory Physiology 280(1): H327-333, Doi: 10.1152/ajpheart.2001.280.1.H327.
Lindenbergh-Kortleve, D.J., Rosato, R.R., van Neck, J.W., Nauta, J., van Kleffens, M., Groffen, C., et al., 1997. Gene expression of the insulin-like growth factor system during mouse kidney development. Molecular and Cellular Endocrinology 132(1–2): 81–91, Doi: 10.1016/s0303-7207(97)00123-8.
Bondy, C.A., Werner, H., Roberts, C.T., LeRoith, D., 1990. Cellular pattern of insulin-like growth factor-I (IGF-I) and type I IGF receptor gene expression in early organogenesis: comparison with IGF-II gene expression. Molecular Endocrinology (Baltimore, Md.) 4(9): 1386–98.
Chin, E., Bondy, C., 1992. Insulin-like growth factor system gene expression in the human kidney. The Journal of Clinical Endocrinology and Metabolism 75(3): 962–8.
Flyvbjerg, A., 2000. Putative pathophysiological role of growth factors and cytokines in experimental diabetic kidney disease. Diabetologia 43(10): 1205–23, Doi: 10.1007/s001250051515.
Vasylyeva, T.L., Ferry, R.J., 2007. Novel roles of the IGF-IGFBP axis in etiopathophysiology of diabetic nephropathy. Diabetes Research and Clinical Practice 76(2): 177–86, Doi: 10.1016/j.diabres.2006.09.012.
Fornoni, A., Rosenzweig, S.A., Lenz, O., Rivera, A., Striker, G.E., Elliot, S.J., 2006. Low insulin-like growth factor binding protein-2 expression is responsible for increased insulin receptor substrate-1 phosphorylation in mesangial cells from mice susceptible to glomerulosclerosis. Endocrinology 147(7): 3547–54.
Vasylyeva, T.L., Chen, X., Ferry, R.J., 2005. Insulin-like growth factor binding protein-3 mediates cytokine-induced mesangial cell apoptosis. Growth Hormone & IGF Research: Official Journal of the Growth Hormone Research Society and the International IGF Research Society 15(3): 207–14, Doi: 10.1016/j.ghir.2005.02.008.
Zhang, X., Xiao, N., Cao, Y., Peng, Y., Lian, A., Chen, Y., et al., 2023. De novo variants in MAST4 related to neurodevelopmental disorders with developmental delay and infantile spasms: Genotype-phenotype association. Frontiers in Molecular Neuroscience 16: 1097553, Doi: 10.3389/fnmol.2023.1097553.
Kvalvaag, A., Valvo, S., Céspedes, P.F., Saliba, D.G., Kurz, E., Korobchevskaya, K., et al., 2023. Clathrin mediates both internalization and vesicular release of triggered T cell receptor at the immunological synapse. Proceedings of the National Academy of Sciences of the United States of America 120(6): e2211368120, Doi: 10.1073/pnas.2211368120.
Shen, Q., He, B., Lu, N., Conradt, B., Grant, B.D., Zhou, Z., 2013. Phagocytic receptor signaling regulates clathrin and epsin-mediated cytoskeletal remodeling during apoptotic cell engulfment in C. elegans. Development (Cambridge, England) 140(15): 3230–43, Doi: 10.1242/dev.093732.
Petz, M., Them, N.C.C., Huber, H., Mikulits, W., 2012. PDGF enhances IRES-mediated translation of Laminin B1 by cytoplasmic accumulation of La during epithelial to mesenchymal transition. Nucleic Acids Research 40(19): 9738–49, Doi: 10.1093/nar/gks760.
Petz, M., Kozina, D., Huber, H., Siwiec, T., Seipelt, J., Sommergruber, W., et al., 2007. The leader region of Laminin B1 mRNA confers cap-independent translation. Nucleic Acids Research 35(8): 2473–82, Doi: 10.1093/nar/gkm096.
Verrou, K.-M., Galliou, P.A., Papaioannou, M., Koliakos, G., 2019. Phosphorylation mapping of Laminin β1-chain: Kinases in association with active sites. Journal of Biosciences 44(2): 55.

No competing interests reported.

Editorial decision: Major revision
24 Jul, 2023
Reviews received at journal
18 Jul, 2023
Reviewers agreed at journal
18 Jul, 2023
Reviewers agreed at journal
14 Jul, 2023
Reviewers invited by journal
13 Jun, 2023
Editor assigned by journal
08 Jun, 2023
Submission checks completed at journal
08 Jun, 2023
First submitted to journal
06 Jun, 2023
