Human Muscle-specific Metabolic Model Reconstruction
To reconstruct the muscle-specific metabolic network, the generic metabolic model and gene expression data of muscle were needed. For model reconstruction and analysis of metabolic disturbance in T2DM, we looked for the most comprehensive gene expression data from the muscle of healthy and diabetic individuals who had the following conditions: 1) newly diagnosed with T2DM. 2) Taking no medication for diabetes. 3) Do not have any other disease affecting the analysis. This resulted in data with the accession code phs001068.v1.p1 in the dbGaP database. Diabetic patients in this sample have been newly diagnosed as T2DM patients, and in most of them, the fasting blood glucose concentration was around 7 mmol/l, thus they were at the onset of diabetes. Table 1 shows subjects characteristics of the samples used in this study. Investigation of metabolism reprogramming in these patients who are at the onset of T2DM has significant value in identifying the underlying disorders in T2DM. We used gene expression data of healthy individuals in this sample for the reconstruction of the muscle-specific metabolic model.
Table 1
Subjects characteristics of the individuals in the dataset [6]. Means ± standard deviation values for fasting plasma glucose, fasting serum insulin, body mass index (BMI) and waist/hip ratio (WHR) in each diabetic and healthy groups are shown.
Sample
|
Glucose (mmol/L)
|
Insulin (mu/l)
|
BMI
|
WHR
|
T2DM
|
7.17 ± 0.66
|
10.68 ± 6.94
|
29.37 ± 5.04
|
0.99 ± 0.07
|
Healthy
|
5.62 ± 0.33
|
6.87 ± 3.34
|
26.35 ± 3.47
|
0.92 ± 0.08
|
The muscle-specific metabolic model was reconstructed by integrating muscle gene expression data from healthy group into the Human Metabolic Reaction 2 (HMR2) [10] as the generic metabolic model. Muscle-specific biomass reaction was added to the model in order to have a functional and alive muscle model. The model comprises of 3585 metabolites, 5740 reactions and, 3627 genes. Table 2 shows the characteristics of the muscle-specific metabolic model in comparison with HMR2. For quality controls and validation, we first checked for the biomass production to assess that all the required reactions to have alive model are satisfied. Then, 56 metabolic tasks known to occur in all types of human cells were checked; these metabolic functions can be found in [17]. Moreover, for muscle-specific validation, like Bordbar et.al. [3], we tested the ability of the model to produce ATP from several different sources like as glucose, fatty acids, glycogen branched-chain amino acids, and ketone bodies. The model successfully passed all these tests. The reconstructed muscle-specific metabolic model was employed for further analyses.
Table 2
Comparison of HMR2 metabolic model with our reconstructed muscle-specific metabolic model
|
HMR2 model
|
Muscle model
|
Reactions
|
8181
|
5740
|
Metabolites
|
6006
|
3585
|
Genes
|
3765
|
3627
|
Reporter metabolites associated with the onset of diabetes were identified by topology-based analysis
In this analysis, the reconstructed muscle-specific metabolic network was employed as the scaffold to explore the effects of transcriptional reprogramming in the context of metabolism. Using this approach, it is possible to identify so-called reporter metabolites that are those metabolites associated with the most significant transcriptional changes [18]. Gene sets associated with each metabolite can be identified from a metabolic network through gene-reaction-metabolite relation. Here, the Piano R package was used to perform reporter metabolites analysis. Results demonstrated that from 3310 metabolites having associated genes, 275 ones were identified as reporter metabolites with adjusted p-value < 0.05 in at least one directional/non-directional class. Most of the reporter metabolites were affected by down-regulated genes. Also, most of them were implicated in mitochondrial functions such as the production of mitochondrial ATP, activated fatty acids, pyruvate, ubiquinol, ubiquinone, AKG, succinyl-CoA, NADH, Acetyl-CoA, lipoamide, 5,10-methenyl-THF, and 5,10-methylene-THF. In addition, some metabolites including ceramide, tyrosine, and tryptophan were affected by up-regulated genes. The reporter metabolites were mainly involved in glycolysis, tricarboxylic acid, fatty acids, and amino acids metabolism. The complete list of the reporter metabolites can be found in Additional File1 Table S1.
Constraint-based analysis of metabolism at the onset of T2DM reveals reprogramming of pathways related to extracellular matrix
We attempted to capture cell metabolism at the system-level by using constraint-based metabolic modeling. In this approach, the metabolic model is converted to the mathematical format, and several constraints (e.g. adjusting lower and upper bound of flux in each reaction) were imposed on the model. Combining high-throughput transcriptional data and constraint-based models provides an opportunity to infer cellular metabolism at the relevant physiological state like diabetes [19]. Flux variability analysis (FVA) is a well-known constraint-based approach, which can provide allowable flux states in the metabolic model through linear programming based strategy [20].
In order to study metabolic capability changes in muscle between healthy and newly diagnosed diabetic patients, personalized GEMs were reconstructed and FVA was applied. Significant perturbed reactions between the two states identified using the two-sample t-test. This analysis resulted in perturbed reactions involving in several pathways such as glucose, amino acid, and lipid metabolisms. The perturbations in the glucose to glucose-6-phosphate conversion reaction and the glucose exchange reaction were also observed. Many reactions contributing to fatty acids and lipid metabolism were dysregulated. Also, changes in fluxes of several reactions implicated in inositol phosphate, chondroitin, and heparin sulfate metabolism were observed. To find associated pathways with these perturbed reactions, flux enrichment analysis was employed. These significantly perturbed pathways are shown in Table 3.
Table 3
Significant dysregulated pathways in T2DM obtained from flux enrichment analysis
Group
|
Adjusted P-value
|
Transport, mitochondrial
|
6.98E-27
|
Keratan sulfate biosynthesis
|
1.05E-23
|
Chondroitin / heparan sulfate biosynthesis
|
3.16E-22
|
Glycerolipid metabolism
|
6.86E-21
|
Exchange reactions
|
4.51E-14
|
Fatty acid activation (cytosolic)
|
5.44E-13
|
N-glycan metabolism
|
3.78E-12
|
Heparan sulfate degradation
|
4.15E-10
|
Nucleotide metabolism
|
9.91E-10
|
Acyl-CoA hydrolysis
|
3.67E-09
|
Carnitine shuttle
|
1.4E-07
|
Omega-6 fatty acid metabolism
|
1.07E-07
|
Arginine and proline metabolism
|
1.08E-07
|
Folate metabolism
|
2.8E-07
|
Beta oxidation of even-chain fatty acids (peroxisomal)
|
2.39E-06
|
Acylglycerides metabolism
|
5.77E-06
|
Sphingolipid metabolism
|
2.14E-05
|
Glycerophospholipid metabolism
|
2.59E-05
|
Glycine, serine and threonine metabolism
|
9.12E-05
|
Valine, leucine, and isoleucine metabolism
|
0.000622
|
Transport, nuclear
|
0.001187
|
Alanine, aspartate and glutamate metabolism
|
0.001398
|
Tricarboxylic acid cycle and glyoxylate/dicarboxylate metabolism
|
0.004234
|
Glycosphingolipid biosynthesis
|
0.004949
|
Pyrimidine metabolism
|
0.007809
|
Fatty acid elongation (even-chain)
|
0.017541
|
Amino sugar and nucleotide sugar metabolism
|
0.018405
|
Nicotinate and nicotinamide metabolism
|
0.018405
|
Pyruvate metabolism
|
0.018405
|
Cysteine and methionine metabolism
|
0.022241
|
Fatty acid elongation (odd-chain)
|
0.026726
|
Inositol phosphate metabolism
|
0.036758
|
Pentose and glucuronate interconversions
|
0.038008
|
Phenylalanine, tyrosine and tryptophan biosynthesis
|
0.040009
|
Omega-3 fatty acid metabolism
|
0.044516
|
Formation and hydrolysis of cholesterol esters
|
0.046091
|
Purine metabolism
|
0.048746
|
Pentose phosphate pathway
|
0.051904
|
Glycolysis / Gluconeogenesis
|
0.055204
|
Biopterin metabolism
|
0.064145
|
Propanoate metabolism
|
0.064552
|
Glycosphingolipid metabolism
|
0.083322
|
Potential Metabolite Markers Prediction
The genome-scale model of human metabolism provides an opportunity to predict potential biofluid markers associated with the diseases. The overall approach involves identifying exchange metabolites that their flux range in disease state is shifted compared to healthy ones. The successful performance of this method has already been evaluated and validated [21]. Here, we applied this method to find exchange metabolite markers of insulin resistance in muscle. Two generic metabolic models of healthy and diabetic muscles were reconstructed using average gene expression data of each group and E-Flux method; Then, FVA analysis was applied to them. The exchange reactions in which the flux interval was shifted relative to the normal state were selected (Fig. 2). This approach resulted in 32 exchange reactions. In order to evaluate the discriminative capability of these reactions at the individual level, a machine learning approach was employed. To do this, personalized metabolic models of healthy and diabetic individuals were reconstructed and FVA analysis was applied. The generated minimum and maximum fluxes of these reactions based on FVA analysis were used as the features for classification with support vector machine (SVM). This evaluation resulted in 72.22% accuracy, 81.11% specificity, and 60.32 sensitivity. We found that these exchange metabolites altered at the individuals’ level.
To achieve a fewer and better feature subset with improvement in classification accuracy, a wrapper feature selection method was employed that comprises a combination of genetic algorithm (GA) and SVM as the feature selector and classifier, respectively. Several subsets of features with which SVM classifier can discriminate diabetic patients from healthy ones with approximately 90 percentages accuracy were found. This GA-SVM procedure was iterated 100 times resulting in 100 features subsets; then, the observation frequency of each feature in these iterations calculated. These frequencies were considered as indicative of the importance of features and features with at least 80% frequency regarded as top-ranked features. The top-ranked features were related to 13 reactions exchanging glucose, alanine, aspartate, sodium, hyaluronate, galactose, GQ1b, acetaldehyde, deoxyribose, 4-OH-Estradiol, hypoxanthine, retinoate, and methylglyoxal. Subsequently, the SVM classification performance with the top-ranked features was assessed. Our analysis revealed that using these top-ranked features improved classification accuracy to 81.04% with 73.02% sensitivity and 86.67% specificity. Since, the classifier performance will vary depending on which samples are assigned to the training set and which ones to the test set, we repeated the 10-fold cross-validation 100 times and evaluated the SVM performance. The boxplot of this evaluation is shown in Fig. 3.
For further validation, we assessed the performance of these top-ranked features with data obtained from another study [7]. This data comprises of RNA-Seq samples from vastus lateralis muscle of 24 participants, which is divided into four following groups: normal glucose tolerant (NGT)/non-obese, NGT/obese, T2DM/non-obese, and T2DM/obese. Preprocessing of the read counts was applied. Then personalized metabolic models were reconstructed and FVA analysis was performed. Associated fluxes of top-ranked features were obtained and were used as the features for SVM classifier.
We divided these people into NGT and diabetic groups regardless of the obesity state and the generated fluxes of marker reactions in these 24 participants were used for validation with SVM. This analysis resulted in 49.46% accuracy, 42.25% specificity, and 56.67% sensitivity, which is remarkably low, probably due to the presence of normoglycemic obese people in the healthy group. Although the blood glucose level in these individuals is in the normal range, high levels of fasting blood insulin level (60 ± 33 pmol/l in NGT/obese vs 25 ± 7 pmol/l in NGT/non-obese) indicates insulin resistance in this group, which is compensated by more insulin secretion. Thus, we decided to remove data from NGT/obese individuals and re-evaluated markers in this set. The result significantly altered with the 78.50% accuracy, 70.17% specificity, and 82.67% sensitivity. Therefore, the metabolites markers are related to insulin-resistant metabolism and can successfully discriminate insulin-resistant individuals from healthy ones.