3.1 Intragenus and interspecies SNPs identified from buffalo liver transcriptome data
A total of 1,40,056 interspecies SNPs were found in 13801 genes based on Bos taurus genome. By considering an average gene size of 1200 base pairs (bp) in eukaryotes, the 13801 genes would cover nearly 16,561,200 bp of the genome. As 1,40,056 locations showed interspecies SNPs between cattle and buffaloes, the variation between these two species could be 0.84% in the genes specific to early postpartum liver transcriptome in buffaloes. However, the variation at genomic level between the two species was reported to be 3% (Moaeen-ud-Din M and Bilal G 2015) indicating a lesser variation in the coding region of the genome than in the non-coding regions between the cattle and buffaloes. Although these SNPs were distributed throughout the genome, much variation was on the chromosome No. 3, which is syntenic to the buffalo Chromosome No. 6 (Fig. 1a). From these 1,40,056 interspecies SNPs, 188 intra-genus buffalo SNPs were identified in 156 genes. Among these 188 SNPs, 163 has been successfully mapped to different chromosomes of Bubalus bubalis (breed Mediterranean ASM312139v1) (Fig. 2a). The remaining SNPs were indicated as unplaced or not matched or shown many matches with the buffalo genome. Maximum number of intragenus SNPs were mapped to the chromosomes, 1, 2, 3 and 4 in buffaloes (Fig. 2a). The variation in the coding region between cattle and buffaloes appear to be high on cattle chromosomes 3 (syntenic position on chromosome number 6 in buffaloes) and 19 (syntenic position on chromosome number 3 in buffaloes).
3.2 Bioinformatics analysis of the identified intragenus and interspecies SNPs
Pathway analysis of the genes containing interspecies SNPs found that majority of the genes were annotated to be involved in top five pathways, such as metabolism of proteins (1999), metabolism (1960), immune system (1827), post-translational modification (1378), gene expression (1364), RNA polymerase II transcription (1228) and innate immunity (1065) (Fig. 1b). Variation in innate immune genes between cattle and buffalo may explain their differential susceptibility to certain diseases. The SNPs in the innate immune genes may affect their differential expression of these genes between the two species. For instance, differential expression of immune genes among the two species was reported to understand the less susceptibility of water buffalo to Schistosoma japonicum than yellow cattle (Yang et al. 2015). As the innate immunity pathway was annotated to be functionally defined pathway by CPDB analysis, the gene list from this pathway was used for the network analysis by the STRING and CYTOSCAPE software. This network analysis resulted in eleven significant (P < 0.05) sub networks (Fig. 1c and Fig. 1d). DAVID functional annotation of the genes in these sub-networks revealed that majority of these genes were involved in cytoskeletal organization, protein catabolism, and membrane specific signalling involved in innate immunity. This observation based on liver transcriptome generated interspecies SNPs suggest that the two species vary at the basic molecular signalling response repertoire involved in the defence mechanism against infections.
Pathway analysis of the successfully mapped 163 intragenus SNPs in 156 genes found that the SNPs were found majorly in the genes involved in metabolic pathways, vesicle mediated transport, and disease signalling (Fig. 2b). Among the genes involved in the metabolic pathways, the genes related to lipid metabolism explains the importance of their SNPs and the liver in handling lipids. During NEB, the liver and its lipid metabolizing genes play an important role in meeting the energy demand for milk synthesis (Drackley et al. 2001), especially in high yielders by initiating lipolysis in adipose tissue (Loor et al. 2007; McCarthy et al. 2010) and adapting liver’s metabolism (Veshkini et al. 2022). Therefore, variations imposed by gene polymorphisms in lipid metabolic pathways in the liver may affect the animal’s ability to sustain the stress caused by the NEB during early post-partum and resume their normal physiological activities like reproduction. These differences in animal’s capability will also influence their differential susceptibility to post-partum diseases. Similarly, among the genes annotated to be involved in vesicle mediated transport and disease signalling, the genes related to the TGF (Tumor growth factor) beta signalling and EGFR1 (Epidermal growth factor receptor 1) signalling showed variation among buffaloes. Vesicle mediated transport and signalling are important during cross-talk among the liver, mammary, adipose, and reproductive organs, which manage the reciprocal control of metabolic activities and immune adaptation among these organs (Bu et al. 2017) during post-partum.
Among eight intragenus SNPs found in the coding regions of the genes, six SNPs in the genes, Interferon alpha/beta receptor 1 (INFAR1B), Low molecular weight phosphotyrosine phosphatase isoform x2 (LMPTP), Ribosome binding protein 1 isoform X2 (RBP1), Leukocyte specific transcript 1 (LST1), Complement 4 like and N-fatty-acyl-amino acid synthase/hydrolase (PM20D1) are non-synonymous for the two allelic forms (Table 1). Among these six SNPs, the maximum point mutation score (-4) for amino acid change was observed in LMPTP as per PAM 250 log odds substitution matrix. The polymorphisms in this gene were reported to be associated with the conception season and insulin resistance in humans (Bottini et al. 2002; Stanford et al. 2021). Insulin resistance during NEB may affect lipolysis and lipid mobility, which are very crucial for regaining reproductive efficacy. Hence, future studies are needed to explore its association with reproductive efficiency in Murrah buffaloes.
Table 1
Intragenus SNPs present in coding regions: Eight SNPs are present in coding regions of the genes out of 163 intragenus SNPs, of these six SNPs are nonsynonymous and two are synonymous. PAM score determined for non synonymous SNPs shows LMPTP gene non synomous SNP with maximum score
S.No | Gene | Intra-genus SNP and type of SNP | Codon change with SNP | Amino acid change with SNP | PAM Score |
1 | Interferon alpha/beta receptor 1 | A/T and Non synonymous SNP | TTA/TTT | Leucine/ Phenylalanine | 2 |
2 | SGTA (small glutamine rich tetratricopeptide repeat containing protein alpha isoform X2) | C/T and Synonymous SNP | CCC/CCT | Proline | NA |
3 | Low molecular weight phosphotyrosine phosphatase isoform x2 | G/C/A and Non synonymous SNP | TGC/TTC/TAC | Cysteine/ Phenylalanine | -4 |
4 | Ribosome binding protein 1 isoform X2 | C/G and Non Synonymous SNP | CGT/TGT | Alanine/ Threonine | 1 |
5 | Fatty acyl amino acid synthase / hydrolase | G/T/C and Non synonymous SNP | TCG/TCT/TCC | Alanine/Serine /Proline | 1 |
6 | Complement C4 like | A/G and Non synonymous SNP | GG/CAG | Arginine/Glutamine | 1 |
7 | HSP1A | C/A and Synonymous SNP | CGG/AGG | Glycine | NA |
8 | LST1 | A/G/C and Non synonymous SNP | AAG/GAG/CAG | Lysine/Glutamic acid | 0 |