Cotton is an important commodity in the world economy. In this study we have carried out genome-wide identification and bioinformatics characterization of basic leucine zipper domain proteins (bZIPs) from cultivated cotton species G. hirsutum along with two sub-genome species of allotetraploid cotton, G. arboreum and G. raimondii. A total of 228 bZIP genes of G. hirsutum, 91 bZIP genes of G. arboreum and 86 bZIP genes of G. raimondii were identified from CottonGen database. Cotton bZIP genes were annotated in standard pattern according to their match with Arabidopsis bZIPs. Multiple genes with similar bZIP designations were observed in cotton, linked to the gene duplication. Cotton bZIPs are distributed across all 13 chromosomes with varied density. Phylogenetic characterization of all three cotton species bZIPs classified them into 12 subfamilies, namely A B, C, D, E, F, G, H, I, J, K and S and further into eight subgroups according to their predicted functional similarities, viz., A1, A2, A3, C1, C2, S1, S2 and S3. Subfamily A and S are having maximum number of bZIP genes, subfamily B, H, J and K are single member families. Cotton bZIP protein functions were predicted from identified motifs and orthologs from varied species. BRLZ domain analysis of G. raimondii bZIPs revealed the presence of conserved basic region motif N-X7-R/K in almost all subfamily members, variants are GrbZIP62 with N-X7-I motif and GrbZIP76 with K-X7-R motif. Leucine heptad repeats motif, are also present in variant numbers from two to nine with leucine or other hydrophobic amino acid at designated position among 12 subfamily members. STRING protein interaction network analysis of G. raimondii bZIPs observed strong interaction between A-D, B-K and C-S subfamily members.

Figure 1

Figure 2

Figure 3

Figure 4
This is a list of supplementary files associated with this preprint. Click to download.
Tables 1 - 4
1] Supplementary file 1 –Xls. (Online resource 1) All identified bZIP gene ID, exon number, CDS length, protein length and protein sequences of G. hirsutum, G. arboreum and G. raimondii. G. hirsutum, G. arboreum and G. raimondii bZIP34, bZIP61, bZIP69 and bZIP76 protein alignment.
2] MEME motif analysis MAST files of G. hirsutum, G. arboreum and G. raimondii bZIP proteins
2] MEME motif analysis MAST files of G. hirsutum, G. arboreum and G. raimondii bZIP proteins
3] Gamma distribution phylogenetic tree
Loading...
Posted 06 Apr, 2021
Posted 06 Apr, 2021
Cotton is an important commodity in the world economy. In this study we have carried out genome-wide identification and bioinformatics characterization of basic leucine zipper domain proteins (bZIPs) from cultivated cotton species G. hirsutum along with two sub-genome species of allotetraploid cotton, G. arboreum and G. raimondii. A total of 228 bZIP genes of G. hirsutum, 91 bZIP genes of G. arboreum and 86 bZIP genes of G. raimondii were identified from CottonGen database. Cotton bZIP genes were annotated in standard pattern according to their match with Arabidopsis bZIPs. Multiple genes with similar bZIP designations were observed in cotton, linked to the gene duplication. Cotton bZIPs are distributed across all 13 chromosomes with varied density. Phylogenetic characterization of all three cotton species bZIPs classified them into 12 subfamilies, namely A B, C, D, E, F, G, H, I, J, K and S and further into eight subgroups according to their predicted functional similarities, viz., A1, A2, A3, C1, C2, S1, S2 and S3. Subfamily A and S are having maximum number of bZIP genes, subfamily B, H, J and K are single member families. Cotton bZIP protein functions were predicted from identified motifs and orthologs from varied species. BRLZ domain analysis of G. raimondii bZIPs revealed the presence of conserved basic region motif N-X7-R/K in almost all subfamily members, variants are GrbZIP62 with N-X7-I motif and GrbZIP76 with K-X7-R motif. Leucine heptad repeats motif, are also present in variant numbers from two to nine with leucine or other hydrophobic amino acid at designated position among 12 subfamily members. STRING protein interaction network analysis of G. raimondii bZIPs observed strong interaction between A-D, B-K and C-S subfamily members.

Figure 1

Figure 2

Figure 3

Figure 4
This is a list of supplementary files associated with this preprint. Click to download.
Tables 1 - 4
1] Supplementary file 1 –Xls. (Online resource 1) All identified bZIP gene ID, exon number, CDS length, protein length and protein sequences of G. hirsutum, G. arboreum and G. raimondii. G. hirsutum, G. arboreum and G. raimondii bZIP34, bZIP61, bZIP69 and bZIP76 protein alignment.
2] MEME motif analysis MAST files of G. hirsutum, G. arboreum and G. raimondii bZIP proteins
2] MEME motif analysis MAST files of G. hirsutum, G. arboreum and G. raimondii bZIP proteins
3] Gamma distribution phylogenetic tree
Loading...