Background The three helical Histone Fold Motif (HFM) of core histone proteins provides an evolutionarily favoured site for the protein-DNA interface. Despite significant variation in sequence, the HFM retains a distinctive structural fold that has diversified into several non-histone protein families in. In this work we explore the ancestry of non-histone HFM containing families in the plant kingdom.
Results A sequence search algorithm was developed using iterative profile Hidden Markov Models to identify remote homologs of core-histone proteins. The resulting hits were functionally annotated, classified into families, and subjected to comprehensive phylogenetic analyses via Maximum likelihood and Bayesian methods. We have identified over 4000 HFM containing proteins in the plant kingdom that are not histones, mostly existing as diverse transcription factor families, distributed widely within and across taxonomic groups.
Conclusion Patterns of homology suggest that core histone subunit H2A has evolved into newer families like NF-YC and DrAp1, whereas the H2B subunit of core histones shares a common ancestry with NF-YB and Dr1 class of TFs. Core histone subunits H3 and H4 were found to have evolved into DPE and TAF proteins, respectively. Taken together these results provide insights into diversification events during the evolution of the histone fold motif, including sub-functionalization and neo-functionalization of the HFM.

Figure 1

Figure 2

Figure 3

Figure 4

Figure 5
This is a list of supplementary files associated with this preprint. Click to download.
Loading...
Posted 08 Sep, 2019
Posted 08 Sep, 2019
Background The three helical Histone Fold Motif (HFM) of core histone proteins provides an evolutionarily favoured site for the protein-DNA interface. Despite significant variation in sequence, the HFM retains a distinctive structural fold that has diversified into several non-histone protein families in. In this work we explore the ancestry of non-histone HFM containing families in the plant kingdom.
Results A sequence search algorithm was developed using iterative profile Hidden Markov Models to identify remote homologs of core-histone proteins. The resulting hits were functionally annotated, classified into families, and subjected to comprehensive phylogenetic analyses via Maximum likelihood and Bayesian methods. We have identified over 4000 HFM containing proteins in the plant kingdom that are not histones, mostly existing as diverse transcription factor families, distributed widely within and across taxonomic groups.
Conclusion Patterns of homology suggest that core histone subunit H2A has evolved into newer families like NF-YC and DrAp1, whereas the H2B subunit of core histones shares a common ancestry with NF-YB and Dr1 class of TFs. Core histone subunits H3 and H4 were found to have evolved into DPE and TAF proteins, respectively. Taken together these results provide insights into diversification events during the evolution of the histone fold motif, including sub-functionalization and neo-functionalization of the HFM.

Figure 1

Figure 2

Figure 3

Figure 4

Figure 5
This is a list of supplementary files associated with this preprint. Click to download.
Loading...