KnLOV1 more closely related with higher plants phototropin LOV1 then algal phototropin LOV1
To perform evolutionary and homology analysis, well-characterized LOV domain sequences from several algae and plants were selected, including OlPhot LOV1 from Ostreococcus lucimarinus, DsPhot LOV1 from Dunaliella salina, CsPhot LOV1 from Coccomyxa subellipsoidea, CrPhot LOV1 from Chlamydomonas reinhardtii, VcPhot LOV1 from Volvox carteri, MpPhot LOV1 from Marchantia polymorpha, SmPhot from Sellaginella moellendorfii, Ac-vPhot LOV1 from Adiantum capillus-veneris, GmPhot LOV1 from Gnetum montanum, and AtPhot LOV1 from Arabidopsis thaliana. The results indicated that KnPhot sequence shares an overall sequence homology of ~42% with algal phototropin and ~57% with higher plant phototropin. However, KnPhot LOV1 domain exhibits ~65% similarity to algal LOV1 domain and ~70% similarity to higher plant LOV1 domain, as depicted in Fig. 1b. Interestingly, the linker region between LOV1 and LOV2 domain was longer than the algal Phot(s) and shorter than the higher plant Phot(s), indicating the evolutionary changes in K. nitens phototropin to adapt from aquatic to land habitat. Moreover, homology analysis showed the presence of a conserved NCRFLQ motif and other key residues regulating photocycle time in K. nitens phototropin LOV1 domain, but differ in some binding pocket residues from other LOV domains. The residues involved in protein oligomerization were also conserved in KnLOV1 domain similar to algal and higher plant LOV1 domain.
For phylogenetic analysis, we utilized phototropin gene sequences from different algae and plants like Ostreococcus lucimarinus (OlPhot), Micromonas sps. (MspPhot), Dunaliella salina (DsPhot), Coccomyxa subellipsoidea (CsPhot), Chlamydomonas reinhardtii (CrPhot), Volvox carteri (VcPhot), Marchantia polymorpha (MpPhot), Anthoceros bharadwajii (AbPhot), Sellaginella moellendorfii (SmPhot), Adiantum capillus-veneris (Ac-vPhot), Gnetum montanum (GmPhot), and Arabidopsis thaliana (AtPhot). The phylogenetic analysis revealed that KnPhot LOV1 domain is closer to higher plant Phot LOV1 domain than the algal Phot LOV1 domain, as illustrated in Fig. 1a. These findings provide valuable insights into the evolutionary history and functional characteristics of phototropin genes in K. nitens and other related organisms. The blue and green color of node branches represent the algal and higher plants similarity.
KnLOV1 protein is structurally more similar to higher plant phototropin LOV1 domain
The KnLOV1 protein possesses a structure comprising of five β-sheets bordered by five ɑ-helices, as depicted in Fig. 2a. The C-terminal ɑ-helix of the protein is rotatable and essential for its dimerization, and it contains both polar and non-polar residues in its flavin binding pocket. Six active amino acid residues interact with the FMN ring at six distinct positions: (i) FMN-O4-NE2-Gln122, 2.89 Å; (ii) FMN-N5-NE2-Gln122, 3.33 Å; (iii) FMN-O4-ND2-Asn101, 2.99 Å; (iv) FMN-N3–ND2-Asn91, 2.92 Å, (v) FMN-O2-NE2-Gln63, 2.94 Å; and (vi) FMN-O4-NE2-Gln63, 2.78 Å. The side chains of Cys59(S) interact with FMN ring C4A and N5 atom at distances of 3.8 Å and 3.5 Å, respectively. Additionally, the phosphate of FMN interacts with the guanidinium chain of Arg60 (Arg60-NE–O3P-FMN, 2.9 Å; Arg60-NH2–O2P-FMN, 2.88 Å) and Arg76 (Arg-76-NE-O1P-FMN, 3.21 Å; Arg76-NH2-O1P-FMN, 2.82 Å) (as shown in Fig. 2c and 3a).
In terms of structural similarity, the KnLOV1 protein is most closely related to the AtPhot2-LOV1, with a similarity of ~64%, as indicated by a Z score of 20.2 and an RMSD value of 1.5 in the DALI server. The C. reinhardtii, a freshwater alga, phototropin LOV1 domain (CrPhotLOV1) has a similarity of ~66%, with a Z score of 19.7 and an RMSD value of 1.1. One notable difference between KnLOV1 and AtPhot-2LOV1 is the presence of an α-helical structure in the latter, while CrPhotLOV1 has a similar loop structure to KnLOV1. The first and second β-sheets in KnLOV1 are shorter than the third, fourth, and fifth β-sheets in AtPhot2-LOV1, and all of the α-helices in KnLOV1 are shorter than those in AtPhot2-LOV1. The length of the loop between the first and second β-sheets is more variable and has a more hanging structural orientation in AtPhot-LOV1A than in KnLOV1 (as seen in Fig. 2a). The FMN binding pocket in KnLOV1 is larger than that in AtPhot-LOV1, and the isoalloxazine ring of FMN is surrounded by Cys59 (in the third α-helix) and Leu103 (in the third β-sheet) residues, with an average distance between them of 7.1 Å. This distance is 7.0 Å (Cys170 and Leu214) and 6.55 Å (Cys57 and Leu101) in AtPhot2-LOV1A and CrPhotLOV1, respectively (Suppl. Fig. S1a-d).
The Hydrogen-bonding (HB) network residues in KnLOV1 are critical for protein dark recovery. The residues involved in HB network in KnLOV1, namely Gln122, Asn101, Asn91, Gln63, Arg60 and Arg76, are similar to those found in higher plant (AtPhot2-LOV1) and algal phototropin LOV1 (CrPhotLOV1). Additionally, Ile36 and Ile119 in KnLOV1 are structurally more similar to Ile147 and Ile230 in AtPhot2-LOV1, but differ from Leu34 and Val117 in CrPhotLOV1. However, KnLOV1 is structurally more similar to CrPhotLOV1 (Asp31 and Ile77) than to AtPhot2-LOV1 (His144, Val190), due to the presence of Asp33 and Ile79 in the ligand binding pocket (Suppl. Fig. S1a-d). These residues in KnLOV1 are located farther from FMN compared to CrPhotLOV1. The flanking residue Asp33 in KnLOV1 is comparable to Asp31 in CrPhotLOV1, which forms a salt bridge connection with Arg58 to cover FMN ring and stabilize the structure. Therefore, mutation at the similar site in KnLOV1 at Asp33 could lead to structural deformity. In AtPhot2LOV1, His143 residue is present and is located slightly further from Arg171 compared to CrPhotLOV1. However, the comparable distance is higher in KnLOV1 between Asp33 and Arg60 (Suppl. Fig. S1e). A significant difference between these three proteins is the presence of a proline residue before Asp31 (CrPhotLOV1) and His143 (AtPhot2LOV1), which restricts the flexibility of loop. Nevertheless, in KnLOV1, a lysine residue is present instead of proline residue, which maintains slight flexibility (Fig. 3c).
Alteration in protein chromophore environment affects photorecovery kinetics of KnLOV1 proteins
The native KnLOV1 protein is a monomeric structure and all the three mutant named D33N R60K, and Q122N, are dimer in an asymmetric unit of the crystal structure. The KnLOV1 mutants D33N, R60K and Q122N crystals were diffracted up to 2.0 Å, 2.1 Å and 1.9 Å respectively and were processed in the P43212 space group. The Rwork and Rfree values of D33N, R60K and Q122N mutants are 16/21, 17/22 and 18/22 respectively. The summary of molecular refinement statistics has been mentioned in Suppl. Table S1. The structural comparison between native and mutants shows high similarity with the following RMSD values of D33N (0.170), R60K (0.352 ) and Q122N (0.132).
The primary difference in all mutants is interaction of Gln122 residue with N5 atom of FMN which introduces change in structure; culminating ultimately to the loose binding and faster recovery rate (Fig. 3). The first promising position that is Asp33 do not interact directly with FMN ligand but affecting the stability of the protein. The mutation D33N then causes a steep decrease in the recovery time which is 580 seconds. D33N mutation causes an increase in the interaction distance 3.3 Å to 3.5 Å between Gln122 to N5-FMN atom compared to native KnLOV1. The distance of all H-bond increases that are interacting with FMN ring along and decreases in those which interacts with phosphate group (Fig. 3b, Table 1). D33N mutation make FMN more dynamic compare to native because it destabilizes it which results to reduction in recovery time. The second mutant R60K structure shows that the distance between N5-FMN and Gln122 increased from 3.3 Å to 3.4 Å compared to native. An increase in the H-bond distances (0.1 Å) with active site residue and decreasing bond distance towards Arg76 (Fig. 3d, Table 1). All these interactions then make FMN unstable compare to native KnLOV1 leading to drastic fall in recovery time from 2647 seconds to 122 seconds. In Q122N mutation structure, there is shorting of first and third beta sheet compared to native structure. This mutation also causes the loss of complete interaction with N5 atom of FMN and only weak interactions of FMN is leftover with active site residue compare to native KnLOV1. Due to main interacting residue (Gln), the structural alteration collectively responsible for the destabilization of cofactor FMN leading to remarkable reduction in recovery time from 2647 seconds to 101 seconds. The mutated residue Asn have no interaction with N5-FMN ring and distance increased from 3.3 to 3.8 Å and distance between ND2 group of O4-FMN decreased from 3.0 to 2.7 Å (Fig. 3c, Table 1).
The comparison between all the three mutant shows that differential recovery time ranging from highest 580 seconds (D33N), 122 seconds (R60K) to lowest 101 seconds (Q122N) is depends on the interaction of Gln122 with N5-FMN. As Q122N mutation lost direct interaction with N5-FMN but R60K and D33N persists weak interaction and correlates with the time of recovery rate fastest in Q122N and lowest in D33N. The effect of all the mutations also increases the area of binding pocket size of ligand with protein might also contributing the effect on recovery rate (Fig. 3b).
Hydrogen-bonding network residues attribute to unusual long dark recovery of KnLOV1 protein
The duration of dark recovery in the LOV domain is influenced by the photophysical behavior of the chromophore and its environment in the binding pocket. Alterations in the HB network and the chromophore binding pocket can affect the photophysical behavior and lifetime of the photocycle13,14,22,23. In this study, we investigated the changes in the photophysical characteristics and recovery kinetics of KnLOV1 by measuring the absorption spectra in the dark and at various time points after blue light illumination (see Fig. 4a). Our results showed that the protein exhibits maximum absorbance (λ max) at 448 nm for the S0 → S1 transition bands, with shoulder peaks at 426 and 475 nm, which differs from the algal (λ max = 447 nm) and higher plant LOV1 (λ max = 449 nm). Additionally, we observed prominent S0 → S2 bands with minor peak at 351 nm and major peak at 369 nm, differing from algal LOV1 at 360 nm and higher plant LOV1 at 361 nm. Upon blue light illumination, protein absorbance decreased, and in the dark, the protein exhibited dark reversion after some time. The KnPhot LOV1 protein showed a dark recovery time of approximately 2.5 hours, which was much longer than that of algal and higher plants LOV1 protein16,17 (as shown in Fig. 4a).
To explore the role of different amino acids in dark recovery, we introduced various mutations in the protein. We selected mutations by substituting binding pocket residues and part of HB network involved in the photo-recovery kinetics of the protein, such as R60K, Q122N, and D33N. We expressed and purified mutants such as LOV1 R60K, LOV1 Q122N, LOV1 D33N, and LOV1 R60K D33N, LOV1 Q122N D33N, with an intact chromophore. The absorption and fluorescence spectra revealed that the mutations R60K and R60K D33N caused a slight blue shift of approximately 1 nm, while Q122N D33N caused a shift of approximately 2 nm in the transition I peaks. Wild-type KnLOV1 protein and different mutants exhibited different recovery time constants (τ), representing the recovery rate of the protein from the signaling state to the ground state. Wild-type KnLOV1 protein exhibited a slower recovery rate (τ = 2467 seconds) compared to different mutants, i.e., R60K (τ = 122 seconds), Q122N (τ = 101 seconds), D33N (τ = 580 seconds), R60K D33N (τ = 31.8 seconds), and Q122N D33N (τ = 36.5 seconds), which accelerated the recovery kinetics of the protein (as shown in Fig. 4b-f).
The Q122N mutation in the LOV1 domain of KnPhot accelerated the photocycle of the protein, and the recovery was about 80 times faster in the Q122N mutant compared to the KnLOV1 wild-type protein. The faster recovery rate indicates the destabilization of the covalent adduct due to the mutation.
The photodynamic properties of the KnLOV1 examined by measuring the fluorescence decay time in wild type and mutant proteins and was found 4.1 nanoseconds (wild type), 3.5 nanoseconds (R60K mutant), 3.9 nanoseconds (Q122N mutant), and 3.7 nanoseconds (D33N mutant), as mentioned in Suppl. Table S2.
We found that the mutations primarily modulating the photocycle lifetime significantly did not alter the triplet state decay or adduct formation in KnLOV1 protein. Steady state fluorescence for KnLOV1 wild type and different mutants revealed similar emission maxima (495 nm) and shoulder peaks (518 nm) across all proteins (Suppl. Fig. S4), which is consistent with the algal LOV1 domain.
Residues for dark recovery affect the light induced confirmational changes in KnLOV1
The light-induced changes in secondary structure of the LOV1 protein investigated using circular dichroism (CD) spectroscopy in the far-UV region of wild type LOV1 protein displayed a characteristic α-helical conformation with distinct peaks at 208 and 222 nm, and a prominent peak at 218 nm indicating the presence of β-sheet structure (Suppl. Fig. S5a). Upon exposure to blue light, there was a gradual reduction in both α-helical and β-sheet content. Similar results were obtained for the Q122N and D33N mutants (Suppl. Fig. S5c and S5d). However, the R60K mutant showed an increase in both α-helical and β-sheet content (Suppl. Fig. S5b). Double mutations, such as R60K D33N, led to an increase in α-helix and a decrease in β-sheet content, whereas Q122N D33N resulted in a decrease in α-helix and an increase in β-sheet content (Suppl. Fig. S5e and S5f). The mutants used in this study are listed in Suppl. Table S3. These findings suggest that blue light induces secondary structural changes in the LOV1 protein and that different mutations can produce contrasting effects on the protein's conformation.
Photoinduced oligomerization of KnLOV1 protein
Effect of light on protein crosslinking suggested that in native conditions, KnLOV1 protein primarily exists in dimer form with trace amounts of tetramer, trimer, and monomer. SDS-PAGE gel analysis of crosslinking samples revealed the presence of protein monomers, suggesting non-covalent interactions among monomer molecules in protein oligomers (Suppl. Fig. S6a and S7). Glutaraldehyde crosslinking experiments confirm that the majority of the protein is in dimer form, with a small number of tetramers, trimers, and monomers (Suppl. Fig. S6a and b). We conducted crosslinking experiments under different light conditions to study the effect of light on protein crosslinking. The dark incubated crosslinked sample showed a higher concentration of dimer than the light and light-dark crosslinked samples (lane 2 and 3 dark (D), 4 and 5 light (L), 6 and 7 light-dark (LD) in Suppl. Fig. S6a). We used protein without GA addition as a control sample for oligomerization study in gel (lane 8 in Suppl. Fig. S6a). Dark incubated protein samples showed a higher number of oligomer species formation compared to light incubated samples (Suppl. Table S4). However, blue light illumination decreased the oligomer concentration and increased the monomer concentration in protein samples. These results coincide with AFM studies.
To visualize and characterize the morphology and structure of LOV1 oligomers, we studied crosslinked protein samples in various light conditions using AFM. We used native purified protein as a control sample in AFM for size comparison. The oligomer size was measured by examining the cross-section of different particles seen in AFM images. Protein size was also confirmed using DLS of crosslinked protein particles in different light conditions and control samples (Suppl. Fig. S8). The results indicate that the oligomers vary in size from approximately 1.2-5 nm for monomers, 6-12 nm for dimers, and 14-22 nm for trimer and tetramer molecules depending on the measuring axis for the cross-section of oligomeric molecules (Fig. 5). In AFM results, protein oligomers come in contact and form higher oligomeric species, resulting in an increase in the size of the oligomer depending on the surface and process of molecular oligomerization. In control samples (with native purified protein), monomeric size molecules were predominant, with a trace amount of dimer molecule, which may have been formed due to storage or higher protein concentration in AFM samples.