Large-scale analysis of small molecule-RNA interactions using multiplexed RNA structure libraries

doi:10.21203/rs.3.rs-3371513/v1

Article

This work is licensed under a CC BY 4.0 License

Version 1

The large-scale analysis of small-molecule binding to diverse RNA structures is key to understanding the interaction properties and selectivity required for developing RNA-binding molecules toward RNA-targeted therapies. Here, we report a new system for performing the large-scale analysis of small molecule–RNA interactions using a multiplexed pull-down assay with RNA structure libraries. The system profiled the RNA-binding landscapes of G-clamp, which recognizes an unpaired guanine base, and of thiazole orange derivatives (TO and TO-3), which are good probes for fluorescent indicator displacement (FID) assays. Based on the binding information obtained for TO and TO-3, we selected combinations of fluorescent indicators and drug-targetable pre-miRNAs and screened for RNA-binding molecules using FID. Four hit compounds were identified, and three of them were validated. Our system provides fundamental information about small molecule–RNA interactions and facilitates the discovery of novel RNA-binding molecules.

Biological sciences/Chemical biology/Nucleic acids

Biological sciences/Biochemistry/RNA

Targeting RNA with small molecules represents an attractive medicinal approach for treating gene-related and infectious diseases.^1–5 For example, drugs targeting specific RNA splice sites have been approved to alleviate the symptoms of spinal muscular atrophy.^6,7 Further, human precursor microRNAs (pre-miRNAs)^8–13, various repetitive RNAs, such as CUG^14–17 and UGGAA¹⁸ repeats, and structured RNA elements of infectious pathogens^19–21 are considered promising drug targets. When developing new RNA-binding molecules, profiling the RNA-binding landscapes of various types of RNA structures is critical for gaining deep insights into their binding properties and selectivities.^22–24 One powerful way to profile the binding of small molecules is an analysis based on massively parallel DNA sequencing. For example, Disney’s group developed a computational approach, Inforna, based on their screening methods and massive sequencing analysis, that has led to the discovery of various regulatory RNA-binding molecules in RNA-related disease models.^10–12,25 Their binding profiles focused on the sequence variants within internal loops and bulge structures. More recently, Sugimoto’s group implemented RNA-capturing microsphere particles to establish a new sequencing-based RNA-selection method that does not require any ligand labeling for the RNA-binding fluorescent molecules.^26,27 Although these methods are valuable, they could produce inaccurate results in the profiling of specific or stable RNA structures, such as G-quadruplex (G4) structures, owing to structure-dependent amplification biases. This is because polymerase tends to pause at structured RNA sites during reverse transcription or polymerase chain reactions (PCR).^28,29 Therefore, different approaches that do not involve reverse transcription or PCR are required for the profiling of small-molecule binding to diverse RNA structures, particularly highly structured RNAs exhibiting naturally occurring sequences.

Recently, we developed a new method, folded RNA element profiling with structure library (FOREST)³⁰, for the large-scale analysis of protein–RNA interactions using a multiplexed RNA structure library. FOREST quantifies interactions using a DNA barcode microarray that can capture RNA probes in an RNA structure library (Fig. 1) that is designed by extracting structured motifs from RNA structure datasets. In this system, a stabilizing common stem, a unique RNA barcode (5′ terminus), and Cy5 or Cy3 (3′ terminus) were attached to each RNA structure (Fig. 1a). Employing this system, we revealed the interaction landscape of RNA-binding proteins (RBPs) using the RNA structure library that was extracted from human pre-miRNAs, human 5′ UTRs, and the HIV-1 RNA genome. FOREST drives amplification-free quantification, thus facilitating the bias-free detection of different RNA structures and their interactors (e.g., G4 and G4-binding RBPs). Notably, we identified cross-reactive interactions among some of the tested RBPs. For example, we observed that three G4-binding proteins exhibited different binding preferences to G4 and interacted with non-G4 RNA motifs (e.g., the r(GAA)_n motif) with different selectivity. Thus, we hypothesized that our method could be used as a platform for profiling the RNA-binding landscapes of small molecules.

In this study, we introduced a new systematic and large-scale approach for investigating small molecule–RNA interaction profiles. Because it subjects small molecules to FOREST, our system is well suited to analyzing large-scale datasets of diverse RNA structures derived from naturally occurring sequences. As the detection of the binding affinities of different RNA structures is based on microarray analysis, FOREST avoids sequencing and structure-dependent amplification biases. Additionally, the results include not only high-affinity interactions but also intermediate- and low-affinity ones. Therefore, our datasets will be invaluable resources for understanding the fine determinants of small molecule–RNA interactions.

Design of the platform for the large-scale analysis of small molecule–RNA interactions

For the first RNA structure library (Library-1), we designed 1824 RNA structural motifs by extracting the terminal loops of human pre-miRNAs and adding several repetitive and control sequences.³⁰ Five different barcodes were allocated to each motif structure to exclude outliers arising from non-specific binding to the barcode sequences. The small molecule was immobilized onto beads via biotin–streptavidin interactions (Fig. 1a). We performed the pull-down by mixing the RNA structure library with the immobilized small molecule, followed by washing and elution steps to collect the bound RNAs. The pulled-down RNAs were quantified on a DNA barcode microarray to obtain the fluorescence intensity of each RNA structure, because the fluorescence intensities correlate with binding affinities after background subtraction using streptavidin control samples without conjugated ligand.³⁰

In this study, we selected G-clamp and thiazole orange (TO) derivatives as the binding molecules (Fig. 1). G-clamp can recognize an unpaired guanine base in RNA loop structures by forming four hydrogen bonds (Fig. 1b).^31–33 G-clamp was used to validate our system because it binds strongly to a wide range of RNAs. Conversely, the TO derivatives, TO-PRO-1 and TO-PRO-3, are known as fluorescent light-up probes for imaging and fluorescent indicator displacement (FID) assays (Fig. 1c).^34–38 FID represents a high-throughput method for identifying novel RNA-binding molecules.^39–45 For example, TO-PRO-3, a deep-red fluorescent indicator, was used in an FID assay to screen for compounds that bind to the bacterial A-site, influenza A virus RNA, and G4 DNA.^37,38,46 However, the binding information of these fluorescent indicators and their target RNA sequences is still limited. We believed that it would be beneficial to determine the RNA binding profiles of such conventionally used indicators to further expand the repertoire of target RNA sequences that can be used in FID assays. Based on the structure of TO-PRO-1, we designed the N₃-modified TO–N₃ and TO–N₃-2 exhibiting different linker positions (Fig. 1d). Similarly, we designed TO-3–N₃ and TO-3–N₃-2. The N₃-modified G-clamp–N₃, TO–N₃, and TO-3–N₃ were synthesized using N₃–PEG₃–NH₂ as an N₃ linker after preparing the carboxylic acid intermediates (Schemes S1–S3), whereas TO–N₃-2 and TO-3–N₃-2 were synthesized using N₃–PEG₄–NHS ester as an N₃ linker after preparing the amine intermediates (Schemes S4 and S5).^47,48 These N₃-modified molecules were conjugated to biotin via a strain-promoted azide–alkyne cycloaddition (SPAAC) with DBCO–biotin (Figs. 1a, S1, and S2)^49,50 and used for the experiments without further purification (Figures S1 and S2).

Large-scale analysis of the interaction of G-clamp-N₃ with Library-1

First, we ranked the RNA motifs from Library-1 based on their G-clamp binding (ranking list S1). To understand the binding properties of G-clamp, the numbers of bases in the single-stranded (ss) and double-stranded (ds) RNA regions were investigated using the predicted secondary structures of the pre-miRNA loops (Fig. 2). Regarding ssRNA, the G count of the high-ranking RNAs (1–360) was significantly higher than that of all the pre-miRNAs in Library-1, whereas the G count of the low-ranking RNAs (1441–1800) was significantly lower. Conversely, the C counts of the high- and low-ranking RNAs were lower and higher than those of all the pre-miRNAs in Library-1, respectively. The U count of the high-ranking RNAs was lower than that of all the pre-miRNAs, and the A count of ssRNA was not significantly different among the rank sections. Regarding dsRNA, the four bases exhibited smaller differences among the ranks compared with ssRNA. The C and U counts were inversely related to the G count, as C and U in the ssRNA region can form base pairs with the neighboring G bases. Furthermore, the distribution of unpaired G counts highlighted an unpaired-G selectivity (Figure S3). Five or more unpaired Gs were mainly observed in high-ranking RNAs (1–180), and the percentage decreased gradually as the rank decreased. In contrast, few RNAs with zero or only one unpaired G were observed in the high-ranking group, and the percentage gradually increased as the rank decreased. These results are consistent with the fact that G-clamp mostly recognizes G bases in ssRNA regions.³²
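The base-count analysis described above can be illustrated with a minimal sketch, assuming each motif is available as a sequence plus a dot-bracket secondary structure (for example, as produced by RNAfold). The function name and the example hairpin are hypothetical and are not taken from the library.

```python
from collections import Counter


def count_bases_by_pairing(sequence, dot_bracket):
    """Count bases in unpaired (ss) and paired (ds) positions of one motif."""
    if len(sequence) != len(dot_bracket):
        raise ValueError("sequence and structure must have the same length")
    ss_counts, ds_counts = Counter(), Counter()
    for base, symbol in zip(sequence.upper(), dot_bracket):
        # "." marks an unpaired nucleotide; brackets mark base-paired ones.
        if symbol == ".":
            ss_counts[base] += 1
        else:
            ds_counts[base] += 1
    return ss_counts, ds_counts


# Hypothetical hairpin used only for illustration.
example_sequence = "GGCGAGAGCC"
example_structure = "(((....)))"
ss, ds = count_bases_by_pairing(example_sequence, example_structure)
print("unpaired G:", ss["G"], "paired G:", ds["G"])
```

Comparing such per-motif counts between rank sections (for example, ranks 1–360 against the whole library) would then require a separate statistical test, which this sketch does not attempt.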

Next, to validate our screening platform for RNA structures, we selected 17 sequences from the high-affinity (top 100), intermediate-affinity (101–1000), and low-affinity (1001–1824) groups and measured their apparent dissociation constants (K_Dapp) by fluorescence titration (Figure S4). The RNA motifs with three base pairs of a common stem (5′-AGC-motif-GCU-3′) were used to measure K_Dapp. A histogram of Z-scores and the correlation between the Z-scores and K_Dapp values are shown in Figs. 3a and 3b and Table S1. The minimum free energy structures of the selected RNAs are shown in Figs. 3c and S5. The rank 1 and rank 2 RNAs (Fig. 3c, top) contained unpaired guanine bases in their loop structures and exhibited strong G-clamp binding (K_Dapp = 0.024 and 0.022 µM, respectively). For the rank 1 RNA (hsa-mir-4520-1 loop), we performed a G mutation assay using two G-mutated hsa-mir-4520-1 loops (mir-4520-1-mutG2A and -mutG7A). Although mutG2A exhibited strong binding (K_Dapp = 0.011 µM) similar to the wild type, mutG7A exhibited weaker binding (K_Dapp = 15 µM). The double mutant mutG2,7A also exhibited weaker binding (K_Dapp = 3.7 µM) than the wild type, indicating that G7 contributes to the strong interaction with G-clamp. To examine the selectivity for G7, molecular modeling of the complex between mir-4520-1 and G-clamp–N₃ was performed using RNAComposer^51,52 and MacroModel (Fig. 3d). When G-clamp is bound to G7 by four hydrogen bonds, it can interact with neighboring bases. We considered that these interactions, such as stacking with CG base pairs at the top of the stem, would facilitate strong binding in addition to the formation of the four hydrogen bonds, indicating that G-clamp does not recognize all Gs on the loop but rather specific Gs. The high number of G bases in the ssRNA region of high-ranking RNAs probably increased the probability of the presence of G bases that bind to G-clamp strongly.
In the high-affinity group, two of the selected RNA motifs contained the G4 structure. The K_Dapp values of the hsa-mir-6850 loop (rank 28) and G4_(GGGU)₆ (rank 38) were 0.19 and 0.15 µM, respectively. In the intermediate-affinity group, even though hsa-mir-548ba (rank 522) exhibited a loop that was similar to that in hsa-mir-4520-1, its K_Dapp value (10 µM) was much higher. Comparing the modeling structures of hsa-mir-4520-1 and hsa-mir-548ba (Figure S6) revealed that G-clamp–N₃ cannot interact with adjacent bases when it forms hydrogen bonds with a G base on the loop structure of hsa-mir-548ba. In the low-affinity group, the loops without any G bases, such as hsa-mir-4773-1 (rank 1192), hsa-mir-4282 (rank 1775), and the common stem sequence with four Us in the hairpin loop, exhibited weak binding (K_Dapp > 40 µM; Figures S4 and S5). Within the group of selected RNAs, only (CUG)₁₆ (rank 43) deviated from our expectations in the fluorescence titration experiment (Fig. 3b, green color). Overall, we observed a good correlation between the Z-scores and observed K_Dapp (Fig. 3b, Spearman’s correlation coefficient: −0.86); the coefficient without considering (CUG)₁₆ exhibited an even higher correlation (−0.95). The G4 structures, which are susceptible to bias when using sequencing-based methods, were evaluated and ranked. These results indicate that our system for the large-scale analysis of the RNA structure libraries can ensure accurate assessments of small molecule–RNA interactions.
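For readers who want to reproduce this kind of rank correlation, the following is a minimal sketch of Spearman's coefficient between Z-scores and K_Dapp values, written in plain Python with average ranks for ties. The numbers are made up for illustration and are not the measured values; a negative coefficient is expected because stronger binders have higher Z-scores and lower K_Dapp.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman correlation = Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5


# Illustrative (made-up) Z-scores and apparent K_D values in µM.
z_example = [3.1, 2.4, 1.0, 0.2, -0.5]
kd_example = [0.05, 0.5, 12.0, 5.0, 45.0]
rho = spearman(z_example, kd_example)
print(round(rho, 2))
```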

Large-scale analysis of the interaction of the thiazole orange derivatives with Library-2

Next, we investigated the binding of different RNA motifs to the TO derivatives using our second RNA structure library, Library-2 (ranking lists S2–S5). Library-2 contains 3000 RNA structural motifs that were designed by extracting the terminal loops of human pre-miRNAs, along with SARS-CoV-2 and influenza A virus RNAs and several repetitive and control sequences. Compared with the G-clamp binding profile, TO and TO-3 exhibited distinct profiles (Fig. 4a), although a significant correlation was observed between their binding profiles (Fig. 4b). These data indicate that the TO derivatives exhibited similar selectivities, which were unique compared with the G-clamp, as expected. The correlation coefficient between TO–N₃ and TO–N₃-2 with different linker positions (r = 0.78) was lower than that between TO–N₃ and TO-3–N₃ with the same linker position (r = 0.91), suggesting that the linker positions affect the binding profile (Fig. 4b). The high-affinity group of RNAs for the TO derivatives was mainly populated with G4 RNAs. The kernel density estimation of the Z-scores of the TO derivatives indicated the significant enrichment of the G4 control RNAs (Figure S7).

To understand the binding properties of the TO derivatives, the numbers of bases in the ssRNA and dsRNA regions were quantified using the predicted secondary structures of the pre-miRNA loops, as in the analysis of G-clamp (Fig. 4c). For ssRNA, the G count of the high-ranking RNAs (1–360) was significantly higher than that of all the pre-miRNAs in Library-2. In contrast, the ssRNA counts of the other bases were not significantly different among the different ranks. Regarding dsRNA, the G and C counts of the high-ranking RNAs (1–360), as well as the A and U counts of the low-ranking RNAs (1441–1800), were significantly higher than those of all the pre-miRNAs. The count tendencies of TO-3–N₃ and TO–N₃ were similar. Overall, these results suggest that the TO derivatives prefer G-rich ssRNA and G/C-rich rigid stem structures, such as hsa-mir-5091 and -4437 (Fig. 4d). Regarding ssRNA, we further examined the total number of nucleotides in the internal and hairpin loops (Fig. 4e). Although high-ranking RNAs exhibited more G and A bases in their internal loops, the hairpin loops of high-ranking RNAs exhibited a preference only for G and not for the other bases. These results suggest that the TO derivatives prefer G/A bases in internal loops and G-rich hairpin loops. A likely explanation is that the internal loops comprising G/A bases may create a binding pocket that is ideal for intercalation, whereas the G-rich hairpins may form G4-like structures. To confirm the preference of the TO derivatives for internal loops comprising G/A bases, we compared the K_Dapp values of hsa-mir-4437 and its internal loop (AGG to UCC) mutant, mir-4437-mut (Figs. 4d and S8).
Although the K_Dapp values of TO–N₃ and TO-3–N₃ for the wild-type hsa-mir-4437 loop were relatively low (4.4 and 11 µM, respectively), the K_Dapp values for mir-4437-mut were much higher (> 40 µM), suggesting that the G/A bases in the internal loop are crucial to the strong binding of the TO derivatives, at least for the hsa-mir-4437 loop.

To further validate the binding profiles of the TO derivatives that were generated by our screening platform, the K_Dapp values of TO–N₃ and TO-3–N₃ interacting with 11 RNAs were measured by fluorescence titration (Figures S9 and S10 and Table S2). For the high-ranking RNAs (top 100), the K_Dapp values correlated well with the Z-scores of TO–N₃, and the Spearman correlation coefficient was −0.93 (Fig. 5a). Contrarily, no strong binding was observed for the low-ranking RNAs (K_Dapp > 40 µM). Similarly, the K_Dapp values of TO-3–N₃ also correlated well with the Z-scores of TO-3–N₃ of high-ranking RNAs (top 100), as the coefficient was −0.96 (Fig. 5b). These results confirm that our system can provide accurate assessments of different binding modes of ligands and structured RNAs containing G4 structures.

Additionally, we extended this analysis to the commercially available indicators, TO-PRO-1 and TO-PRO-3, by measuring their K_Dapp values for the 16 selected RNAs (pre-miRNAs, G4 RNAs, and virus RNAs) and calculating the correlations with the Z-scores of TO–N₃ and TO-3–N₃, respectively (Figures S11–S13 and Tables S3 and S4). Regarding TO-PRO-1, the K_Dapp values exhibited weak and improved correlations with the Z-scores of TO–N₃ (r = −0.60) and TO–N₃-2 (r = −0.71), respectively, indicating that the binding profile of TO–N₃-2 may reflect TO-PRO-1 binding by various RNA motifs more accurately (Fig. 5a). Conversely, for TO-PRO-3, there were significant correlations between the K_Dapp values and Z-scores of TO-3–N₃ (r = −0.89) and TO-3–N₃-2 (r = −0.90) (Fig. 5b). Taken together, these binding profiles will benefit the selection of the proper combinations of target RNA and fluorescent indicators for FID assays.

Screening of the novel RNA-binding molecules by fluorescent indicator displacement assay using TO-PRO-1 and TO-PRO-3

Based on the binding profiles of the TO derivatives, we selected the intermediate-affinity-ranked combinations of the indicator and disease-related human pre-miRNAs previously observed to be dysregulated in several tumors, hsa-mir-221, -191, and -21, for the FID assay (Fig. 6).^53–55 As a high-rank G4 RNA control, hsa-mir-6850 was selected. Additionally, as low-rank controls, the hairpin loop motifs from SARS-CoV-2 RNA (SARS-low) and hsa-mir-374a were selected. The predicted RNA secondary structures are shown in Fig. 6b, and the K_Dapp values of TO-PRO-1 and TO-PRO-3 to these target and control RNAs are listed. The signal-to-background (S/B) ratios of TO-PRO-1 and TO-PRO-3 for these RNAs are summarized in Fig. 6c. The S/B ratios of the low-rank RNAs were significantly lower than those of the others. A low S/B ratio is not favorable for performing an accurate FID assay. To identify the small molecules that bind to the target human pre-miRNAs listed above, we employed FID to screen a chemical library comprising 118 oxidation–reduction compounds (Targetmol). The fluorescence emission of TOs depends on the RNA binding: free TOs exhibit low fluorescence, whereas the intensity increases upon RNA binding. Thus, the fluorescence emission of TOs decreases when a test compound interacts with a target RNA via the same site as the fluorescent indicator, thereby identifying it as a hit compound (Fig. 6a). Through this screen, we identified four hit compounds that disrupted TO–RNA interactions (Figs. 6d and S14). Although three of these compounds, baicalein (Bai), myricetin (Myr), and chelerythrine chloride (Che), were hits obtained from the assay when using TO-PRO-1, Bai did not meet our selection criteria when TO-PRO-3 was used as the indicator; rather, AS 602801 (AS) became a hit compound.
This is probably because TO-PRO-3 differs in size and/or fluorescent properties compared with TO-PRO-1, indicating that diverse fluorescent indicators should be included to avoid false negatives and positives. Regarding the hit compounds, Myr^56–58 and Che^59–61 have been reported as DNA or RNA binders, whereas AS has not been reported.
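The two quantities that drive an FID readout can be sketched as follows. This is an illustrative calculation only: the text does not state the formulas or the selection criteria used for hit calling, so the definitions below (S/B as the ratio of RNA-bound to free indicator fluorescence, and displacement as the fraction of the assay window lost on adding a compound) are common conventions assumed here, and all readings are hypothetical.

```python
def signal_to_background(f_indicator_rna, f_indicator_free):
    """S/B ratio: indicator fluorescence with RNA over indicator alone (assumed definition)."""
    if f_indicator_free <= 0:
        raise ValueError("free-indicator fluorescence must be positive")
    return f_indicator_rna / f_indicator_free


def percent_displacement(f_with_compound, f_indicator_rna, f_indicator_free):
    """Percentage of the assay window lost when a test compound is added (assumed definition)."""
    window = f_indicator_rna - f_indicator_free
    if window <= 0:
        raise ValueError("no assay window: bound signal must exceed free signal")
    return 100.0 * (f_indicator_rna - f_with_compound) / window


# Hypothetical plate-reader values in arbitrary units.
sb_ratio = signal_to_background(500.0, 50.0)
displacement = percent_displacement(275.0, 500.0, 50.0)
print(sb_ratio, displacement)
```

The sketch makes the dependence on the assay window explicit: when the S/B ratio is close to 1, the denominator in the displacement calculation is small, so noise in the readings translates into large swings in apparent displacement.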

The RNA binding of the four hit compounds was validated by measuring their K_Dapp values by fluorescence titrations. These experiments revealed that Bai exhibits weak RNA binding (K_Dapp > 40 µM), indicating that it is a false-positive compound for targeting disease-related human pre-miRNAs when using TO-PRO-1. The structurally similar flavonoid, Myr, exhibited moderate binding (K_Dapp = 16–25 µM) to target RNAs, consistent with the indicator-based results (Figures S15 and S16). Unexpectedly, Myr bound strongly to hsa-mir-6850, which forms a G4 structure, although it was not identified as a hit compound when TO-PRO-3 was used. This suggests that Myr and TO-PRO-3 might have different binding sites. With low-rank RNAs, Myr exhibited weak RNA binding (K_Dapp > 40 µM) even though the indicators gave positive results. Moreover, we observed that Che bound to all the RNAs (K_Dapp = 2.6–16 µM), although the indicators gave negative results for low-rank RNAs (Figs. 6d and S17). Overall, as expected, unreliable results were obtained when low-rank RNAs were used. The precision of the assay data across the investigated RNAs worsened as the RNA ranking decreased (Figure S18), suggesting that our binding profiles offer insight into the selection of applicable RNA targets for indicators in FID assays.

In the fluorescence spectra of Che, two major peaks were observed at 420 and 550 nm (Figs. 7a and S17). Under aqueous conditions, Che forms an OH adduct that emits a strong fluorescence signal at 420 nm when the reaction is at equilibrium.^62,63 However, the intensity of this 420 nm peak increased dramatically at pH 8 as we shifted the experimental conditions from pH 5 to 8, indicating that the addition of OH was favored under weak alkaline conditions (Figure S19). Although the fluorescence intensity of the OH-adduct peak at 420 nm decreased after RNA addition, the 550 nm peak increased. This is likely because RNA binding protected Che from hydrolytic attack, shifting the reaction equilibrium toward Che. Finally, we observed AS binding to hsa-mir-191, -21, and -6850 (K_Dapp = 14, 20, and 4.5 µM, respectively). Interestingly, this compound exhibited strong light-up properties (Figs. 7b and S20): although free AS exhibited almost no fluorescence (Φ_free = 0.00063), strong fluorescence was observed after RNA binding (Φ_bound = 0.054). The methine tautomer⁶⁴ likely contributes to this light-up property. TO-PRO-1 could not detect the RNA binding of this compound because its strong light-up emission, in a similar wavelength range, interfered with the detection of the fluorescence originating from TO-PRO-1. These characteristics make AS an interesting seed compound for developing novel RNA binders and fluorescence probes.
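As a quick worked check of the size of this light-up effect, the ratio of the two reported quantum yields gives the approximate fold enhancement (this ratio is derived here from the stated values and is not itself reported in the text):

$$\frac{\Phi_{\text{bound}}}{\Phi_{\text{free}}} = \frac{0.054}{0.00063} \approx 86$$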

We developed a large-scale analytical platform for investigating small molecule–RNA interactions by subjecting the small molecules to FOREST. The affinity profiles generated by FOREST include not only high-affinity interactions but also intermediate- and low-affinity ones, across a wide range of RNA structures derived from naturally occurring sequences. Additionally, in contrast to methods using massively parallel DNA sequencing, FOREST uses microarray analysis to determine the binding affinities of RNA structure libraries and therefore presents the affinity profiles of small molecules without any structure-dependent amplification bias.³⁰ First, we validated our system using the unpaired G-specific binding property of the G-clamp (Figs. 2 and 3). The FOREST system ranked the G-clamp bindings of high-, intermediate-, and low-affinity RNA targets. Second, we generated the binding profiles of the TO derivatives using this platform (Figs. 4 and 5). Employing FOREST profiling, G4 structures, which are susceptible to bias by sequencing-based methods, were evaluated and ranked as top-tier interactors of the TO derivatives. Additionally, the analysis of the affinity profiles reveals a binding preference of the TO derivatives for RNA motifs containing G-rich hairpin loops, internal loop G/A bases, and/or G/C-rich stem structures (Figs. 4c–e).

The library-wide binding landscape and profiles were also applicable to commercially available fluorescent indicators, TO-PRO-1 and TO-PRO-3, for FID assays (Fig. 6). Since our knowledge of fluorescent indicator–RNA combinations remains limited, the profiles generated by this system can benefit the selection of optimal combinations and further expand the repertoire of target RNA sequences for FID assays. In this study, we identified three binding molecules for disease-related human pre-miRNA loop motifs by FID assays using TO-PRO-1 and TO-PRO-3 based on the binding profiles of the TO derivatives generated from FOREST. The FID assays using these indicators and low-rank RNAs could not provide accurate hit compounds (Fig. 6), demonstrating that our binding profiles are valuable for selecting applicable combinations for the FID assay. Moreover, we demonstrated the utility of this screening approach by identifying AS 602801 as an RNA binder that binds hsa-mir-191, -21, and -6850 with remarkable light-up properties (Figs. 6d, 7b, and S18). Considering that AS 602801 was identified only by using TO-PRO-3, the use of multiple fluorescent indicators is recommended for FID assays. Our system will be valuable for obtaining further RNA-binding information for fluorescent indicators.

The FOREST system in this study provides the basis for future efforts to identify new small molecule–RNA interactions, investigate the binding profiles and selectivities of various RNA-binding molecules, and aid the design of novel RNA-binding molecules through FID assays.

In silico RNA motif extraction

All motifs, including the human pre-miRNA motifs in Library-1 and -2, were extracted from miRBase as detailed previously.³⁰ To design Library-2, the human pre-miRNA motifs were filtered based on length (< 107 nt), with 1804 species collected in total. Next, we obtained RNA secondary structure datasets as determined by SHAPE-MaP or DMS-MaPseq with structural analysis.^65,66 Predicted structures and conserved elements of SARS-CoV-2 were obtained from a published study.⁶⁷ From the collected datasets, we divided long continuous RNAs into terminal motifs and defined them as structural units using FOREST.py (https://github.com/KRK13/FOREST2020). In total, 1099 motifs were collected from the transcripts of SARS-CoV-2 and influenza A viruses. As controls, selected RNA structural motifs, aptamers, and defective mutants were collected and loaded into the libraries.
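The idea of dividing a long structured RNA into terminal motifs can be illustrated with a simplified sketch. This is not the algorithm implemented in FOREST.py, whose details are not given here; it is a hypothetical approach that starts from each hairpin-closing base pair in a dot-bracket structure and extends outward through the enclosing stem (allowing bulges and internal loops) until a multibranch junction or the end of the RNA is reached. The function names and the example sequence are assumptions for illustration.

```python
def pair_table(dot_bracket):
    """Map each paired position to its partner index (None if unpaired)."""
    stack, partner = [], [None] * len(dot_bracket)
    for i, symbol in enumerate(dot_bracket):
        if symbol == "(":
            stack.append(i)
        elif symbol == ")":
            if not stack:
                raise ValueError("unbalanced structure")
            j = stack.pop()
            partner[i], partner[j] = j, i
    if stack:
        raise ValueError("unbalanced structure")
    return partner


def terminal_motifs(sequence, dot_bracket):
    """Return (sequence, structure) for every unbranched stem-loop in the RNA."""
    partner = pair_table(dot_bracket)
    n = len(dot_bracket)
    motifs = []
    for i, j in enumerate(partner):
        if j is None or j < i:
            continue  # unpaired, or the 3' half of a pair already visited
        if any(partner[k] is not None for k in range(i + 1, j)):
            continue  # pair encloses other pairs, so it does not close a hairpin
        lo, hi = i, j
        while True:
            # Skip unpaired nucleotides on both sides of the current stem.
            a, b = lo - 1, hi + 1
            while a >= 0 and partner[a] is None:
                a -= 1
            while b < n and partner[b] is None:
                b += 1
            # Extend only if the next paired bases pair with each other.
            if a >= 0 and b < n and partner[a] == b:
                lo, hi = a, b
            else:
                break
        motifs.append((sequence[lo:hi + 1], dot_bracket[lo:hi + 1]))
    return motifs


# Hypothetical two-hairpin multibranch RNA used only for illustration.
long_sequence = "GGAACCUUUGGAAGCAAAAGCAACC"
long_structure = "((..((...))..((....))..))"
motifs = terminal_motifs(long_sequence, long_structure)
print(motifs)
```

In this toy example the outer stem is shared by both hairpins, so extension stops at the junction and each hairpin is returned as its own structural unit.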

Design of a template pool of RNA structure library and DNA barcode microarray

Multiplexed single-stranded DNA sequences were used as templates for RNA probes in the library. The extracted RNA motifs were joined to a T7 promoter, RNA barcodes, and stabilizing stem sequences for detection and hybridization to the DNA barcode microarray as previously described.³⁰ The ssDNA templates were synthesized by SurePrint oligonucleotide library synthesis (Agilent Technologies). The size of the oligo template was limited to 170 nt for RNA structure Library-1 and 190 nt for Library-2. After assigning barcodes to RNA structures, the DNA reverse complementary strands of RNA barcodes were used by SureDesign (Agilent Technologies), a custom CGH array design service, to synthesize DNA barcode microarrays. Probe Replication Factor was set to 5× and 3×.

3’-Terminal labeling with Cy5 or Cy3

All RNA probes in the RNA structure libraries were labelled with a fluorescent dye at the 3’ end. Ten micromolar RNA structure library, 100 µM pCp-Cy5 or pCp-Cy3 (Jena Bioscience), and 0.5 U/µL T4 RNA Ligase (Thermo Fisher Scientific) were mixed in 100 µL of 1× T4 Ligase Buffer (Thermo Fisher Scientific). The mixture was incubated at 16°C for 48 h on a ThermoMixer (Eppendorf) with ThermoTop (Eppendorf). After incubation, the labelled RNA was purified using Zymo RNA Clean and Concentrator (Zymo Research) and stored at −28°C until use.

RNA pull-down

The RNA structure library was prepared in 1× Binding buffer (20 mM phosphate pH 7.0, 20 mM NaCl, 80 mM KCl).³⁰ For folding, RNA was heated at 95°C and cooled to 4°C on a ProFlex Thermal Cycler (Thermo Fisher Scientific) with a ramp rate of −6°C/s. During the folding step, 100 pmol of small molecules and 50 µL of Streptavidin Mag Sepharose (Cytiva) were mixed in 900 µL of 1× Binding buffer to prepare the small molecule-conjugated beads. The mixture was incubated on a ThermoMixer (Eppendorf) at 25°C for 60 min with vortex mixing at 1200 rpm. The tube was placed on a magnetic rack to remove the supernatant, and 1 µg of the refolded RNA structure library in 1 mL of 1× Binding buffer was added. A mixture containing only the beads was prepared as a control for background subtraction. The mixture was incubated on a ThermoMixer at 25°C for 60 min with vortex mixing at 1200 rpm. The mixture was washed three times with 1× Binding buffer when the reaction ended. Two hundred microlitres of 1× Elution buffer (1% SDS, 10 mM Tris-HCl, 2 mM EDTA) was added to the magnetic beads and the mixture was heated at 95°C for 3 min. The bound RNA structures were collected from the supernatant by removing the magnetic beads and purified with phenol-chloroform extraction and ethanol precipitation.

Hybridization and microarray scanning

Eighteen microlitres of the bound RNA structures was mixed with 4.5 µL of 10× Blocking Agent (Agilent Technologies) and 22.5 µL of Hi-RPM Hybridization Buffer (Agilent Technologies). The samples were incubated for 5 min in a heat block set at 104°C, then rapidly cooled and incubated for 5 min in ice water. The samples were applied to an 8× 60 K Agilent microarray gasket slide (Agilent Technologies). The prepared gasket slide and CGH custom array 8× 60 K (Agilent Technologies) were assembled with SureHyb. Hybridization was performed for 20 h at a temperature of 55.5°C at 20 rpm. The microarray slide was washed for 5 min with Gene Expression Wash Buffer 1 (Agilent Technologies) in a glass container at room temperature following hybridization. The microarray slide was moved to a glass container containing Gene Expression Wash Buffer 2 (Agilent Technologies), which was immersed in a thermostatic bath at 37°C. The washing step was performed for 5 min. Fluorescence scanning was performed on the microarray and fluorescence image data were acquired using SureScan (Agilent Technologies). The acquired images were converted to numeric fluorescence intensities for each spot by Feature Extraction (Agilent Technologies) and GeneSpringGX (Agilent Technologies).

Calculation of binding intensity

The binding intensity of each RNA structure was calculated by subtracting the fluorescence intensities of the no-ligand control samples. To alleviate the effect of undesired interactions with the RNA barcode, each RNA structure was represented by five RNA probes that had the same RNA structure but different RNA barcodes. We discarded the maximum and minimum values from each set of five intensities and calculated the mean fluorescence intensity from the remaining three probes.
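A minimal sketch of this calculation for a single RNA structure is given below. It assumes that background subtraction is done per barcode before trimming; the order of these two operations is not spelled out in the text, and the function name and intensity values are hypothetical.

```python
def binding_intensity(ligand_signals, control_signals):
    """Background-subtract barcode replicates, drop the max and min, average the rest."""
    if len(ligand_signals) != len(control_signals):
        raise ValueError("ligand and control must have one value per barcode")
    if len(ligand_signals) < 3:
        raise ValueError("need at least three barcode replicates to trim")
    # Assumption: subtract the no-ligand (beads-only) signal barcode by barcode.
    corrected = sorted(l - c for l, c in zip(ligand_signals, control_signals))
    trimmed = corrected[1:-1]  # discard the minimum and the maximum
    return sum(trimmed) / len(trimmed)


# Hypothetical intensities for five barcodes of one RNA structure.
ligand = [1200.0, 950.0, 1100.0, 4000.0, 300.0]
control = [200.0, 150.0, 200.0, 200.0, 200.0]
intensity = binding_intensity(ligand, control)
print(intensity)
```

Trimming in this way keeps a single barcode with unusually strong or weak non-specific binding from dominating the value assigned to the structure.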

Statistics

For testing statistical significance, the two-tailed Brunner–Munzel test with Bonferroni correction was performed using Julia 1.6. The standard error (SE) was calculated from the three probes of each RNA structure in the library. The binding strength was normalized as a Z-score using Eq. (1), where µ is the mean binding intensity of the library population, σ is its standard deviation, and x is the binding intensity of each probe in the library.

$$\text{Z-score}_{x} = \frac{x-\mu }{\sigma } \tag{1}$$
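As a minimal sketch of the normalization in Eq. (1) and of the SE over replicate probes, the following Python uses only the standard library. The function names and example numbers are hypothetical, and the choice of the population standard deviation for σ and the sample standard deviation for SE are assumptions, since the text does not specify either.

```python
from math import sqrt
from statistics import mean, pstdev, stdev

def z_scores(intensities):
    """Z-score of each binding intensity relative to the whole library."""
    mu = mean(intensities)
    sigma = pstdev(intensities)  # population SD (assumption)
    return [(x - mu) / sigma for x in intensities]

def standard_error(values):
    """Standard error of the mean over replicate probes."""
    return stdev(values) / sqrt(len(values))  # sample SD (assumption)

# Hypothetical library of eight binding intensities.
library = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
print(z_scores(library))
print(standard_error([1.0, 2.0, 3.0]))
```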

Fluorescence binding assay

A solution (100 µL) of the binder (0.01 or 0.1 µM for G-clamp, 0.1 µM for TO-N₃ and TO-PRO-1, 1 µM for TO-3-N₃, 0.1 or 0.5 µM for TO-PRO-3) in 1× phosphate buffer (1% DMSO, 20 mM phosphate, 20 mM NaCl and 80 mM KCl) was transferred to a micro quartz cell with a 1-cm path length. Serial aliquots of a concentrated solution of RNA in 1× buffer were added to the binder solution and allowed to equilibrate for 2 min. The excitation wavelength was set at 360 nm for G-clamp, 501 nm for TO-N₃ and TO-PRO-1, and 623 nm for TO-3-N₃ and TO-PRO-3, and the emission was recorded at 20°C. Fluorescence measurements were performed with a JASCO-6500 spectrofluorometer (JASCO, Tokyo, Japan).

The data from the titrations were analyzed according to the independent-site model by non-linear fitting to Equations (2) or (3), in which F₀ is the initial fluorescence intensity in the absence of RNA, Q (= F_max/F₀) is the fluorescence enhancement upon saturation, A = K_Dapp/C_ligand and X = nC_RNA/C_ligand (n is the putative number of binding sites on RNA and n = 1 was used).⁶⁸ The parameters Q and X were determined by KaleidaGraph (Synergy Software, PA). The K_Dapp values in the main text show the mean values of two or three experiments.

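$$\frac{F}{F_{0}} = 1 + \frac{Q-1}{2}\left\{A + 1 + X - \left[{\left(X + 1 + A\right)}^{2} - 4X\right]^{1/2}\right\} \tag{2}$$

or

$$\Delta F = F - F_{0} = \frac{F_{0}\left(Q-1\right)}{2}\left\{A + 1 + X - \left[{\left(X + 1 + A\right)}^{2} - 4X\right]^{1/2}\right\} \tag{3}$$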
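To make the independent-site model concrete, the sketch below evaluates Eq. (2) and recovers K_Dapp and Q from a synthetic titration. It is not the authors' procedure, which used non-linear fitting in KaleidaGraph. Here a simple grid scan over K_Dapp is swapped in, with Q solved by linear least squares at each grid point. All names and numbers are hypothetical, and concentrations are assumed to be in µM.

```python
from math import sqrt

def fraction_term(a, x):
    """(1/2){A + 1 + X - [(X + 1 + A)^2 - 4X]^(1/2)} from Eq. (2)."""
    s = a + 1.0 + x
    return 0.5 * (s - sqrt(s * s - 4.0 * x))

def model(c_rna, c_ligand, kd, q, n=1.0):
    """F/F0 predicted by Eq. (2) with A = kd/c_ligand, X = n*c_rna/c_ligand."""
    return 1.0 + (q - 1.0) * fraction_term(kd / c_ligand, n * c_rna / c_ligand)

def fit(c_rna_list, ratios, c_ligand, kd_grid):
    """Scan kd over kd_grid; at each kd, Q follows from linear least squares."""
    y = [r - 1.0 for r in ratios]
    best = None
    for kd in kd_grid:
        h = [fraction_term(kd / c_ligand, c / c_ligand) for c in c_rna_list]
        denom = sum(v * v for v in h)
        if denom == 0.0:
            continue
        q = 1.0 + sum(yi * hi for yi, hi in zip(y, h)) / denom
        sse = sum((yi - (q - 1.0) * hi) ** 2 for yi, hi in zip(y, h))
        if best is None or sse < best[0]:
            best = (sse, kd, q)
    return best[1], best[2]

# Noise-free synthetic titration generated from known parameters.
c_ligand = 0.1
c_rna = [0.0, 0.05, 0.1, 0.2, 0.5, 1.0, 2.0]
true_kd, true_q = 0.25, 8.0
ratios = [model(c, c_ligand, true_kd, true_q) for c in c_rna]
kd_grid = [0.01 * i for i in range(1, 201)]  # 0.01 to 2.00 µM
kd_fit, q_fit = fit(c_rna, ratios, c_ligand, kd_grid)
print(kd_fit, q_fit)
```

Because F/F₀ − 1 is linear in (Q − 1) once A is fixed, only K_Dapp needs to be searched. A dedicated non-linear optimizer would be the usual choice for real data.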

RNA secondary structure prediction and visualization

The forna website⁶⁹ was used to generate illustrations of the RNA secondary structures predicted by RNAfold 2.4.13 in the ViennaRNA package⁷⁰ with the temperature set to 25°C. The RNA structures extracted from the long transcripts (5' UTR and HIV-1 genome) included in library-2 were taken from a previous study.³⁰

Structural preference analysis

Following previous studies⁷¹, the secondary structures of the RNA motifs in the RNA structure library were predicted with RNAsubopt 2.4.13 in the ViennaRNA package⁷⁰ using the following parameters (command: RNAsubopt --temp=25 --stochBT=30). Using the secondary structures generated by RNAsubopt as input, each nucleotide (A, G, U, C) was counted for each base-pair state (ssRNA or dsRNA) and for each structural motif (hairpin loop, inner loop, or stem).
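The counting step can be illustrated with a short Python sketch that tallies nucleotides by pairing state from dot-bracket structures, the notation RNAsubopt emits. This is a simplified illustration: the function name and example sequence are hypothetical, and it covers only the ssRNA/dsRNA split, not the classification into hairpin loops, inner loops, and stems.

```python
from collections import Counter

def count_pairing_states(sequence, structures):
    """Count (nucleotide, state) pairs over sampled dot-bracket structures.

    A '.' marks an unpaired position (ssRNA); '(' or ')' marks a paired
    position (dsRNA).
    """
    counts = Counter()
    for structure in structures:
        if len(structure) != len(sequence):
            raise ValueError("sequence and structure lengths differ")
        for base, symbol in zip(sequence, structure):
            state = "ssRNA" if symbol == "." else "dsRNA"
            counts[(base, state)] += 1
    return counts

# Hypothetical motif with two sampled structures.
seq = "GGGAAACCC"
structs = ["(((...)))", "((.....))"]
counts = count_pairing_states(seq, structs)
for key in sorted(counts):
    print(key, counts[key])
```

With stochastic backtracking, each motif contributes several sampled structures, so the counts reflect how often a nucleotide is paired across the sample rather than in a single predicted structure.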

FID assay

Fluorescence intensities in FID assays were measured with an Infinite® 200 PRO microplate reader (TECAN Group Ltd., Männedorf, Switzerland) using i-control® and LBS-coated Optiplate™-96F 96-well plates. Buffer solution (20 mM phosphate pH 7.0, 20 mM NaCl, 80 mM KCl) was added to each well (49.5 µL for blank and negative control wells, 49 µL for positive control and sample wells), followed by the addition of 0.25 µL of ligand solution (20 µM for TO-PRO-1 and 100 µM for TO-PRO-3) to each well except for the blank wells. RNA solution (0.5 µL of 10 µM for TO-PRO-1 and 50 µM for TO-PRO-3) in binding buffer was dispensed into the positive control and sample wells. DMSO was added to the control wells (negative and positive, 0.25 µL) and the blank wells (0.5 µL), whereas 0.25 µL of compound solution in DMSO (1 mM, Targetmol) was added to each sample well and mixed with the RNA-ligand solution. Fluorescence intensities of the mixtures were measured after incubating for 30 min. The excitation wavelength was set at 485 nm for TO-PRO-1 or 620 nm for TO-PRO-3. Normalized fluorescence intensity (F) was calculated using Eq. (4):

$$\text{Normalized } F = \frac{{F}_{(\text{indicator + RNA + test compounds})} - {F}_{(\text{buffer + indicator})}}{{F}_{(\text{indicator + RNA})} - {F}_{(\text{buffer + indicator})}} \tag{4}$$

Hits were selected based on a reduction of TO-PRO-1 or TO-PRO-3 signal by less than a standard deviation (σ) from the mean. Normalized fluorescence intensities greater than 1.5 were excluded from calculations for the mean and σ.
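The normalization of Eq. (4) and the exclusion of values above 1.5 can be sketched as follows. This is a hedged illustration only: the function names, the example values, and in particular the hit rule (normalized F below the mean minus k·σ, with k = 1 as a default) are assumptions made for the example, and the exact cutoff should be taken from the text above, not from this code.

```python
from statistics import mean, stdev

def normalized_f(f_sample, f_positive, f_negative):
    """Eq. (4): f_sample = indicator + RNA + compound,
    f_positive = indicator + RNA, f_negative = buffer + indicator."""
    return (f_sample - f_negative) / (f_positive - f_negative)

def select_hits(norm_values, k=1.0, upper_cutoff=1.5):
    """Return compounds whose normalized F falls below mean - k*sigma.

    Values above upper_cutoff are left out of the mean and sigma,
    as described in the text; k is a hypothetical parameter.
    """
    kept = [v for v in norm_values.values() if v <= upper_cutoff]
    mu, sigma = mean(kept), stdev(kept)  # sample SD (assumption)
    threshold = mu - k * sigma
    hits = sorted(name for name, v in norm_values.items() if v < threshold)
    return hits, threshold

# Hypothetical normalized intensities for five compounds.
values = {"c1": 1.0, "c2": 0.9, "c3": 1.1, "c4": 0.4, "c5": 2.0}
hits, threshold = select_hits(values)
print(hits, threshold)
```

Excluding values above 1.5 keeps compounds with strong intrinsic fluorescence (c5 here) from inflating the mean and σ that define the cutoff.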

Calculation of fluorescent quantum yield

The fluorescent quantum yields (QY) of AS 602801 in the presence of RNA were calculated using quinine sulfate in 0.1 M H₂SO₄ as a standard (Φ = 0.55). Absorbance and fluorescence values were recorded 3 min after mixing RNA and AS 602801. For calculating QY, the conditions for the absorbance measurement were as follows: [AS 602801] = 2.5 µM, [RNA] = 5 µM, and ε366. For the fluorescence measurement, [AS 602801] = 1 µM and [RNA] = 2 µM were used, and the emission spectrum area from 380 to 600 nm was integrated. QY values were calculated according to Eq. (5):

$${\Phi }_{sam.} = {\Phi }_{ref.} \times \frac{{\epsilon }_{ref.}}{{\epsilon }_{sam.}} \times \frac{{c}_{ref.}}{{c}_{sam.}} \times \frac{{\left({n}_{sam.}\right)}^{2}}{{\left({n}_{ref.}\right)}^{2}} \times \frac{{F}_{sam.}}{{F}_{ref.}} \tag{5}$$

where Φ_sam. is the quantum yield of the sample, Φ_ref. is the quantum yield of the reference compound, ε_sam. is the molar extinction coefficient of the sample, ε_ref. is the molar extinction coefficient of the reference compound, c_ref. is the concentration of the reference compound, c_sam. is the concentration of the sample, n_sam. is the refractive index of the sample solution, n_ref. is the refractive index of the reference solution, F_sam. is the fluorescence intensity of the sample solution, and F_ref. is the fluorescence intensity of the reference solution.
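A relative quantum yield of this kind can be computed with a one-line function, sketched here in Python. The sketch assumes the standard relative method, in which the reference quantum yield Φ_ref. multiplies the ratio terms defined above. The function name and the numerical inputs are hypothetical, not measured values.

```python
def relative_quantum_yield(phi_ref, eps_ref, eps_sam, c_ref, c_sam,
                           n_ref, n_sam, f_ref, f_sam):
    """Relative quantum yield of a sample against a reference standard.

    eps: molar extinction coefficients, c: concentrations,
    n: refractive indices, f: integrated fluorescence intensities.
    """
    return (phi_ref
            * (eps_ref / eps_sam)
            * (c_ref / c_sam)
            * (n_sam ** 2 / n_ref ** 2)
            * (f_sam / f_ref))

# Hypothetical inputs: identical absorbance terms and solvents, with the
# sample emitting half the integrated intensity of the reference.
phi = relative_quantum_yield(
    phi_ref=0.55, eps_ref=1.0e4, eps_sam=1.0e4,
    c_ref=1.0, c_sam=1.0, n_ref=1.33, n_sam=1.33,
    f_ref=1000.0, f_sam=500.0,
)
print(phi)
```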

Competing interests

K.R.K. and H.S. own shares of xFOREST Therapeutics Co., Ltd.

Supplementary information

The online version contains supplementary material available

Author Contributions

K.O. and K.R.K. designed the experiments. K.O., H.S. and F.N. mentored the research. R.N., H.M., K. Ojima, and S.I. synthesized compounds. R.N., K.R.K., E.M., and M.O. performed analytical experiments. R.N., K.O., K.R.K., and E.M. analysed the results. R.N., K.O. and K.R.K. mainly wrote the manuscript. All authors discussed the results and provided feedback on the study and manuscript.

Acknowledgments

We thank Kelvin Hui (Kyoto University) for critical reading of the manuscript. This work was supported in part by Grant-in-Aid for Scientific Research on Innovative Areas “Middle Molecular Strategy” (No. JP15H05838 to F.N.), “ncRNA neotaxonomy” (No. JP17H05601 to H.S.), “Frontier Research on Chemical Communications” (No. JP20H04762 to K.O.), Transformative Research Areas (A) “Biophysical Chemistry for Material Symbiosis” (No. JP23H04051 to K.O.), Scientific Research (B) (JP19H02845 to K.O. and JP20H02855 and JP23H02076 to F.N.), Specially Promoted Research (No. JP20H05626 to H.S.), Challenging Exploratory Research (No. JP19K22387 to H.S. and No. JP21K19038 to F.N.) from the Japan Society for the Promotion of Science (JSPS); Japan Science and Technology Agency (JST) FOREST program (No. JPMJFR2002 to K.O.) and SPRING program (No. JPMJSP2114 to R.N.); the Takeda Science Foundation (K.O.), the Uehara Memorial Foundation (K.O.), the Noguchi Foundation (K.O.), the Tokyo Biochemical Research Foundation (K.O.), the Naito Foundation (K.R.K.), the Mitsubishi Foundation (H.S.), and the research program of “Crossover Alliance to Create the Future with People, Intelligence and Materials” from MEXT, Japan.

References

Warner, K.D., Hajdin, C.E. and Weeks, K.M. Principles for targeting RNA with drug-like small molecules. Nat. Rev. Drug Discov. 17, 547–558 (2018).
Sztuba-Solinska, J., Chavez-Calvillo, G. and Cline, S.E. Unveiling the druggable RNA targets and small molecule therapeutics. Bioorg. Med. Chem. 27, 2149–2165 (2019).
Guan, L. and Disney, M.D. Recent advances in developing small molecules targeting RNA. ACS Chem. Biol. 7, 73–86 (2012).
Bush, J.A., Williams, C.C., Meyer, S.M., Tong, Y., Haniff, H.S., Childs-Disney, J.L. and Disney, M.D. Systematically Studying the Effect of Small Molecules Interacting with RNA in Cellular and Preclinical Models. ACS Chem. Biol. 16, 1111–1127 (2021).
Hargrove, A.E. Small molecule–RNA targeting: starting with the fundamentals. Chem. Commun. 56, 14744–14756 (2020).
Cheung, A.K., Hurley, B., Kerrigan, R., Shu, L., Chin, D.N., Shen, Y., O’Brien, G., Sung, M.J., Hou, Y., Axford, J. et al. Discovery of Small Molecule Splicing Modulators of Survival Motor Neuron-2 (SMN2) for the Treatment of Spinal Muscular Atrophy (SMA). J. Med. Chem. 61, 11021–11036 (2018).
Sturm, S., Günther, A., Jaber, B., Jordan, P., Al Kotbi, N., Parkar, N., Cleary, Y., Frances, N., Bergauer, T., Heinig, K. et al. A phase 1 healthy male volunteer single escalating dose study of the pharmacokinetics and pharmacodynamics of risdiplam (RG7916, RO7034067), a SMN2 splicing modifier. Br. J. Clin. Pharmacol. 85, 181–193 (2019).
Bose, D., Jayaraj, G., Suryawanshi, H., Agarwala, P., Pore, S.K., Banerjee, R. and Maiti, S. The tuberculosis drug streptomycin as a potential cancer therapeutic: inhibition of miR-21 function by directly targeting its precursor. Angew. Chem. Int. Ed. 51, 1019–1023 (2012).
Vo, D.D., Staedel, C., Zehnacker, L., Benhida, R., Darfeuille, F. and Duca, M. Targeting the Production of Oncogenic MicroRNAs with Multimodal Synthetic Small Molecules. ACS Chem. Biol. 9, 711–721 (2014).
Velagapudi, S.P., Gallo, S.M. and Disney, M.D. Sequence-based design of bioactive small molecules that target precursor microRNAs. Nat. Chem. Biol. 10, 291–297 (2014).
Velagapudi, S.P., Cameron, M.D., Haga, C.L., Rosenberg, L.H., Lafitte, M., Duckett, D.R., Phinney, D.G. and Disney, M.D. Design of a small molecule against an oncogenic noncoding RNA. Proc. Natl. Acad. Sci. U. S. A. 113, 5898–5903 (2016).
Liu, X., Haniff, H.S., Childs-Disney, J.L., Shuster, A., Aikawa, H., Adibekian, A. and Disney, M.D. Targeted Degradation of the Oncogenic MicroRNA 17–92 Cluster by Structure-Targeting Ligands. J. Am. Chem. Soc. 142, 6970–6982 (2020).
Yan, H., Bhattarai, U., Guo, Z.-F. and Liang, F.-S. Regulating miRNA-21 Biogenesis By Bifunctional Small Molecules. J. Am. Chem. Soc. 139, 4987–4990 (2017).
Wong, C.-H., Nguyen, L., Peh, J., Luu, L.M., Sanchez, J.S., Richardson, S.L., Tuccinardi, T., Tsoi, H., Chan, W.Y., Chan, H.Y.E. et al. Targeting Toxic RNAs that Cause Myotonic Dystrophy Type 1 (DM1) with a Bisamidinium Inhibitor. J. Am. Chem. Soc. 136, 6355–6361 (2014).
Rzuczek, S.G., Colgan, L.A., Nakai, Y., Cameron, M.D., Furling, D., Yasuda, R. and Disney, M.D. Precise small-molecule recognition of a toxic CUG RNA repeat expansion. Nat. Chem. Biol. 13, 188–193 (2017).
Reddy, K., Jenquin, J.R., McConnell, O.L., Cleary, J.D., Richardson, J.I., Pinto, B.S., Haerle, M.C., Delgado, E., Planco, L., Nakamori, M. et al. A CTG repeat-selective chemical screen identifies microtubule inhibitors as selective modulators of toxic CUG RNA levels. Proc. Natl. Acad. Sci. U. S. A. 116, 20991–21000 (2019).
Lee, J., Bai, Y., Chembazhi, U.V., Peng, S., Yum, K., Luu, L.M., Hagler, L.D., Serrano, J.F., Chan, H.Y.E., Kalsotra, A. et al. Intrinsically cell-penetrating multivalent and multitargeting ligands for myotonic dystrophy type 1. Proc. Natl. Acad. Sci. U. S. A. 116, 8709–8714 (2019).
Shibata, T., Nagano, K., Ueyama, M., Ninomiya, K., Hirose, T., Nagai, Y., Ishikawa, K., Kawai, G. and Nakatani, K. Small molecule targeting r(UGGAA)n disrupts RNA foci and alleviates disease phenotype in Drosophila model. Nat. Commun. 12, 236 (2021).
Howe, J.A., Wang, H., Fischmann, T.O., Balibar, C.J., Xiao, L., Galgoci, A.M., Malinverni, J.C., Mayhood, T., Villafania, A., Nahvi, A. et al. Selective small-molecule inhibition of an RNA structural element. Nature 526, 672–677 (2015).
Fedorova, O., Jagdmann, G.E., Adams, R.L., Yuan, L., Van Zandt, M.C. and Pyle, A.M. Small molecules that target group II introns are potent antifungal agents. Nat. Chem. Biol. 14, 1073–1078 (2018).
Rangan, R., Watkins, A.M., Chacon, J., Kretsch, R., Kladwang, W., Zheludev, I.N., Townley, J., Rynge, M., Thain, G. and Das, R. De novo 3D models of SARS-CoV-2 RNA elements from consensus experimental secondary structures. Nucleic Acids Res. 49, 3092–3108 (2021).
Velagapudi, S.P., Luo, Y., Tran, T., Haniff, H.S., Nakai, Y., Fallahi, M., Martinez, G.J., Childs-Disney, J.L. and Disney, M.D. Defining RNA–Small Molecule Affinity Landscapes Enables Design of a Small Molecule Inhibitor of an Oncogenic Noncoding RNA. ACS Central Science 3, 205–216 (2017).
Ursu, A., Childs-Disney, J.L., Angelbello, A.J., Costales, M.G., Meyer, S.M. and Disney, M.D. Gini Coefficients as a Single Value Metric to Define Chemical Probe Selectivity. ACS Chem. Biol. (2020).
Mukherjee, H., Blain, J.C., Vandivier, L.E., Chin, D.N., Friedman, J.E., Liu, F., Maillet, A., Fang, C., Kaplan, J.B., Li, J. et al. PEARL-seq: A Photoaffinity Platform for the Analysis of Small Molecule-RNA Interactions. ACS Chem. Biol. 15, 2374–2381 (2020).
Disney, M.D. Targeting RNA with Small Molecules To Capture Opportunities at the Intersection of Chemistry, Biology, and Medicine. J. Am. Chem. Soc. 141, 6776–6790 (2019).
Endoh, T., Ohyama, T. and Sugimoto, N. RNA-Capturing Microsphere Particles (R-CAMPs) for Optimization of Functional Aptamers. Small 15, 1805062 (2019).
Satpathi, S., Endoh, T., Podbevšek, P., Plavec, J. and Sugimoto, N. Transcriptome screening followed by integrated physicochemical and structural analyses for investigating RNA-mediated berberine activity. Nucleic Acids Res. 49, 8449–8461 (2021).
Kwok, C.K., Marsico, G., Sahakyan, A.B., Chambers, V.S. and Balasubramanian, S. rG4-seq reveals widespread formation of G-quadruplex structures in the human transcriptome. Nat. Methods 13, 841–844 (2016).
Murat, P., Guilbaud, G. and Sale, J.E. DNA polymerase stalling at structured DNA constrains the expansion of short tandem repeats. Genome Biol. 21, 209 (2020).
Komatsu, K.R., Taya, T., Matsumoto, S., Miyashita, E., Kashida, S. and Saito, H. RNA structure-wide discovery of functional interactions with multiplexed RNA motif library. Nat. Commun. 11, 6275 (2020).
Lin, K.-Y. and Matteucci, M.D. A Cytosine Analogue Capable of Clamp-Like Binding to a Guanine in Helical Nucleic Acids. J. Am. Chem. Soc. 120, 8531–8532 (1998).
Murase, H. and Nagatsugi, F. Development of the binding molecules for the RNA higher-order structures based on the guanine-recognition by the G-clamp. Bioorg. Med. Chem. Lett. 29, 1320–1324 (2019).
Murase, H., Nagatsugi, F. and Sasaki, S. Development of a selective ligand for G–G mismatches of CGG repeat RNA inducing the RNA structural conversion from the G-quadruplex into a hairpin-like structure. Org. Biomol. Chem. 20, 3375–3381 (2022).
Krishnamurthy, M., Schirle, N.T. and Beal, P.A. Screening helix-threading peptides for RNA binding using a thiazole orange displacement assay. Bioorg. Med. Chem. 16, 8914–8921 (2008).
Asare-Okai, P.N. and Chow, C.S. A modified fluorescent intercalator displacement assay for RNA ligand discovery. Anal. Biochem. 408, 269–276 (2011).
Tran, T. and Disney, M.D. Identifying the preferred RNA motifs and chemotypes that interact by probing millions of combinations. Nat. Commun. 3, 1125 (2012).
Sato, Y., Yajima, S., Taguchi, A., Baba, K., Nakagomi, M., Aiba, Y. and Nishizawa, S. Trimethine cyanine dyes as deep-red fluorescent indicators with high selectivity to the internal loop of the bacterial A-site RNA. Chem. Commun. 55, 3183–3186 (2019).
Sato, Y., Aiba, Y., Yajima, S., Tanabe, T., Higuchi, K. and Nishizawa, S. Strong Binding and Off–On Signaling Functions of Deep-Red Fluorescent TO-PRO-3 for Influenza A Virus RNA Promoter Region. ChemBioChem 20, 2752–2756 (2019).
Zhang, J., Umemoto, S. and Nakatani, K. Fluorescent Indicator Displacement Assay for Ligand–RNA Interactions. J. Am. Chem. Soc. 132, 3660–3661 (2010).
Murata, A., Harada, Y., Fukuzumi, T. and Nakatani, K. Fluorescent indicator displacement assay of ligands targeting 10 microRNA precursors. Bioorg. Med. Chem. 21, 7101–7106 (2013).
Fukuzumi, T., Murata, A., Aikawa, H., Harada, Y. and Nakatani, K. Exploratory Study on the RNA-Binding Structural Motifs by Library Screening Targeting pre-miRNA-29a. Chem. Eur. J. 21, 16859–16867 (2015).
Wicks, S.L. and Hargrove, A.E. Fluorescent indicator displacement assays to identify and characterize small molecule interactions with RNA. Methods 167, 3–14 (2019).
del Villar-Guerra, R., Gray, R.D., Trent, J.O. and Chaires, J.B. A rapid fluorescent indicator displacement assay and principal component/cluster data analysis for determination of ligand–nucleic acid structural selectivity. Nucleic Acids Res. 46, e41 (2018).
Das, B., Murata, A. and Nakatani, K. A small-molecule fluorescence probe ANP77 for sensing RNA internal loop of C, U and A/CC motifs and their binding molecules. Nucleic Acids Res. 49, 8462–8470 (2021).
Shibata, T., Matsumoto, Y., Iihara, A., Yamada, K., Ochiai, H., Saito, R., Kusaka, S. and Kume, T. Fluorescent indicator displacement assay for the discovery of UGGAA repeat-targeted small molecules. Chem. Commun. 59, 5071–5074 (2023).
Largy, E., Hamon, F. and Teulade-Fichou, M.-P. Development of a high-throughput G4-FID assay for screening and evaluation of small molecules binding quadruplex nucleic acid structures. Anal. Bioanal. Chem. 400, 3419–3427 (2011).
Ikeda, S., Kubota, T., Yuki, M. and Okamoto, A. Exciton-Controlled Hybridization-Sensitive Fluorescent Probes: Multicolor Detection of Nucleic Acids. Angew. Chem. Int. Ed. 48, 6480–6484 (2009).
Ikeda, S., Yanagisawa, H., Nakamura, A., Wang, D.O., Yuki, M. and Okamoto, A. Hybridization-sensitive fluorescence control in the near-infrared wavelength range. Org. Biomol. Chem. 9, 4199–4204 (2011).
Agard, N.J., Prescher, J.A. and Bertozzi, C.R. A Strain-Promoted [3 + 2] Azide–Alkyne Cycloaddition for Covalent Modification of Biomolecules in Living Systems. J. Am. Chem. Soc. 126, 15046–15047 (2004).
Debets, M.F., van der Doelen, C.W., Rutjes, F.P. and van Delft, F.L. Azide: a unique dipole for metal-free bioorthogonal ligations. ChemBioChem 11, 1168–1184 (2010).
Popenda, M., Szachniuk, M., Antczak, M., Purzycka, K.J., Lukasiak, P., Bartol, N., Blazewicz, J. and Adamiak, R.W. Automated 3D structure composition for large RNAs. Nucleic Acids Res. 40, e112 (2012).
Biesiada, M., Pachulska-Wieczorek, K., Adamiak, R.W. and Purzycka, K.J. RNAComposer and RNA 3D structure prediction for nanotechnology. Methods 103, 120–127 (2016).
Mukohyama, J., Isobe, T., Hu, Q., Hayashi, T., Watanabe, T., Maeda, M., Yanagi, H., Qian, X., Yamashita, K., Minami, H. et al. miR-221 Targets QKI to Enhance the Tumorigenic Capacity of Human Colorectal Cancer Stem Cells. Cancer Res. 79, 5151–5158 (2019).
Elyakim, E., Sitbon, E., Faerman, A., Tabak, S., Montia, E., Belanis, L., Dov, A., Marcusson, E.G., Bennett, C.F., Chajut, A. et al. hsa-miR-191 is a candidate oncogene target for hepatocellular carcinoma therapy. Cancer Res. 70, 8077–8087 (2010).
Si, M.L., Zhu, S., Wu, H., Lu, Z., Wu, F. and Mo, Y.Y. miR-21-mediated tumor growth. Oncogene 26, 2799–2803 (2007).
Mondal, S., Jana, J., Sengupta, P., Jana, S. and Chatterjee, S. Myricetin arrests human telomeric G-quadruplex structure: a new mechanistic approach as an anticancer agent. Mol. Biosyst. 12, 2506–2518 (2016).
Das, A., Majumder, D. and Saha, C. Correlation of binding efficacies of DNA to flavonoids and their induced cellular damage. Journal of Photochemistry and Photobiology B: Biology 170, 256–262 (2017).
Khan, E., Tawani, A., Mishra, S.K., Verma, A.K., Upadhyay, A., Kumar, M., Sandhir, R., Mishra, A. and Kumar, A. Myricetin Reduces Toxic Level of CAG Repeats RNA in Huntington’s Disease (HD) and Spino Cerebellar Ataxia (SCAs). ACS Chem. Biol. 13, 180–188 (2018).
Bai, L.-P., Hagihara, M., Nakatani, K. and Jiang, Z.-H. Recognition of Chelerythrine to Human Telomeric DNA and RNA G-quadruplexes. Sci. Rep. 4, 6767 (2014).
Basu, P. and Suresh Kumar, G. Small molecule–RNA recognition: Binding of the benzophenanthridine alkaloids sanguinarine and chelerythrine to single stranded polyribonucleotides. Journal of Photochemistry and Photobiology B: Biology 174, 173–181 (2017).
Chen, H., Sun, H., Zhang, W., Zhang, Q., Ma, J., Li, Q., Guo, X., Xu, K. and Tang, Y. Chelerythrine as a fluorescent light-up ligand for an i-motif DNA structure. New J. Chem. 45, 28–31 (2021).
Dostál, J., Táborská, E., Slavík, J., Potáček, M. and de Hoffmann, E. Structure of Chelerythrine Base. J. Nat. Prod. 58, 723–729 (1995).
Pradhan, A.B., Bhuiya, S., Haque, L., Tiwari, R. and Das, S. Micelle assisted structural conversion with fluorescence modulation of benzophenanthridine alkaloids. Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy 170, 89–96 (2017).
Gaillard, P., Jeanclaude-Etter, I., Ardissone, V., Arkinstall, S., Cambet, Y., Camps, M., Chabert, C., Church, D., Cirillo, R., Gretener, D. et al. Design and Synthesis of the First Generation of Novel Potent, Selective, and in Vivo Active (Benzothiazol-2-yl)acetonitrile Inhibitors of the c-Jun N-Terminal Kinase. J. Med. Chem. 48, 4596–4607 (2005).
Simon, L.M., Morandi, E., Luganini, A., Gribaudo, G., Martinez-Sobrido, L., Turner, D.H., Oliviero, S. and Incarnato, D. In vivo analysis of influenza A mRNA secondary structures identifies critical regulatory motifs. Nucleic Acids Res. 47, 7003–7017 (2019).
Manfredonia, I., Nithin, C., Ponce-Salvatierra, A., Ghosh, P., Wirecki, T.K., Marinus, T., Ogando, N.S., Snijder, E.J., van Hemert, M.J., Bujnicki, J.M. et al. Genome-wide mapping of SARS-CoV-2 RNA structures identifies therapeutically-relevant elements. Nucleic Acids Res. 48, 12436–12452 (2020).
Rangan, R., Zheludev, I.N., Hagey, R.J., Pham, E.A., Wayment-Steele, H.K., Glenn, J.S. and Das, R. RNA genome conservation and secondary structure in SARS-CoV-2 and SARS-related viruses: a first look. RNA 26, 937–959 (2020).
Stootman, F.H., Fisher, D.M., Rodger, A. and Aldrich-Wright, J.R. Improved curve fitting procedures to determine equilibrium binding constants. Analyst 131, 1145–1151 (2006).
Kerpedjiev, P., Hammer, S. and Hofacker, I.L. Forna (force-directed RNA): Simple and effective online RNA secondary structure diagrams. Bioinformatics (Oxford, England) 31, 3377–3379 (2015).
Lorenz, R., Bernhart, S.H., Höner Zu Siederdissen, C., Tafer, H., Flamm, C., Stadler, P.F. and Hofacker, I.L. ViennaRNA Package 2.0. Algorithms Mol. Biol. 6, 26 (2011).
Dominguez, D., Freese, P., Alexis, M.S., Su, A., Hochman, M., Palden, T., Bazile, C., Lambert, N.J., Van Nostrand, E.L., Pratt, G.A. et al. Sequence, Structure, and Context Preferences of Human RNA Binding Proteins. Mol. Cell 70, 854–867.e859 (2018).

There is NO Competing Interest.
