Plant material
Novel endophyte (AR584) positive (E+) and endophyte-free (E-) tall fescue genotypes of cv. Texoma MaxQ II (referred as “Texoma”) (Pennington, USA, https://www.pennington.com/) were developed at the Noble Research Institute, Ardmore, Oklahoma, USA. Texoma MaxQ II is a commercial cultivar freely available for cultivation in USA. The E+ and E- plants were transplanted in the field for seed production via open pollination among them. Since the endophyte does not transmit through pollen, seeds were harvested from the E+ and E- mother plants separately. The seeds obtained from the E+ and E- Texoma genotypes were sown in rows in the experimental farm located at Dupy (Latitude: 34°17'12.106"N, Longitude: 96°59'36.608"W), Gene Autry, Oklahoma. Before collecting samples for transcriptome analyses, we collected S tissues separately in ice-cold 15 mL falcon tubes from 15 genotypes of each E+ and E- from three random rows of the plot to test their endophyte status. In an earlier study, Takach et al.41 confirmed that endophyte is residing in the S, not in L, in Texoma tall fescue. The S samples were freeze-dried and ground separately in the presence of liquid N2 using mortar and pestle. Genomic DNA was extracted using MagAttract 96 DNA Plant Core Kit (QIAGEN Cat. No. 67163, Hilden, Germany) according to the manufacturer’s recommendation. PCR amplifications were performed using primers described in42 to confirm endophyte status of the Texoma tall fescue genotypes.
After confirming E+ and E- status through PCR amplification, equal volume of S and L tissues were collected on December 10, 2018 under freezing/cold temperatures from the field at: morning between 7:40 (-3°C) to 9:00 am (0.5°C), afternoon between 1:15 (11°C) to 2:15 pm (12°C), and evening between 4:45 (12°C) to 5:45 pm (10°C) (Supplementary Figure S6). The 12 samples with three replicates that were collected from the E+ and E- tall fescue at morning, afternoon, and evening were referred as E+MS (endophyte positive, morning, pseudostem), E+ML (endophyte positive, morning, leaf blade), E+NS (endophyte positive, afternoon, pseudostem), E+NL (endophyte positive, afternoon, leaf blade), E+ES (endophyte positive, evening, pseudostem), E+EL (endophyte positive, evening, leaf blade), E-MS (endophyte-free, morning, pseudostem), E-ML (endophyte-free, morning, leaf blade), E-NS (endophyte-free, afternoon, pseudostem), E-NL (endophyte-free, afternoon, leaf blade), E-ES (endophyte-free, evening, pseudostem), and E-EL (endophyte-free, evening, leaf blade). In each sample, L/S tissue from 10 genotypes was pooled in a 15 mL tube and immediately frozen in liquid N2. After arrival to the laboratory, the samples were stored at -80°C until processing.
RNA extraction and sequencing
The total RNA of each of 36 samples (Supplementary Figure S1) was isolated from approximately 100 mg ground tissues using a SpectrumTM Plant Total RNA Kit (Sigma, Cat. No. STRN250, St. Louis, USA) according to the manufacturer recommendation. The RNA quality was measured in Agilent 2100 Bioanalyzer (Agilent, Santa Clara, USA) using Agilent RNA 6000 Nano Kit (Agilent, 5067-1511), and RNA was quantified using Qubit® RNA BR (Broad-Range) Assay Kit (Life Technologies, Cat. No. Q10211, Carlsbad, USA). Then the RNA samples were treated with TURBO DNA-free Kit (Invitrogen, Cat. No. AM1907, Carlsbad, USA) following their protocol (https://assets.fishersci.com/TFS-Assets/LSG/manuals/1907M_turbodnafree_UG.pdf). RNA samples were then cleaned using the RNeasy MinElute Cleanup Kit (Qiagen, Cat. No. 74204, Hilden, Germany) according to the manufacturer protocol (https://www.qiagen.com).
RNA-seq libraries were prepared using TruSeq Stranded mRNA Sample Preparation Kits (Illumina, Cat. No. 20020594). Briefly, mRNA was purified from one microgram of total RNA, fragmented, and converted to double-stranded DNA for sequencing. Individual libraries were uniquely indexed using TruSeq RNA CD Indexes (Illumina, Cat. No. 20019792), and pooled in equimolar ratio. The pooled libraries were sequenced on an Illumina NovaSeq 6000 150PE Sequencing system.
Quality assessment and assembly of the RNA-seq reads
The raw reads of 36 samples (Figure 1) were quality trimmed to remove any low quality bases and primer/adapter sequences before performing the assembly using the Trimmomatic (v. 0.36) using default settings43. Reads less than 30 bases long after trimming were discarded, along with their mate pair. Endophyte-derived reads were identified by mapping the trimmed reads to the Epichloë coenophiala transcriptome44 (http://csbio-l.csr.uky.edu/ec/) and successfully mapped reads were excluded from further analysis. The trimmed and filtered reads from each sample were independently de novo assembled using the software Trinity (v. 2.8.5) with default parameters45. These assemblies were then combined by randomly selecting one as a starting transcriptome and then iteratively aligning the transcriptome with each assembly, identifying that assembly's novel transcripts, and adding those transcripts to the combined transcriptome. Each sample was then mapped to the combined transcriptome using HISAT2 (v. 2.0.5) (https://daehwankimlab.github.io/hisat2/) with 24 threads and the default mapping parameters46. The expressed transcripts in each sample were quantified using the StringTie (v. 1.2.4) with the default assembly parameters to produce more complete and accurate reconstructions of transcripts and better estimates of their expression levels47.
Identification of differentially expressed genes
To identify genes which expressed under different temperature condition during morning, afternoon, and evening time with or without the presence of endophyte, pairwise differential gene expression testing was performed using DESeq2 with default parameters setting48. DESeq2 method was used for differential read counts per gene in RNA-seq, using shrinkage estimation for dispersions and fold changes to improve stability of estimates across experimental conditions. A log2 FC ≤-5 and ≥5 and adjusted p-value ≤ 0.05 were used to determine the significant differences in differential gene expression between two samples. The DEGs with log2 FC – and + sign indicates downregulated and upregulated genes, respectively.
Hierarchical clustering of differentially expressed genes
Hierarchical clustering analysis of DEGs from the six comparisons was constructed using the function heatmap.2 in the R package gplots49 in R Studio.
Visualization of differentially expressed genes
The DEGs that were biologically significant were visualized using the web-based software DiVenn50. The red and blue nodes represent up- and down-regulated genes, respectively. The yellow nodes represent upregulated in one dataset but downregulated in the other dataset.
Identification of orthologous genes using tall fescue transcripts
As annotation of tall fescue genome is not available till to May 30, 2021, the complete and accurate tall fescue transcripts were aligned against the switchgrass non-redundant protein sequences in Phytozome v13 database (https://phytozome.jgi.doe.gov/pz/portal.html) using BLASTX searches to identify best matched switchgrass orthologues. Using switchgrass orthologues, we also obtained rice and Arabidopsis orthologues of tall fescue transcripts from the Phytozome database.
Gene Ontology analysis of differentially expressed genes
We performed GO enrichment analysis using orthologue genes of rice (Supplementary Table S2) to identify their involvement in BP, MF, and CC categories. To study the influence of endophyte, the DEGs between E+MS vs. E-MS, E+ML vs. E-ML, E+NS vs. E-NS, E+NL vs. E-NL, E+ES vs. E-ES and E+EL vs. E-EL were used for GO enrichment analysis. The rice orthologues of the tall fescue DEGs were used as input data to perform GO analysis using singular enrichment analysis (SEA) tool against Oryza sativa japonica annotation of the web-based AgriGO v2.051 with modified statistical parameter settings: statistical test method, Fisher; multi_test adjustment method, Yekutieli (FDR under dependency); significance level, 0.01; and minimum number of mapping entries,10. MSU7.0 gene ID (TIGR) of rice orthologues was used as reference during SEA analysis.
KEGG pathway enrichment analysis
The KEGG pathway enrichment analysis was performed using KOBAS 3.052 on the basis of Fisher’s exact test with Benjamini and Hochberg (1995) FDR corrected p-value <0.05. The top most significant pathways based on FDR corrected p-values in all six comparisons were presented.
Validation of RNA-seq by quantitative real time PCR (qRT-PCR)
Eight DEGs were selected for validation using qRT-PCR. Total RNA treated with TURBO DNA-free Kit (Invitrogen, Cat. No. AM1907, Carlsbad, USA) was used to synthesize the first-strand cDNA using SuperScript III Reverse Transcriptase (RT) kit (Invitrogen, Carlsbad, CA, USA, Cat. no.: 18080044) following the manufacturer protocol (https://www.thermofisher.com). Briefly, for each RNA sample, the following components was combined in a PCR tube on ice to a volume of 12 µL containing 5 µL DNase-free RNA (200 ng/µL), 1 µL 50 mM oligo (dT)20, 1 µL 10 mM dNTPs and 5 µL RNase/DNase-free water. The reaction mixtures were incubated at 65°C for 5 min and then placed on ice for 2 min. The first-strand cDNA synthesis master mix was prepared on ice by adding 4 µL 5x First Strand buffer, 1 µL 0.1 M DTT, 1 µL RNaseOUT Recombinant RNase Inhibitor (40 units/μL) and 1 µL SuperScript III RT (200 units/μL). The first-strand cDNA synthesis master mix was mixed properly by gentle vortex and was added into the pre-incubated RNA and oligo tube. The mixture was mixed by pipetting up and down and incubated at 50°C for 60 min. The reaction was terminated at 70°C for 15 min and cooled on ice. The cDNA was stored at -20°C.
qRT-PCR reactions were prepared in an optical 384-well plate in a volume of 10 µL containing 2 µL of the forward and reverse primer (1 µM/µL), 5 µL of 2x Sigma KiCqStart SYBR Green qPCR Ready-Mix (Cat no.: KCQS01), 1 µL molecular biology grade water, and 2 µL cDNA (1:20). qRT-PCR amplifications were performed on QuantStudio 7 Flex Real-Time PCR system (Thermo Fisher Scientific, Singapore) using a protocol of 2-step PCR cycle (an initial denaturation of cDNA at 95°C for 3 min, followed by 40 cycles of denaturation at 95°C for 15 sec and annealing at 60°C for 45 sec) and a 3-step of melting curve analysis (95°C for 15 sec, 60°C for 1 min and 95°C for 15 sec). Experiments were performed with three technical replicates of each S tissues of E+ and E- tall fescue collected under freezing temperature in the morning. Gene expression was quantified using the 2^-(ΔΔCT) method53. The tall fescue Actin gene was used as an internal reference gene. Primers used to amplify 53-63 bp of the genes were designed using Primer Express software (v3.0.1) (Thermo Fisher Scientific) (Supplementary Table S5) and synthesized by Sigma-Aldrich, MO, USA.