Plant material
Aeluropus littoralis seeds were collected from Isfahan province in Iran and plants were since cultivated at IPK Gatersleben (Germany) and Sari Agricultural Sciences and Natural Resources University (Iran). A specimen of the analyzed plants was deposited at the herbarium GAT under voucher number 70486. Sterilized seeds plated on full strength MS medium [38] with vitamins, 3% sucrose and 0.7% agar (pH 5.8). The cultures were incubated in germinator at 25 ± 2°C with 16 h light/8 h dark photoperiod at 100 μmol m-2 s-1 photon flux density using cool-white fluorescent light. Two weeks after germination, the seedlings were transferred to hydroponic culture containing Hoagland’s solution [39]. Hoagland’s nutrient solution comprised 3 mM KNO3, 2 mM Ca(NO3), 2.1 mM NH4H2PO4, 0.5 mM MgSO4, 1 μM KCl, 25 μ M H3BO3, 2 μ M MnSO4, 2 μ M ZnSO4, 0.1 μ M CuSO4, 0.1 μ M (NH4)6Mo7O24 and 20 μ M Fe(Na) EDTA, in demineralised H2O buffered with 2 mM 2-(N-morpholino) ethanesulphonic acid, pH 5.5, set-out using KOH. Transferred plants were grown in a phytochamber at approximately 240 μmol m–2s–1 under photoperiodic conditions (16h light, 22°C /8h dark, 18°C) at relative humidity 70%.
For salt stress treatments soil grown plants (pots 12 cm diameter) were continuously watered with 1m NaCl 1 time per week.
Light and transmission electron microscopy
Aeluropus littoralis leaves of three biological replicates of plants grown under controlled conditions and exposed to salt stress were used for comparative histological and ultrastructural analysis. For this, cuttings of a size of 1x2 mm from the central part of fully developed leaves were used for combined conventional and microwave assisted chemical fixation, substitution and resin embedding as defined in the given protocol (Supplemental Table 3). Sectioning, histological staining, light and transmission electron microscopy analysis was performed as described [40].
Chromosome preparation and fluorescence in situ hybridization
Mitotic chromosomes were prepared from root tips pretreated in ice water for 24 h to accumulate synchronized cells at metaphase, fixed in Carnoy’s fixative (ethanol and glacial acetic acid, 3:1 (v/v)) at room temperature for 20 h and kept in 70% ethanol at -20°C for later use. Fixed roots were digested in enzyme mixture (2% cellulose, 2% pectinase, 2% pectolyase in citrate buffer (0.01 M sodium citrate dihydrate and 0.01 M citric acid)) at 37°C for 30-40 min. Cell suspension from root meristems in the Carnoy’s fixative was dropped onto slides on a hot plate at 50°C, slides were further fixed in the fixative for 1 min, air-dried and kept at 4°C.
The Arabidopsis-type telomere probe was labelled with fluorophore ATTO488 using nick translation labelling kit (Jena Bioscience). Fluorescence in situ hybridization was performed as described in [41] with pretreatment for 10 min in 45% acetic acid at room temperature, followed by 0.1% pepsine in 0.01N HCl at 37°C. Slides were applied with the hybridization mix (50% (v/v) formamide, 10% (w/v) dextran sulfate, 2× SSC, and 3 ng/µl of telomere probe) and denatured at 75°C for 2 min. After stringency wash in 2× SSC at 57°C for 20 min, chromosomes were counterstained with 4',6-diamidino-2-phenylindoline (DAPI). Images were captured using an epifluorescence microscope BX61 (Olympus) equipped with a cooled CCD camera (Orca ER, Hamamatsu) and pseudocolored with Adobe Photoshop.
Estimation of nuclear genome size
For estimation of nuclear genome size by flow cytometry, approximately 10 mm2 of leaf tissue from individuals of Aeluropus littoralis populations was chopped with a sharp razor blade together with rougthly 5 mm2 of leaf material of Raphanus sativus cv. ‘Voran’ (Genebank Gatersleben accession number: RA 34; 2C = 1.11 pg) as internal reference standard [32] in a Petri dish containing 1 ml Galbraith nuclei isolation buffer [42] supplemented with 1 % PVP- 25, 0.1 % Triton X-100, DNase-free RNase (50 µg/ml). The nuclei suspension was filtered through a 35-µm, mesh cell strainer cap to remove large fragments and stored on ice until measurement. The relative fluorescence intensities of around 7,000 to 10,000 events (nuclei) per sample were measured using a CyFLow Space flow cytometer (Sysmex-Partec, Germany) quipped with a 30 mW green solid state laser (532 nm). The absolute DNA amounts of samples were calculated based on the values of the G1 peak means.
Extraction of genomic DNA
DNA of A. littoralis was extracted according to Dellaporta procedures [43]. The quality and quantity of the extracted DNA were controlled by measuring absorbance at 260/280 nm using a NanoDrop spectrophotometer (Biochrom WPA Biowave II, UK). Further, the purity and integrity of DNA were tested by running on 0.7% agarose gel electrophoresis.
Illumina sequencing and sequence data pre-processing
Library preparation (Illumina TruSeq DNA Sample Prep Kit) and sequencing by synthesis using the Illumina HiSeq2500 device involved standard protocols from the manufacturer (Illumina, Inc., San Diego, CA, USA). The library was quantified by qPCR [44] and sequenced using the rapid run mode (on-board cluster generation, paired-end, 2 x 101 cycles. In total, 125,600,517 Illumina paired end reads were produced having a total output of residues of 42.5 Gb. Prior to the assembly process reads were quality trimmed using the clc_quality_trim module of CLC Genomics Workbench 11 with a minimal cut-off threshold of Q30 and default settings on remaining parameters. 85.6% of reads and 83.77 % of residues passed this initial pre-processing. Subsequently, the quality of the sequence data checked using fastQC (http://www.bioinformatics.babraham.ac.uk/projects/fastqc/). After this quality enrichment a genome coverage of 62-fold was reached.
De novo assembly construction
Our A. littoralis de novo sequence was constructed using CLC assembly cell version 4.3 and the quality trimmed WGS data. The de novo assembly pipeline was applied with automatic detection of best parameters by CLC assembly cell. In accordance with good practice, all contigs below a length threshold of 200 bp were removed. For purification of the constructed assembly we checked our constructed contigs for contamination by E. coli using BLAST+[45]. As parameter settings we used a sequence identity of 60% and a word size of 28. Critical contigs were fully removed if the BLASTN analysis resulted in a hit with length >500bp. For smaller contigs we reduced the minimal length of a hit 200 bp, while at the same time at least 10% of the length of the contig is identified as E. coli contamination. From the remaining sequences we removed contigs in case a bacterial origin was detected within the BLASTN analysis against the NCBI non-redundant nucleotide database nt. In addition, we filtered for contigs having a length of 500bp. The descriptive statistics of both datasets (200bp and 500bp) are described in Table 1. The list of all contigs is available at https://doi.ipk-gatersleben.de/DOI/ca99c593-ffdd-4d49-8eab-f1c891953776/d5b041b5-b2c1-4696-bc7c-bc5a32a0c7ec/2/1847940088.
Gene prediction and annotation
We used the purified WGS assembly without a threshold on contig sizes to predict gene models. Gene prediction was done with GeMoMa [46] using gene models of Brachypodium distachyon (Brachypodium_distachyon_v3.0, INSDC Assembly GCA_000005505.4, Feb 2018), Oryza sativa (IRGSP-1.0, INSDC Assembly GCA_001433935.1, Oct 2015) and Sorghum bicolor (Sorghum_bicolor_NCBIv3, INSDC Assembly GCA_000003195.3, Apr 2017) downloaded from Ensembl Plants [47]. In total, 15,916 gene models were predicted in 12,130 different contigs. For all detected gene models CDS (FASTA), protein sequence (FASTA) and genomic positions (GFF) are provided. We further investigated these dataset performing a gene annotation with AHRD version 2.0 (https://github.com/groupschoof/AHRD/) using UniProt, trembl and TAIR10 (downloaded January 4th 2016). For 13,921 genes (87.5 %) a functional annotation could be assigned. Complete data of gene models and functional annotation is available for download. The coding sequences of all annotated genes are available at https://doi.ipk-gatersleben.de/DOI/ac423f10-971e-481e-bcab-6ac261e27f5c/15d455e1-da91-4e78-82d8-7c7607cb05b9/2/1847940088 (provisional DOI). DOIs of datasets released in this manuscript were constructed using the e!DAL system [48].
Genome repeat fraction analysis
The repetitive fraction analysis was performed with 89 Mbp of reads of the total genomic DNA (0.26x genome coverage). Sequenced reads, after the quality trimmed above, were grouped with the graph-based clustering algorithm based on sequence similiraty, implemented in the RepeatExplorer pipeline [49]. The paired-end reads clustering was performed with a minimum overlap of 55% and a similarity of 90%. Three independent analyses were performed, using a different dataset of reads of the same sequencing, to confirm the proportions of each cluster within the total genome. Repeat annotation and classification was performed for those clusters with an abundance of at least >0.01%. For basic repeat classification, protein domains were identified using the tool ‘Find RT Domains’ within RepeatExplorer [49]. Searches for sequence similarity, using different databases (RepeatMasker and GenBank) were performed and graph layouts of individual clusters were examined using the SeqGrapheR program [49]. Satellite DNAs were identified based on the TAREAN tool implemented in the pipeline, graph layouts and further examined using DOTTER [50].