Animal husbandry
Platax fingerlings used in the bacterial challenge were 58 days old (days post-hatching; dph). Fish were obtained from a mass spawning of six females and eight males induced by desalinisation. Broodstock included wild individuals caught in French Polynesia that had been maintained at the Centre Ifremer du Pacifique (CIP) hatchery facility for seven years, under the supervision of the Direction des Ressources Marines. Eggs were randomly distributed into six black circular 210-L fiberglass tanks at 50 eggs.L-1 to give an average density of 30 larvae.L-1. Half of the larvae were then reared in conditions following the standard procedures used in the CIP facility (open water system, normal salinity around 36 psu), referred to as ‘Standard’ rearing conditions [18]. The remaining half of the larvae were reared in conditions supposedly optimal according to previous experiments and referred to as ‘Recirculated’ rearing conditions. Here, they were put in a recirculating system that was desalinated to 24 psu until day 34 whereupon salinity was progressively raised to normal (36 psu). Moreover, commercial clay (Clay Bacter ®) was added daily at a rate of 1g/d/m3 per percent of hourly water renewal from day 1 to day 19, the beginning of the live-prey weaning period. Probiotic Pseudoalteromonas piscicida B1 (local strain), produced in the CIP facilities (see Supplementary Methods), was also added daily to the fish feed (0.5mL/day/tank) and water (0.5 mL/day/tank) at a concentration of 109cfu/mL of bacterial suspension from day 1 to day 57. At the end of the larval phase, at day 20, 700 fingerlings/tank were randomly selected. When these reached an average weight of 1 g, they were sorted to remove the largest and smallest extremes from the batches and then kept at a density of 200 ind./ tank (1g/L). It should be mentioned that around day 40 some of the fish, mainly those in the recirculating system, started to show a decrease in appetite and heavy mucus losses although there was no mortality. After an analysis of the water flora, it was shown that Vibrio harveyi was present but not T. maritimum.
Platax larvae were fed four times a day on live prey (Brachonius sp. and Artemia spp.) before being weaned from day 16 to 23. Fingerlings were then fed on commercial micro pellets ranging from 0.3 to 1 mm for the Micro-Gemma and Gemma ranges (Skretting, Stavanger, Norway) and from 1 to 1.3 mm for Ridley (Le Gouessant, Lamballe, France) according to the standard previously established [18]. Seawater supplied to both systems was pumped from the lagoon, filtered through a 300-µm sand filter and two 25- and 10-µm mesh filters and UV treated (300 mJ/cm²). The recirculating system included a 500-L biological filter to regulate levels of ammonia and nitrite. All tanks were supplied with saltwater held at 28.4 ± 0.3°C at a constant photoperiod (12L:12D) and oxygen saturation was maintained above 60% in the tanks with air distributed via air stones. Water renewal ranged from 36 to 360 L/h and new water input into the recirculating system was of 11 ± 1%. Levels of ammonia and nitrite were monitored once a week by spectrophotometry (HANNA Instruments ®) to assess biofilter performance. Temperature, salinity and dissolved oxygen were measured daily (YSI ®) and uneaten food and faecal material was removed once a day.
a)Bacterial challenge
We used strain TF4 for the experimental infection of 58-dph fingerlings. TFA4 was isolated from the skin of an infected Platax orbicularis in French Polynesia in 2013 and was shown to belong to Tenacibaculum maritimum by whole-genome sequencing, displaying an average nucleotide identity of 99.6% with the reference strain NCIMB 2154T [30]. Strain TFA4 was cultivated in nutrient Zobell medium (4 g L-1 peptone and 1 g L-1 yeast extract Becton, Dickinson and Company, Sparks, MD in filtered and UV-treated seawater) under constant agitation (200 rpm) at 27°C for 48 h. On the infection day, 50 juvenile Platax orbicularis were transferred into 40-L tanks supplied with air and infected by the addition of 10 mL of a bacterial suspension of strain TFA4 to the tank water. Final bacterial concentration in the 40-L tanks, determined by the ‘plate-counting’ method, reached 4.104 CFU.mL-1. Moreover, for each rearing treatment, 16 to 17 fish were randomly selected from each tank to be transferred into one 40-L tank to form a mock-treated group, which we call the control (N = 50 fish) and 10 mL Zobell medium were added. After 2 hours of bathing, the fish were caught with a net, rinsed successively in two 40-L buckets filled with clean filtered UV-treated seawater, before being transferred back into their respective tanks. Twice a day, one third of the water in the tanks was replaced with filtered UV-treated seawater to maintain good water quality. This also made it possible to inactivate the T. maritimum in the sewage by bleach treatment. Dead animals were also collected and recorded at these times. After 115 hours post-infection (hpi), all living infected animals were considered as survivors and the challenge was ended. All the remaining fish were euthanised.
b) Animal sampling
We randomly sampled five individuals per tank at 58 dph (average weight 7.21g ± 0.28 se). Samplings in the experiment then consisted of five individuals per tank at 24 hpi and 96 hpi (Figure 1A). To increase our sampling dataset for controls we included the ‘standard’ control group (see previous section, one tank) together with the ‘recirculated’ (one tank) individuals. Consequently, our design consisted of four groups, namely control24h (N = 10 individuals, replicate tanks), control96h (N = 10 individuals, replicate tanks), infected24h (N = 15 individuals, triplicate tanks) and resistant96h (N = 15 individuals, triplicate tanks). For each sampling, at 24 hpi and 96 hpi, individuals were lethally anaesthetised using a benzocaine bath (150 mg.L-1) and a lateral photograph was taken using a digital fixed camera (Leica Microsystems; Figure 1B and C). Microbiome and host sampling consisted in making gentle fish skin smears with sterile swabs that were directly placed in TRIZOL Reagent (Life Technologies) on ice to prevent RNA degradation. Swabs were disrupted using a mixer mill MM200 (Retsch) for 5 min at a frequency of 30 Hz and stocked at -80°C for later analysis. In parallel, water was also sampled from each tank but was not included in the analysis due to its very low DNA yield.
c) T. maritimum in vitro liquid culture sampling
The TFA4 strain was cultivated in 6 mL Zobell medium under constant agitation (200 rpm) at 27°C for 48 h, following exactly the same procedure and time of incubation as the culture used for bacterial challenge. Five culture replicates were performed. For each replicate, 4 mL at 108 CFU/mL were centrifuged 5 min at 10,000 g at room temperature. Three inox beads and 2 mL TRIZOL (Life technologies) were quickly added to each bacterial pellet and cells were immediately disrupted using a mixer mill MM200 (Retsch) for 5 min at 30 Hz to prevent RNA degradation. RNA was extracted following manufacturer’s instructions, using a high salt precipitation procedure (0.8 M sodium citrate and 1.2 M NaCl per 1 mL of TRIZOL reagent used for the homogenisation) to reduce proteoglycan and polysaccharide contamination. Quantity, integrity and purity of total RNA were validated by both NanoDrop readings (NanoDrop Technologies Inc.) and on a Bioanalyzer 2100 system (Agilent Technologies). DNA contaminants were removed using a DNAse RNase-free kit (Ambion). A total of five RNA samples (1.048 ± 0.019 µg) were further dried in RNA-stable solution (Thermo Fisher Scientific) following manufacturer’s recommendations and shipped at room temperature to McGill sequencing platform services (Montreal, Canada). One library was removed prior to sequencing because it did not meet the minimum quality requirements.
d) Fish mortality and cortisol measurements
Mortality was recorded at 0, 19, 24, 43, 48, 67, 72, 91, 96 and 115 hpi. To estimate log-rank values, we used the non-parametric Kaplan-Meier approach implemented in the survival R package [31]. Differences in survival probability were considered significant when P < 0.05. We assessed stress levels in fish by measuring scale cortisol content [32]. The scales were collected from both sides of each individual, washed and then vortexed three times (2.5 min; 96% isopropanol) to remove external cortisol origination from the mucus. Residual solvent traces were evaporated under nitrogen flux and samples frozen at -80°C. To ensure the scales were dry, they were lyophilised for 12 hours and then ground to a powder using a ball mill (MM400, Retsch GmbH, Germany). Cortisol content was extracted from ~50 mg of dry scale powder by incubation in 1.5 mL methanol (MeOH) on a 30°C rocking shaker for 18 h. After centrifugation at 9500 g for 10 min, the supernatant was evaporated using a rotary evaporator and reconstituted with 0.2 mL of EIA buffer from a Cortisol assay kit (Neogen® Corporation Europe, Ayr, UK). Cortisol concentrations were determined in 50 µL of extracted cortisol using a competitive EIA kit (Neogen® Corporation Europe, Ayr, UK) according to a previously published protocol [33]. After validation of normality and homoscedasticity, differences among groups were tested using a two-way ANOVA followed by Tukey’s HSD post-hoc tests. Differences were considered significant when P < 0.05.
e) RNA and DNA extraction and sequencing
Dual RNAseq. Total RNA was extracted using the same procedure as described above. RNA was then dried in RNA-stable solution (Thermo Fisher Scientific) following manufacturer’s recommendations and shipped at room temperature to McGill sequencing platform services (Montreal, Canada). Ribo-Zero rRNA removal kit (Illumina, San 260 Diego, Ca, USA) was used to prepare rRNA-depleted mRNA libraries that were multiplexed (13–14 samples per lane) and sequenced on a HiSeq4000 100-bp paired-end (PE) sequencing device. Infected individuals 24 hpi were sequenced twice to insure sufficient coverage (Table S1).
Short-read 16S rRNA MiSeq microbiome sequencing. Total DNA was extracted from the same TRIZOL Reagent (Life Technologies) mix described above. DNA quantity/integrity and purity were validated using both a Nanodrop (NanoDrop Technologies Inc.) and a Bioanalyzer 2100 (Agilent Technologies). The V4 region was amplified by PCR using modified 515F/806rb primer constructs (515F: 5’-GTGYCAGCMGCCGCGGTAA-3’; 806rb: 5’- GGACTACNVGGGTWTCTAAT-3’) recommended for microbial survey [34]. Amplicon libraries were multiplexed and sequenced on a single lane of a MiSeq 250bp PE Illumina machine at Genome Québec McGill, Canada. Details of the sequencing statistics are given in Table S2.
Full 16S rRNA Nanopore sequencing. For a broad range amplification of the 16S rRNA gene, DNA was amplified using the 27F/1492R barcoded primer products (27F: 5’- AGAGTTTGATCMTGGCTCAG-3’; 1492R: 5’- TACGGYTACCTTGTTACGACTT-3’). In the PCR experiment, we included eight randomly selected individuals from the infected24h group, two negative PCR controls (clean water) and one positive control (Acinetobacter DNA).
The PCR mixtures (25 μL final volume) contained 10 ng of total DNA template or 10 µL of water, with 0.4 μM final concentration of each primer, 3% DMSO and 1X Phusion Master Mix (Thermo Fisher Scientific, Waltham, MA, USA). PCR amplifications (98°C for 2 min; 30 cycles of 30 s at 98°C, 30 s at 55°C, 1 min at 72°C; and 72°C for 10 min) of all samples were carried out in triplicate in order to smooth out intra-sample variance. Triplicates of PCR products were pooled and purified by 1x AMPure XP beads (Beckmann Coulter Genomics) clean-up. Amplicon lengths were measured on an Agilent Bioanalyzer using the DNA High Sensitivity LabChip kit, then quantified with a Qubit Fluorometer.
An equimolar pool of purified PCR products (except for negative controls) was made and one sequencing library was finally prepared from 100 ng of the pool using the 1D Native barcoding genomic DNA protocol (with EXP-NBD103 and SQK-LSK108) for R7.9 flow cells run (FLO-MAP107) then sequenced on a MinION device. Details of the sequencing statistics are provided in the Supplementary Material.
f) Microbiome community analyses
Microbiome dynamics with the MiSeq short-reads dataset. Raw reads were filtered to remove the Illumina adaptors as well as for quality and length using Trimmomatic v.0.36 [35], with minimum length, trailing and leading quality parameters set at 100 bp, 20 and 20, respectively. Remaining reads were analysed with functions implemented on the QIIME2 platform v2019.10. Briefly, we used the DADA2 algorithm [36] to cluster sequences in amplicon sequence variants (ASVs). The resulting ASVs were mapped against GreenGenes v13.9 99% OTUs database [37]. We explored alpha-diversity [Shannon, Fisher and Faith’s phylogenetic diversity (PD) indexes] and beta-diversity (Bray-Curtis, unweighted and weighted Unifrac distances) using the phyloseq R package [38]. Dissimilarity between samples was assessed by principal coordinates analysis (PCoA). Differences in alpha-diversity were tested using pairwise Wilcoxon rank tests and were considered significant when P < 0.01. Differences in beta-diversity were tested using PERMANOVA (999 permutations) as implemented in the 'adonis' function of the vegan R package [39] and were considered significant when P < 0.01. We also searched for a ‘core’ microbiome of fish skin, considering those ASVs present in all the individuals across all treatments (infected, control and resistant) as belonging to this group. We finally searched for significant differences in specific ASV abundance across groups using Wald tests implemented in the DESeq2 R package [40]. We used the apeglm method for log2FC shrinkage to account for dispersion and variation of effect size across individuals and treatments, respectively [41]. Differences were considered significant when FDR < 0.01 and |FC| > 2.
Microbiome diversity analysis with the Nanopore dataset. Sequences were called during the MinION run with MinKnow software (v. 1.7.14). The demultiplexing and adaptor trimming were done with the porechop tool (https://github.com/rrwick/Porechop) using the option discard_middle. For each barcode, all Nanopore reads were mapped on the GreenGenes database (v.13.5, http://greengenes.lbl.gov) with minimap2 (v2.0-r191) with the ‘map-ont’ pre-set options [42]. All reference sequences of the GreenGenes database covered by more than 0.01% of all reads were kept for the next step. A second round of mapping (using the same parameters) was done on the selected references in order to aggregate reads potentially mis-assigned during the first round of mapping. SAMtools and BCFtools were used to reconstruct consensus sequences for each reference sequence covered by more than 10 Nanopore reads with the following programs and options: mpileup -B -a -Q 0 –u; bcftools call -c --ploidy 1;vcfutils.pl vcf2fastq. Individuals and consensus sequences were blasted (e-value < 10-5) against the NCBI nt database(https://ftp.ncbi.nlm.nih.gov/blast/db; accessed October 2019).
g) Compartment-specific differential expression analyses
Read pre-processing. For each individual, raw reads were filtered using Trimmomatic v0.36 [43], with minimum length 60 bp, trailing 20 and leading 20. Filtered PE reads were mapped against a combined reference including the host’s transcriptome (See Supplementary Material Tables S1 and S3 for details of the transcriptome assembly) and the genomes of Alteromonas mediterranea strain: AltDE1 (Genbank accession: GCA_000310085.1), and Pseudoalteromonas phenolica strain: KCTC 12086 (Genbank accession: GCA_001444405.1), Tenacibaculum maritimum strain: NCIMB 2154T (Genbank accession: GCA_900119795.1), Sphingobium yanoikuyae strain ATCC 51230 (Genbank accession: GCA_000315525.1), Vibrio alginolyticus strain: ATCC 17749 (Genbank accession: GCA_000354175.2,) and Vibrio harveyi strain: ATCC 43516 (Genbank accession: GCA_001558435.2). To prevent multi-mapping biases, we used GSNAP v2017-03-17 [44] with minimum coverage set at 0.9, a maximum of five mismatches allowed, and removal of improperly paired and non-uniquely mapped reads (option ‘concordant_uniq’). Reads with low mapping quality (MAPQ) were removed using SAMtools v1.4.1 [45] with the minimum MAPQ threshold fixed at five. A matrix of raw counts was built using HTSeq-count v0.9.1 [46]. Transcripts from the host and bacterial species origin were then separated into different contingency tables using homemade scripts.
Host transcriptome analysis. Low coverage transcripts with count per million (CPM) < 1 in at least nine individuals were removed, resulting in a total of 22,390 transcripts. Similarly, transcript over-representation was assessed using ‘majSequences.R’ implemented in the SARTools suite [47]. We used distance-based redundant discriminant analysis (db-RDA) to document genetic variation among groups and correlation with group (infected or control), weight and time (24 and 96 hpi) as the explanatory variables. Briefly, we computed Euclidean distances and PCoA using the ‘daisy’ and ‘pcoa’ functions, respectively, implemented in the ape R package [48]. PCo factors (n = 6) were selected based on a broken-stick approach [49,50] and used to produce a db-RDA. Partial db-RDAs were used to assess the factor effect, checking for the other factor variables. We tested the significance of the models and individual factors using 999 permutations. Effects were considered significant when P < 0.01.
Differential expression was assessed using the DESeq2 R package [40], using pairwise comparisons with Wald tests. Logarithmic fold changes (logFC) were shrunk using the ‘apeglm’ method, implemented in the DESeq2 R package [40], to account for dispersion and effect size across individuals and treatments [41]. Differences were considered significant when FDR < 0.01 and FC > 2. Group comparisons included infected24h vs control24h and resistant96h vs control96h. Gene ontology (GO) enrichment was tested using GOAtools v0.6.5 [51] and the go-basic.obo database (release 2017-04-14) using Fisher’s test. Our background list included the ensemble of genes in the host transcriptome. Only GO terms with Bonferroni adjusted P < 0.01 and including at least three differentially expressed genes were considered. Significant GO enriched terms were used for semantic similarity-based clustering in REVIGO (http://revigo.irb.hr/).
Tenacibaculum maritimum gene expression in vitro or during infection. A validation step for searching for transcript over-representation was assessed using ‘majSequences.R’ implemented in the SARTools suite [47], similarly to the fish transcriptome. Most represented sequences were attributed to ssrA-coding genes, but represented less than 8% of the total library. We applied a similar shrinkage method and pairwise comparisons (infected vs in vitro), as for the host comparisons, but used more stringent thresholds as commonly observed in similar studies [8], and considered significant differences when FDR < 0.01 and FC > 4. Gene ontology (GO) enrichment was similar to the methods used for the host.
h) Species-specific weighted co-network gene expression analyses in the host
We built a signed weighted co-expression networks for the host compartment to cluster co-expressed genes and identify putative driver genes using the WGCNA R package [52]. Variation in normalised counts were previously controlled using the ‘vst’ method implemented in the DESeq2 R package [40].
We reduced the expression noise in the dataset by keeping only transcripts with minimum overall variance (> 5%). Briefly, we fixed a soft threshold power of 14 using the scale-free topology criterion to reach a model fit (|R|) of 0.80. The modules were defined using the ‘cutreeDynamic’ function (minimum 30 genes by module and default cutting-height = 0.99) based on the topological overlap matrix, a module eigengene distance threshold of 0.25 was used to merge highly similar modules. For each module, we defined the criteria for module membership (kME, correlation between module eigengene value and gene expression). We looked for significant correlation (Pearson’s correlation; P < 0.001) between modules and physiological data, including cortisol levels (pg.mg-1 in scales), fish weight (g) and treatment (coded ‘1’ for control24h, control96h and resistant96h and ‘2’ for infected24). Gene ontology (GO) enrichment for each module was tested using same protocol and parameters as described above.
i) The genetic bases of fish resistance
We further explored the putative genetic variation between resistant and infected fish by focusing on resistant fish because of their established phenotype, (i.e. survivors with no signs of lesions after bacterial challenge). We followed GATK recommendations for SNP identification based on RNAseq data. Briefly, BAM files were pre-treated using the ‘CleanSam’ function, duplicates were picked out with the ‘MarkDuplicates’ function, and cigar string split with the ‘SplitNCigarReads’ function. All functions were implemented in GATK v4.0.3.0 software [53,54]. Final SNP calling was conducted with Freebayes v1.1.0 (https://github.com/ekg/freebayes) requiring minimum coverage of 15 and minimum mapping quality of 20, forcing ploidy at 2 and removing indels (‘—no-indels’) and complex polymorphisms (‘—no-complex’). The raw VCF file was filtered for minimum allele frequency (‘—min_maf=0.2’), minimum coverage (‘—minDP=20’) and using Vcftools v0.1.14 [55] to allow no missing data. We computed relatedness (‘—relatedness2’) within and among groups with Vcftools v0.1.14 [55]. We further used distance-based redundant discriminant analysis (db-RDA) to document genetic variation among groups and correlation, with cortisol, treatment and weight as the explanatory variables. Briefly, we computed Euclidean distances and PcoA using the ‘daisy’ and ‘pcoa’ functions, respectively, implemented in the ape R package [48]. Pco factors (n = 6) were selected based on a broken-stick approach [49,50] and used to produce a db-RDA. We tested the model significance using 999 permutations, effects were considered significant when P < 0.01.