Study sites
Soils were collected from two different sites (ARDEC: Colorado State University’s Agricultural Research, Development and Education Center in Fort Collins, CO; and CPCRC: USDA Columbia Plateau Conservation Research Center in Pendleton, OR). At each site, four replicate plots of no-till corn (ARDEC) or no-till annual wheat (CPCRC) were sampled. At ARDEC, the soils are clay loam and CPCRC the soils are Walla Walla silt loams (fine-loamy, mesic Aridic Haplustalls). For each plot, six 1” diameter cores (15 cm deep) were sampled near plant crowns, composited in plastic bags and stored on ice in coolers until transfer to the laboratory. Once in the laboratory, the soils were homogenized by hand, sieved to 4 mm, placed in a plastic bag, and then stored in the freezer (-20°C) until DNA extraction. Prior to freezing, subsamples (~ 5 g) were removed from each sample to measure gravimetric soil water content.
DNA Extraction
DNA was extracted from three replicate 0.25 g soil samples from each plot using the Qiagen DNeasy Powersoil Pro Kit (Qiagen). The extraction process was carried out using a fully automated Qiagen QIAcube robot with a 10-min vortex lysis step. DNA quality was assess using a Nanodrop 1000 (Thermo Scientific) and quantified fluorometrically with the Invitrogen dsDNA HS Assay Kit on a Qubit 2.0 (Life Technologies)
Library Preparation
PCR amplifications were performed on each DNA sample using two different 16S rRNA gene primer pairs. The first primer pair, 341F/806R (Klindworth et al., 2013), targets the V3-V4 region of the 16S gene and was used for both platforms. The second primer pair, 27F/1492R (Lane 1991), targets the full-length 16S rRNA gene and was only used on the ONT MinION platform (Table 1).
Table 1
Summary of platforms and bioinformatics methods compared in this study.
Method | Platform | Adapter / Primer1 | Target | Classifier | Error-correction |
MinION V34 | ONT MinION | 341F: TTTCTGTTGGTGCTGATATTGC CCTACGGGNGGCWGCAG 806R: ACTTGCCTGTCGCTCTATCTTC GGACTACHVGGGTATCTAATCC | V3-V4 | minimap2 | EMU |
MinION Full | ONT MinION | 27F: TTTCTGTTGGTGCTGATATTGC AGRGTTYGATYMTGGCTCAG 1492R: ACTTGCCTGTCGCTCTATCTTC TACCTTGTTACGACTT | Full-length | minimap2 | EMU |
MiSeq V34 | Illumina MiSeq | 341F: TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG CCTACGGGNGGCWGCAG 806R: GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG GGACTACHVGGGTATCTAATCC | V3-V4 | minimap2 | EMU |
MiSeq V34 DADA2 | Illumina MiSeq | 341F: TCGTCGGCAGCGTCAGATGTGTATAAGAGACAG CCTACGGGNGGCWGCAG 806R: GTCTCGTGGGCTCGGAGATGTGTATAAGAGACAG GGACTACHVGGGTATCTAATCC | V3-V4 | minimap2 | DADA2 |
1Adapters are in italics; gene-specific primers are underlined |
ONT MinION PCR Conditions and Library Preparation
Extracted DNA samples were amplified in 60 µL PCR reactions containing 30 µL Phusion HSII (Thermo Scientific, Waltham, MA) master mix, 0.6 µL of each forward and reverse primer (10µM concentration), 21.6 µL molecular grade H2O, and 6 µL soil DNA diluted 1:20 with nuclease-free water. Reactions were held at 98 °C for 30 s, with amplification proceeding for 25 cycles at 98 °C for 15 s, 50 °C for 15 s, and 72 °C for 60 s with a final extension at 72 °C for 5 min. The PCR products (PCR1) were purified using AMPure XP beads (Beckman Coulter, Indianapolis, IN).
Unique barcodes (EXP-PBC096, ONT, Oxford, UK) were added to both ends of the DNA fragments by PCR. These were 50 µL PCR reactions containing 25 µL Phusion HSII master mix, 19 µL H2O, 1 µL of each barcode, and 5 µL PCR1 product diluted 1:10 with nuclease-free water. Reactions were held at 98 °C for 30 s, with amplification proceeding for 15 cycles at 98 °C for 15 s, 62 °C for 15 s, and 72 °C for 60 s; a final extension at 72 °C for 5 min. The barcoded products of this PCR reaction were purified a second time using AMPure XP beads (Beckman Coulter, Brea, CA).
Barcoded amplicons from all samples were pooled and prepared for sequencing using the SQK-LSK109 Ligation Sequencing Kit (ONT, Oxford, UK). The library was loaded on a MinION flow cell FLO-MIN106D-R9 (ONT, Oxford, UK) per manufacturers’ protocol and sequencing was started with a runtime of 48 hours and voltage of -180 V. All libraries included no template (H2O-only) negative controls and a mock community (ZymoBIOMICS Gut Microbiome Standard; Zymo Research, Irvine CA).
MiSeq PCR Conditions and Library Preparation
Extracted DNA was amplified in triplicate, in 20 µL quantitative PCR reactions containing 10 µL Maxima SYBR-green (Thermo Scientific, Waltham, MA), 2 µL of each forward and reverse primer (10uM concentration), 4 µL molecular grade H2O, and 2 µL soil DNA diluted 1:20 with nuclease-free water. Reactions were held at 95°C for 5 min, with amplification proceeding for 28 cycles at 95°C for 40 s, 55°C for 120 s, and 72°C for 60 s; a final extension at 72°C for 7 min. Thermocycling was performed with a Roche 96 Lightcycler (Roche, Indianapolis, IN). Quantities were determined by comparison of the quantification cycle to a standard curve generated by serial dilution of purified Pseudomonas putida KT2440 gDNA. Copy numbers were normalized to the grams of dry soil extracted. The products of the triplicate PCR reactions were pooled and purified using AMPure XP beads.
Nextera XT barcode sequences (Illumina, San Diego, CA) were added to both ends of the DNA fragments by PCR using 50 µL PCR reactions containing 25 µL SYBR-green, 10 µL H2O, 5 µL of each forward and reverse barcode (5µM concentration), and 5 µL of sample PCR1 product. Reactions were held at 95°C for 3 min, with amplification proceeding for 8 cycles at 95°C for 30 s, 55°C for 30 s, and 72°C for 30 s; a final extension at 72°C for 5 min. The barcoded products of this PCR reaction were purified a second time using AMPure XP beads. Barcoded amplicons from all samples were pooled and sequenced on an Illumina MiSeq instrument at Colorado State University using an Illumina MiSeq v3 600-cycle Kit with 25% PhiX spike-in (Illumina).
Bioinformatics and sequence processing
EMU MinION and EMU MiSeq
Sequences generated on the MinION platform were base-called and demultiplexed using Guppy 6.0.1 (Oxford Nanopore Technologies, Oxford, UK). Sequences were filtered based on length (V34: 300–600 bp; Full: 1000–2000 bp) and a minimum q-score of 70 using Filtlong 0.2.1 (Wick, 2017) and Cutadapt 3.2 (Martin, 2011). Chimeras were filtered using vsearch (Rognes et al., 2016), and taxonomy was assigned with minimap2 2.22 (Li, 2018). Error-correcting was done with EMU (Curry et al., 2021) which applies an expectation minimization algorithm to adjust taxonomic assignments using up to 50 sequence alignments per sequence read.
Paired forward and reverse MiSeq reads were join using PEAR (Zhang et al., 2014). Sequences were then filtered based on length (V34: 300–600 bp) and a minimum quality score of 70 using Filtlong 0.2.1 (Wick, 2017) and Cutadapt 3.2 (Martin, 2011). Chimeras were filtered using vsearch UCHIME (Rognes et al., 2016), taxonomy was assigned with minimap2 and error-corrected with EMU (Curry et al., 2021).
DADA2 MiSeq
The MiSeq V34 library was analyzed using DADA2 bioinformatics pipeline (Callahan et al., 2016). Briefly, all primers were removed from demultiplexed raw fastq files using Cutadapt 3.2 (Martin, 2011) and amplicon sequence variants were inferred using the default pipeline in DADA2. Each sequence variant was classified to the EMU reference database using minimap2 2.22 (Li, 2018) and the primary alignment for each sequence was chosen with SAMtools 1.9 (Li et al., 2009) and used for taxonomic assignments.
Data analysis
To calculate the percent similarity of mock community sequences, we used three reference genomes available from Zymobionics for Enterococcus faecalis, Lactobacillus fermentum, and Salmonella enterica (https://www.zymoresearch.com/collections/zymobiomics-microbial-community-standards/products/zymobiomics-gut-microbiome-standard). Each sequence assigned to one of the three genera from all mock communities were BLASTed using the NcbiblastnCommandline command from the Biopython package in python (Cock et al., 2009). Results with no hit found were ignored.
Total library sizes are processing were as follows: MinION Full 1,695,436 total sequence reads with an average of 66,843 reads per sample; MinION V34 2,318,235 total sequence reads with an average of 96,730 reads per sample; and MiSeq V34 2,111,798 total sequence reads with an average of 83,345 reads per sample. Therefore, prior to diversity estimates all samples were rarefied to 50,000 reads. Principal Coordinates Analysis (PCoA) was performed using Bray-Curtis distances on Hellinger transformed data and significant differences between platforms and/or sites were tested using adonis in the vegan package for R (Oksanen et al., 2018). Differential abundances were tested using either the DESeq2 package or Wilcoxon test in the metacodeR package (Foster et al., 2017).