Ethics Statement
Permission to survey V. vitis-idaea was obtained from the Ministry of the Environment and Forestry Agency of Japan.
Study species
V. vitis-idaea ranges across the Northern Hemisphere. It is an evergreen dwarf shrub with creeping stems (Ritchie 1955) and stolons (Persson and Gustavsson 2001). It is pollinated by bees (Jacquemart 1993; Persson and Gustavsson 2001) and dispersed by birds, such as Lagopus mutus (Kobayashi and Nakamura 2011).
Study sites and samples
The study sites are located at Mt. Norikura (36º74ʹ N, 137º33 ʹ E, 2,770–2,790 m a.s.l.) in central Japan (Fig. 1a). Mean annual temperature and rainfall were − 1.2 ºC and 2,738 mm, respectively. In 2019, the lowest and highest temperatures were − 14.2 ºC and 12.0 ºC, respectively. Snow generally covers the area until late June.
On the ridge, there were several different-sized isolated shrubs of P. pumila, whose heights were less than 0.4 m. V. vitis-idaea was mainly distributed under the shrubs of P. pumila. In contrast, the plot on the slope was covered with dense P. pumila, with a height of approximately 2.0 m.
We established 11 plots on the ridge and 1 plot on the eastern slope of the mountain (Fig. 1b). At the 11 plots of patchy P. pumila shrubs (patchy plots; PATs), we estimated patch sizes based on the areas of ellipses by measuring the lengths of the major and minor axes of a PAT. We collected the leaves of V. vitis-idaea at every 150-cm grid point in each PAT in August 2020. When we could not collect more than 16 leaves at every 150 cm grid point, we changed the grid size from 20 to 150 cm, depending on the size of the PATs. On the slope, we set a 45 m × 6 m plot of P. pumila vegetation (a mat plot; a MAT). We collected leaves of V. vitis-idaea at every 150 cm grid point in the MAT in August 2021. When there were no leaves of V. vitis-idaea near the grid points, we only recorded them. All leaves were stored with silica gel and at − 30°C until DNA extraction for MIG-seq analysis.
MIG-seq analysis
Total genomic DNA was extracted from 50 mg of leaves using the cetyltrimethylammonium bromide (CTAB) method as described by Murray and Thompson (1980). We performed multiplex inter-simple sequence repeat (ISSR) genotyping by sequence analysis, known as MIG-seq, which is a useful method for detecting genome-wide single-nucleotide polymorphisms (SNPs) (Suyama and Matsuki 2015). Standard experimental conditions have been described by Suyama (2022). First, ISSRs were amplified from 50 ng of DNA using eight pairs of multiplex ISSR primers. The first PCR product from each sample was purified/equalized, and short fragments (ca. <250 bp) were removed using AMPure XP (Beckman Coulter, Brea, CA, USA) and used as a template for the second PCR. The second PCR was performed to add complementary sequences for the oligonucleotides that coat the Illumina sequencing flow cell, annealing sites of the DNA sequencing primers, and indices for the first PCR products. The concentration of each PCR product (library) was measured using a Microchip Electrophoresis System (MultiNA, Shimadzu, Tokyo, Japan) with a DNA-2500 Reagent Kit (Shimadzu). Libraries from each sample, each with a different index, were pooled at equimolar concentrations. Fragments in the size range of 350–800 bp in the purified library were isolated using the magnetic bead method (AM Pure XP). The final concentration was measured using an SYBR Green quantitative PCR assay (Library Quantification Kit; Clontech Laboratories, Mountain View, CA, USA) with primers specific to the Illumina constructs. Libraries (final concentration of 10 pM) were sequenced on an Illumina MiSeq System (Illumina) using the MiSeq Reagent Kit v3 (150 cycles, Illumina).
Low-quality reads and primer sequences were eliminated from the raw data using trimmomatic 0.39 (Bolger et al. 2014). The quality-filtered sequence data were demultiplexed and filtered using Stacks v2.41 software (Catchen et al. 2011; Catchen et al. 2013; Rochette et al. 2019). We used “ustacks” with the following option settings: minimum depth of coverage required to create a stack (m) = 3; maximum distance allowed between stacks (M) = 2; and maximum distance allowed to align secondary reads to primary stacks (N) = 2. We used “cstacks” with the option settings for the number of mismatches allowed between sample loci when building the catalog (n) = 2, followed by “sstacks.” We used “populations” with the following option settings: minimum proportion of individuals required to process a locus across all data (r) = 0.8; restricting data analysis to a single SNP per locus (write_single_snp); the minimum number of populations in a locus (p) = 1; the minimum minor allele frequency required to process a nucleotide site at a locus (min_maf) = 0.01; and the maximum observed heterozygosity required to process a nucleotide site at a locus (max_obs_het) = 0.95.
Genet identification and clonal diversity
We obtained 18,803,564 and 4,964,890 raw reads for the PAT and MAT samples, respectively, including duplications, by MIG-seq. After quality control, a total of 95, 948 PAT and 20,783 MAT reads were used for further analyses. The mean number of highly common reads per sample was 459.0 and 163.6 for PAT and MAT, respectively, suggesting that the data were reliable. Therefore, we estimated the similarity between the samples. We considered ramets belonging to identical genets if they had over 90% of similarity to 779 SNPs (Fig. 2).
After identifying the genets, we estimated the genotypic richness and three measures of clonal diversity. Genotypic richness was estimated according to Dorken and Eckert (2001) as R = (G-1) / (N-1), where G and N are the numbers of genets and ramets, respectively. Clonal diversity was calculated using:
-
Simpson’s index of diversity corrected for sample size (Pielou 1966), D, which represents the probability that two ramets selected at random from N ramets are from different genets: D = 1 – {[Σ ni (ni – 1)] / [N (N – 1)]}, where ni is the number of ramets of each genet.
-
Hʹ values of Shannon–Wiener (Shannon 1949), Hʹ = – Σ (ni / N)*ln (ni / N).
To consider the effect of the survey area of plots on Simpson’s D, we estimated the values at R1.5 area size (radius = 1.5 m, area size = 7.1 m2) using each grid point excluding the grid points at all edges (n = 87) at MAT. In addition, we estimated the values at R3.0 (radius = 3.0 m, area size = 19.6 m2) using central 27 grid points (n = 27).
Statistical Analysis
To investigate the relationship between Simpson’s D and survey area, we used a binomial generalized linear model (GLM). We used a linear model to investigate the relationship between the maximum distance within the same genotype and the number of ramets within the same genotype. The best model was selected based on the AIC. Statistical analyses were performed using R4.0.3 (R Development Core Team 2019).