Soybean is a short-day crop and the timing of transition from vegetative to reproductive period is regulated by environmental cues. We explored the genetic basis for these developmental phases using an association approach of cultivated germplasm grown at different environments. It had been reported causal genes playing different roles on DPF (before flowering) and DFM (after flowering) in different planting seasons (Yang et al. 1994). Thus, we picked two locations (Beijing and Sanya) with a latitudinal difference of 21.97 degrees and two planting seasons (Spring and Summer) to form three diverse field environments (BJSp, BJSu, and SY). The major soybean loci (E1-8, and J) were reported to be related to both time of flowering and maturity (Watanabe et al. 2012) and were photoperiod-sensitive loci (Cober and Morrison 2010). Photoperiod effects on genes have been shown to be dependent on growth period stages, affecting E1 and E4 during V and R, respectively, and E3 during V + R (Wang et al. 2008).
The four traits, V, R, V + R, and R/V had relationship with each other. In BJSp and BJSu, a higher correlation was observed for V and V + R (r = 0.56, 55) as compared to R and V + R (r = 0.37, 0.39), indicating that V had greater impact on V + R than R. However, in Sanya where the average day length is seven hours, the correlation coefficient of R and V + R was large, r = 0.92, while V and V + R was negligible, r = 0.08, indicating that R had greater impact on V + R than V. The negative correlation between V and R (r = -0.47 ~ -0.07) indicating that a shorter V might imply a longer R, which may be beneficial to the accumulation of more dry matter in seeds while V, R, and V + R were all positively related with yield (Orf et al. 1999). SNP Map-6077 for V and R showed opposite effects, indicating an underlying genetic explanation for the negative correlation between V and R.
Several GWAS have been conducted to identify causal loci or genes. Mao et al using 91 cultivars through four photoperiod treatments (SD, LD, SP, and SU) taken at 39°54'N, 116°46' (Beijing China) identified six makers associated with flowering time close to (within 23 kb) or overlapping with other GWAS results (Fang et al. 2017; Hu et al. 2014; Mao et al. 2017; Sonah et al. 2015; Zhang et al. 2015). The first SSR marker was Satt564 identified in Mao’ result and Hu’s GWAS, which used 113 wild accessions planted in summer at Nanjing (32°12ʹN 118°37ʹE), and Nanyang (38°7ʹN 110°34ʹE), China (Hu et al. 2014). And the second SNP marker was ss715619034, 17.50 kb upstream of ss715619036 identified in Zhang’s GWAS, which collected phenotypic data of 309 early mature accessions in Aurora (44°17'N 96°41'W), Brookings (44°22'N, 96°47'W) and Watertown (44°54'N, 97°7'W), United States (Zhang et al. 2015). The other four SNP markers, Chr02_12316110, Chr05_40012587, Chr11_31718686, and Chr15_13133351, were 18.33, 22.11, 18.86, and 16.37 kb far from ss715581063, ss715591933, ss715610231, and ss715620540 that discovered in Fang’s research (Fang et al. 2017; Mao et al. 2017). Fang planted 421 landraces and 388 cultivars in six environments, including Beijing, (40°22ʹN, 116°23ʹE), Mudanjiang (44°58ʹN, 129°60ʹE), Zhoukou (33°62ʹN, 114°65ʹE), China, over three years. In total, there were six markers identified between Mao with Hu (one), with Zhang (one), and with Fang (four). The highly consistency in Mao and Fang’s GWAS could be caused by the similar photoperiodic environments.
To better understand the adaptation of soybean to more extreme latitudinal differences, we used GWAS to validate and find genes or loci during the growth period. We selected 277 accessions including 135 from Chinese mini core collections (Qiu et al. 2009; Qiu et al. 2013) and planted them in three locations with latitudes from 18.25 to 40.22°N. Three SNPs, Map-6077, Map-1899, and Map-6383, from our study were close to previously reported loci. Map-6077 related with V, R, and V + R in our study was 1.23 kb upstream of Chr05_41854422 associated with V + R (Fang et al. 2017) and 140.77 kb downstream of ss715599242 (Gm05_38636402, Glyma1.1) (Fang et al. 2017; Mao et al. 2017). Map-1899 and Map-6383, associated with V and R individually, were 27.87, and 1.58 kb downstream of Chr10_44710944 and Chr10_45025153 from Fang et al. (Fang et al. 2017). The reminding 26 SNP loci were not found in other reported GWAS studies.
The pleiotropy and trait correlations in this study are different growth stages, different from the general pleiotropy, such as the correlation between seed oil content and protein content. The last two traits (V + R and R/V) are derived from the first two and their relationships with other traits are determined by the variance and covariance of V and R and the calculation formula. Deciphering the correlations among V, R, V + R, and R/V could facilitate soybean improvement. A positive correlation could simultaneously increase the trait values, and negative correlation raised the trait value for one while decreasing other traits.
Map-6077 exhibited extreme significant difference between phenotypic variations carrying A and G alleles in all three environments (BJSp, BJSu, and SY) for V, R, and V + R (Fig. S2). Improved cultivars had a higher proportion of favorable allele A which had higher R, shorter V and V + R, as compared to landraces with 0.84 to 0.51, implying that the soybean accessions with early flowering (V) and maturity (R) and a comparatively longer reproductive period (R) were selected during improvement (Fig. S2). Map-6077 was a synonymous SNV in GmMFT, expressed at the stage of seed 10DAF and increased at seed 42DAF stage. It has been reported to be a negative regulator of seed germination (Li et al. 2014) and a potential causal candidate gene controlling oil content (Li et al. 2018) in soybean. There was another SNP in GmMFT associated with R8 full maturity as shown by GWAS (Fang et al. 2017). We proposed that GmMFT is a pleiotropic gene and Map-6077 is a pleiotropic SNP associated with V, R, and V + R. The analysis above implied that Map-6077 and − 6502 as the pleiotropic SNP could be used via molecular breeding for new modern cultivars.