The genus Citrus L. contains important plant species that extensively contribute to human food and are cultivated in different parts of the world. The sweet orange (Citrus sinensis (L.) Osbeck) is the most important species among those belonging to the Citrus genus, representing about 50% of global Citrus production. Oranges are particularly appreciated for their organoleptic characteristics and the high nutraceutical value of the fruits (high content of antioxidants) (Seminara et al. 2023).
The Citrus genus originated from the Malay Archipelago and Southeast Asia, and the varieties of edible Citrus fruits on the market are generated by hybridizations of ancestral species, natural or artificial mutations, and human selection during domestication (Frydman et al. 2013; Wu et al. 2018; Li et al. 2019).
It has been suggested that Citrus diversified during the late Miocene epoch through a rapid southeast Asian radiation that correlates with a marked weakening of the monsoons. A second radiation enabled by migration across the Wallace line gave rise to the Australian limes in the early Pliocene epoch. Further identification and analyses of hybrids and admixed genomes provide insights into the genealogy of major commercial cultivars of Citrus. Among sweet oranges, Wu et al. (2018) find an extensive network of relatedness that illuminates the domestication of these groups.
Oranges originated in Asia in what is now called southeast China Cultivated for at least 7,000 years in India and in China since 2,500 BCE and documented in China since 340 BCE, sweet orange (Citrus x sinensis) is a hybrid between pomelo (Citrus maxima) and mandarin (Citrus reticulata) (Seminara et al. 2023).
The genetic and genomic studies in oranges have shown the diversification of C. sinensis and its relation with other Citrus species and have clarified that all the genetic diversity within the sweet orange species was derived from subsequent mutations starting from a single ancestor and was derived from complex cycles of hybridization and backcrossing between the mandarin (Citrus reticulata Blanco) and the pummelo (Citrus maxima (Burm.) Merr.) (Seminara et al. 2023).
Oranges are all intermediate between the two ancestors in size, flavor, and shape. The bitter orange and sweet orange both arose from mandarin-pomelo crosses, the former involving a pure mandarin, the latter with a mandarin already containing small amounts of pomelo.
Today oranges are cultivated in subtropical and tropical America, northern and eastern Mediterranean countries, Australia, and South Africa. Orange trees thrive in subtropical regions with warm temperatures and moderate humidity levels. Brazil is the leading country in orange production worldwide with 30% of world production. Within Brazil, the region with the largest production volume is Sao Paulo, with 94% of the total.
Oranges should be stored at about 38°F to 48°F. Warmer temperatures will result in a more rapid loss of quality while prolonged exposure to temperatures below 38°F may result in freeze damage. These trees grow well in slightly acidic, well-drained loam or sandy loam soils, but with proper irrigation that drains easily, could grow well in clay soils. At present, there are over 400 different varieties of oranges grown around the world.
Landscape genetics explores how the micro evolutionary processes of gene flow, genetic drift, and natural selection interact with environmental heterogeneity to shape population genetic structure and identify the movement corridors and barriers to gene flow, and the relative effects of current versus historical landscape factors on population genetic structure (Wang 2012).
This discipline also tries to identify the present-time adaptive genetic loci and investigate the adaptive potential of populations in response to future landscape and climatic changes (Kimberly 2019). Therefore, a combination of population and landscape genetics or genomics is likely to provide the best understanding of the molecular imprint of local adaptation and further guide the conservation strategies. In addition to local adaptation, migrating to new favorable locations is also a response pattern of plants to rapid climate changes, which is important for species conservation (Liu et al. 2022). Data obtained in landscape genetics analyses are intended to inform the conservation and management of the target species (Storfer et al. 2007; Epps and Keyghobadi 2015).
A variety of different molecular markers have been used to study Citrus species' phylogeny and genetic diversity. The markers used are simple sequence repeats (SSRs), nuclear LEAFY Second Intron and Plastid trnL-trnF Sequence, genome sequencing, expression sequence tags (ESts), restriction fragment length polymorphism (RFLP), randomly amplified polymorphic DNA (RAPD) and cleaved amplified polymorphic sequences (CAPS), EST-SSR and genomic SSR markers (see for example, Yingzhi et al. 2007; Omura et al. 2016; Kaur et al. 2022).
SCoT molecular markers are based on the short conserved region flanking the ATG start codon in plant genes (Collard and Mackill 2009), and targets the coding sequence in the genome, mostly the open reading frames (ORFs), which reveals the relationship between amplification results and phenotype (Li and Quiros 2001). It has the benefits of simple operation, low cost, and abundant polymorphisms. The SCoT marker has been used in many crop plant species including orange (Juibary et al. 2021).
Global warming and climate change is considered one of the greatest threats of the 21st century that may affect global biodiversity, with serious ecological consequences, like frequent droughts, wildfires, and invasive pest outbreaks, leading to the loss of plant species and lowered productivity, shortages of food crops, and a higher cost for consumers. Climate change can adversely affect genetic diversity and genetic connectivity of plant populations, ending in a greater homogenization in genetic content and a lower adaptive potential for the species in future environmental changes (Guan et al. 2021).
The species distribution models (SDMs) are used to study the present geographical distribution of species and predict their future occurrence in response to climate changes (Elith and Leathwick 2009). SDMs can be used to make inferences about the distribution of suitable habitats for species of interest, population demography, and genetic diversity (Lee-Yaw et al., 2021).
The concept of the fundamental niche versus the realized niche originally developed by Hutchinson (1957), comprises the central concept of species distribution models. Both abiotic and abiotic environmental conditions and the movement capacity of a species determine the geographic area suitable for the specie (Elith and Leathwick 2009).
Species distribution modeling (SDM), uses computer algorithms to predict the distribution of a species across geographic space and time using environmental data. SDMs use climate data (e.g. temperature, precipitation), and other variables such as soil type, water depth, and land cover. These studies are used in conservation biology, ecology, and evolution and try to illustrate how environmental conditions influence the occurrence or abundance of a particular species. Data obtained may be used for predictive purposes (ecological forecasting), a species’ future distribution under climate change, and predictions are made on the current and/or future habitat suitability of the target species (Elith and Leathwick 2009).
SDM has been also applied to landscape genetics to investigate the association of genetic variation with environmental gradients and make inferences about the role of gene flow and selection (Ortego et al. 2012; Poelchau and Hamrick 2012). These studies often use model predictions to describe habitat or climatic suitability as a single integrated measure of multiple complex environmental factors, which is then assessed in terms of its influence on genetic patterns.
To our knowledge and the literature review, no report is available on landscape genetic analysis of Citrus species including landraces of sweat orange in the country and elsewhere. Therefore, the aims of this study are to 1) investigate the population's genetic structure, and identify the geographical variables shaping it in sweat orange populations, 2) identify the climatic factors affecting the present and future distribution of these plants, and, 3) identify SCoT genetic regions potentially adapted to geographic variables in the studied populations. These findings may be used in future breeding and selection programs for orange trees in the country.
We used multiple computational methods in our study like, K-Means Clustering and linear discriminant analysis of principal components (DAPC), for grouping the accessions studied; spatial PCA (sPCA), for landscape genetic analysis, RDA (redundancy analysis), and LFMM (latent factor mixed model), for identifying SCoT regions with adaptive potentials to geographical and climatic variables studied, and species modeling by Maxent and Dismo package. Moreover, we performed random forest analysis to illustrate the importance of geographical variables on genetic polymorphism and genetic differentiation (Fst values) of the studied populations.