PCR specificity and amplification efficiency of the candidate reference genes
Thirteen housekeeping genes (18S, 40S, CHAL, UBE2, EF-1α, F-box, GAD, PCS1, PP2A, SAND, TATA, TIP41, and TUB) were identified from NCBI and the Cannabis sativa genome browser gateway based on a homology search with Arabidopsis genes (Table 1). Specific primers were designed and used to confirm their specificities based on the amplification efficiency and specificity (Table 1). All gene primers amplified single bands with predicted sizes resolved in agarose gel electrophoresis (Figure 1, Figure S1). For the qRT-PCR amplification, the PCR efficiency (%) ranged from 91.22 to 113.87, and the regression coefficient (R2) varied from 0.9893 to 0.9994 (Table 1).
Ct values of candidate reference genes
Transcript abundances of 13 candidate reference genes were assessed by qRT-PCR for each gene, tested in triplicates across all 11 treatments and control, which was in a total of 36 biological samples (Figure 2). A majority of the candidate reference genes Ct values ranged from 20 to 30. The PCS1 gene showed the lowest expression level with the highest Ct values between 27.4 and 32.1. The EF-1α gene showed the highest expression level with the lowest Ct values ranging from 18.9 to 25.3. The CHAL gene displayed the highest difference among all 36 samples tested, with a minimum Ct value of 22.3 and a maximum Ct value of 30.9. These Ct value analyses indicated that the transcription levels of candidate reference genes are unstable under different stress conditions.
Analysis of gene expression stability by geNorm
The geNorm was used for evaluating the expression stability of the 13 candidate reference genes (Table 2). Data analysis was calculated based on individual 11 different treatments and three different groups of treatments such as osmotic stress (OS: mannitol, NaCl), heavy metal stress (HM: CdCl2, CuSO4, PbNO3, ZnSO4), and hormonal stimuli (PH: ABA, GA3, MeJA, SA). The total ranking was also shown by combining all 11 treatments together. This algorithm evaluated the gene expression stability (M) based on the average pairwise variation of all tested genes27. In this analysis, the lower the M value, the more stable the gene expression. A reference gene that has an M value less than 1.5 is used for qRT-PCR. PP2A and TIP41 were the most stable reference genes with the lowest M value (0.46) whereas CHAL had an M value of 1.07 and was ranked as the least stable gene. Individually, EF-1a and SAND were the most stably expressed genes under osmotic stresses while F-box and TATA were the least stably expressed genes with an M value of 0.22. The TUB and TATA genes showed the lowest M values of 0.16 among all of the heavy metal stressed clones. Exposure to hormonal stimuli resulted in PP2A and F-box to be the most stable with an M value of 0.27 and CHAL to be the least stable with an M value of 0.75.
F-box was ranked as the second least stably expressed gene under both osmotic and heavy metal stresses, but it was ranked as the first and second most stable gene under UV and plant hormone treatments, respectively. The TATA gene was least stably expressed under osmotic stresses but was among the top two and three under heavy metal stress and plant hormone stimulus, respectively. CHAL was the least stably expressed in response to UV light application.
geNorm can determine the minimal number of reference genes that should be used to obtain an accurate normalization. The optimal number of reference genes was determined based on the pairwise variation (Vn) between two normalization factors (NFn) composed of an increasing number of reference genes27. The threshold value (Vn/Vn+1 = 0.15) indicates if the number of reference genes less than or equal to the value of n is sufficient to use as a reference gene. As shown in Figure 3, the pairwise variation value V2/V3 of all experimental samples was less than 0.15, demonstrating that two reference genes should be sufficient for normalization under all conditions tested.
Analysis of gene expression stability by NormFinder
NormFinder is a quantity-model-based software and uses complex statistical models to compute the variation between the expression of genes across different biological groups28. The lowest expression stability value represents the most stable reference genes. Our results obtained from NormFinder analysis are summarized in Table 3. The EF-1a and TUB genes were the most stably expressed in all samples and were ranked as fourth and third by geNorm, respectively. The F-box, TATA, and CHAL were ranked as the three least stable genes both by NormFinder as well as geNorm. The TUB, PCS1, and TATA genes were the most stably expressed under osmotic stress, heavy metal stress, and plant hormone stimuli, respectively. In comparison to geNorm, TUB, PCS1, and TATA were ranked as third, sixth, and third positions in each category, respectively. The least stably expressed reference genes under osmotic stress (CHAL, F-box, TATA), heavy metal stress (CHAL, F-box, TATA), and plant hormone stimuli (40S, GAD, CHAL) had similar rankings when compared to geNorm rankings. The GAD and F-box genes were found to be the most stable reference genes under UV stress while PCS1 and CHAL were the least stable which was a similar trend observed in the geNorm analysis.
Analysis of gene expression stability by BestKeeper
The BestKeeper is an excel-based algorithm and uses standard deviation (SD) and coefficient of variation (CV) data of the average Ct values for specific treatments29(Table 4). Lower CV ± SD values represent higher stability. When using the BestKeeper algorithm, genes with an SD value > 1 are considered as an undesirable reference gene22. When all samples were taken into consideration, TATA (SD=0.74), 40S (SD=0.79), PCS1 (SD=0.84), PP2A (SD=0.90), and TUB (SD=0.99) were determined to be reliable reference genes. TATA showed the lowest SD among all 13 reference genes in all samples and the SD values were greater than 1 in osmotic stress (1.29) and mannitol (1.82). The 40S gene was ranked as the second most stable candidate in all samples tested, but the SD value of 40S under NaCl and PbNO3 stresses were 1.09 and 1.36, respectively. PCS1 was ranked at the third position in all samples tested and SD values were below 1 in any individual treatment and also the three treatment groups. The CHAL gene displayed the highest SD value with 1.95 in all samples indicating that this gene is unsuitable for gene expression normalization. The CHAL gene exhibited an SD value less than 1 only under GA3 (0.71), ABA (0.27), CdCl2 (0.58), and CuSO4 (0.98) treatments.
Analysis of gene expression stability by RefFinder.
RefFinder is a web-based tool for comprehensive analysis that integrates geNorm, NormFinder, Delta Ct, and BestKeeper approaches30. The reference genes were ranked from the most stable (M value is the lowest) to least stable expression (M value is the highest) using Best Keeper (Table 5). Among them, the most stable candidate was the EF-1a gene followed by the TUB gene in all samples. The EF-1a and TUB genes were also ranked at third and first places under osmotic stress conditions, respectively. The TATA gene was most stably expressed under heavy metal and plant hormone treatments while this gene was the least stable under osmotic stress. The CHAL gene was ranked as the least stable gene in all samples tested. The GAD and CHAL genes were the most and least stably expressed genes respectively under UV application, which was the same findings as the NormFinder software.
Validation of selected reference genes
To validate the selected reference genes, gene expression levels of AAE1 and CBDAS were measured (Figure 4). Each of the two most stable reference genes, EF-1a and TUB, a combination of these two stable reference genes (EF-1a+TUB), and the least stable reference gene (CHAL) were used as internal controls. AAE1 expression was significantly reduced under drought (Mannitol) and salinity (NaCl) stresses. EF-1a, TUB, a combination of EF-1a and TUB were used for normalization of qRT-PCR analysis. There was no significant difference in the AAE1 expression between the mock treatment and osmotically stressed samples (Mannitol and NaCl ) when CHAL was used as an internal control. The expression of CBDAS was also reduced under osmotic stresses when expression data was normalized with EF-1a, TUB, and a combination of EF-1a and TUB except when the CBDAS expression under NaCl stress was normalized with the TUB gene. When CHAL was used as a reference gene, CBDAS gene expression was reduced under mannitol treatment but there was no difference between mock and NaCl treatments.