Introduction of GG residues to the 5’ end of an sgRNA does not eliminate cleavage
To determine whether it is feasible to expand the range of sgRNAs synthesized in vitro using the T7 promoter, we designed sgRNAs containing two GG residues at their 5’ ends and used them to target genomic sequences that did or did not contain 5’ GG. Two sgRNA variations were tested. In the first case, the sgRNA target region had a GGN18 structure consisting of two GG residues and 18 additional nucleotides. GGN18 sgRNAs were used to target genomic sites that contained a perfect match to the sgRNA targeting sequence, including the two 5’ G residues. In the second case, the GG residues were added to a targeting sequence 20 nucleotides long, creating a GGN20 structure. We refer to the GG residues on GGN20 sgRNAs as “supernumerary”. GGN20 sgRNAs were tested at genomic sites that did not contain a 5’ GG sequence, but otherwise perfectly matched the 20 nucleotides in the target region.
Two genes were targeted to evaluate the performance of the GGN18 and GGN20 sgRNAs. BmBLOS2 is located on the Z chromosome, and is responsible for the biosynthesis of urate granules, which accumulate in epidermal cells and make the larval integument opaque. Because mutations in this gene result in an easily detected oily skin phenotype, BmBLOS2 is often used to confirm the efficacy of gene editing methods [1, 2, 4]. GR66 is located on the third chromosome and has recently been identified as a key gustatory receptor responsible for the mulberry-specific feeding preference of the silkworm [7]. These two genes, for which null mutations are recessive, were used to test the efficacy of our strategy.
Two targets were selected for initial tests in BmBLOS2 and BmGR66 (designated BLOS2-T1 and GR66-T5, respectively; Fig. 1). For both targets, sgRNAs in the GGN18 and GGN20 formats were synthesized using T7 RNA polymerase-mediated in vitro transcription. Note that the GGN18 sgRNAs match their respective targets in these genes perfectly (Fig. 1A and 1G), while the supernumerary GG residues in the GGN20 sgRNAs have two mismatches at BLOS2-T1 (5’ CT) and one mismatch at GR66-T5 (5’ CG) (Fig. 1B and 1H). Each sgRNA was separately mixed with Cas9 protein and incubated at room temperature to form a Cas9 protein/sgRNA complex, which was then injected into pre-blastoderm embryos. To examine indel frequencies, pools of 60 randomly selected injected embryos were harvested for genomic DNA extraction. Regions surrounding the sgRNA targets in BmBLOS2 and BmGR66 were PCR amplified and T7EN1 assays were performed to detect anomalous DNA structures. The results demonstrated that the GGN18 sgRNAs generated indels at higher efficiency (37.4%±1.6%) than the GGN20 sgRNAs (26.9%±3.3%) at the sgRNA BLOS2-T1 target (P < 0.05) (Fig. 2A and 2B). In contrast, no significant differences in indel frequencies were detected between GGN18 (31.6%±6.8%) and GGN20 (30.6%±6.5%) at GR66-T5 (P = 0.95) (Fig. 3A and 3B). The result at GR66-T5 suggests that a single 5’ G mismatch has a relatively minor effect on sgRNA cleavage efficiency compared with a 5’ GG mismatch.
To further test the method, we selected four additional genomic targets in BmGR66, none of which have native (G/A) (G/A) residues at their 5’ ends (GR66-T1, T2, T3, T4). Thus, at all four targets, sgRNAs in the GGN20 format will mismatch the genomic sequences at both supernumerary G residues (Fig. 1C, 1D, 1E, 1F). After in vitro synthesis from a T7 promoter, the sgRNAs were mixed with Cas9 protein, and the complexes were injected into pre-blastoderm silkworm embryos. DNA was extracted from pools of 60 randomly selected injected embryos for each sgRNA and subjected to a T7EN1 assay. GR66-T1, T2, T3, and T4 generated indels at efficiencies of 30.0%, 7.3%, 68.7%, and 32.2%, respectively (Fig. 3C). Importantly, the GGN20 sgRNA at GR66-T3 produced indels much more efficiently than did the GGN18 sgRNA at GR66-T5 (68.7% vs 31.6%). This result demonstrates that an sgRNA containing 5’ GG mismatches may outperform an sgRNA with 5’ GG matches within the same gene, suggesting that high knock-out efficiencies can sometimes be obtained without the requirement for 5’ GG matching.
In summary, the introduction of 5’ GG mismatches reduces the cutting efficiency of a GGN20 sgRNA, although a single 5’G mismatch has a relatively minor effect. Nevertheless, the presence of one or two 5’ mismatching G residues does not eliminate cleavage. Therefore, the addition of supernumerary 5’ GG residues via the T7 promoter permits the synthesis of sgRNAs at high efficiency and also makes it possible to expand the target repertoire of the Cas9 RNP-based method. Indeed, in some cases it is possible to achieve superior gene knockout efficiency at genomic targets that do not contain matching 5’ GG residues.
Generation of somatic and heritable mutations in BmBLOS2 using a GGN20 format sgRNA and Cas9
To test the germline transmission efficiency of mutations introduced by GGN20 sgRNAs, we injected 560 pre-blastoderm embryos with a complex containing the Cas9 protein and an sgRNA targeting BmBLOS2 (BLOS2-T1). We detected a mosaic translucent epidermal phenotype in 42 fifth instar larvae out of a total of 81 in the injected generation (G0), yielding an efficiency of 51.9% (42/81) (Fig. 4A). G0 mosaic mutants with pronounced phenotypes were sib-mated. To assess heritability of the mutation, we randomly chose hatched larvae from three G1 egg batches (around 100 individuals per batch), and detected at least one larva in each batch with a completely translucent epidermal phenotype, indicative of homozygosity (Fig. 4A). A total of five larvae with completely translucent epidermal phenotypes were found in the three batches. One phenotypic larva from each batch (one male and two females) was chosen randomly to determine the genotype. The analysis showed that all three harbored mutations in the sgRNA target regions, and no wild type alleles were detected (Fig. 4B).
These results demonstrate that the transmission efficiency of mutations generated by GGN20 format sgRNAs is high enough to obtain phenotypic male (ZZ) homozygous mutants and female (ZW) hemizygous mutants in the BmBLOS2 gene after only one breeding.
Generation of heritable BmGR66 mutants by co-injection of four sgRNAs with 5’ (G/A) (G/A) target mismatches
To examine the germline transmission efficiency of mutations introduced by multiple sgRNAs with 5’GG mismatches, we prepared GGN20 sgRNAs targeting T1, T2, T3, T4 in BmGR66, complexed them with Cas9 protein, and co-injected them into 80 preblastoderm embryos. Six embryos hatched, and genomic DNA was extracted from their wings after eclosion. Regions surrounding the four sgRNA targeting sites were amplified by PCR and then subcloned. Three to five randomly selected subclones generated from each silkworm were sequenced. Nucleotide substitutions, indels (small insertions or deletions), large fragment deletions, and inversions were detected surrounding all four sgRNA targeting sites (Additional file 1: Figure S1 and S2). No wildtype sequences were detected from any silkworm, and at least three mutant alleles were identified (Additional file 1: Figure S1 and S2), demonstrating that the silkworms were mosaic in the injected generation (G0). Similar outcomes have been reported previously in silkworms [4], zebrafish [12], and mouse [13, 14].
To test whether the mutations can be transmitted through the germline, we randomly selected one G0 male silkworm (#2) and one female G0 silkworm (#4), both with somatic mutations (Additional file 1: Figure S1 and S2). They were mated to obtain offspring, and newly moulted fifth-instar larvae in generation 1 (G1) were tested to determine their feeding preferences. 21 out of 79 ate cabbage leaves, which are not normally consumed by wildtype silkworms, demonstrating an altered feeding preference (Additional file 2: Video S1). To correlate phenotype with genotype, 17 of the 21 phenotypic silkworms were randomly selected for analysis, and regions surrounding the sgRNA targets were amplified by PCR, subcloned, and sequenced. A total of 11 genotypes were recovered, and all harbored homozygous or compound heterozygous mutations. It is noteworthy that 7 of the 11 genotypes contained large (> 150 bp) insertions or deletions, which are readily obtained by simultaneous injection of multiple sgRNAs, and 10 of the 11 genotypes encoded truncated and presumably functionless proteins (Fig. 5 and Table 1). The changed feeding preferences resulting from the loss of BmGR66 function are consistent with a previous report [7].
Table 1
Genotypes of 17 examined G1 BmGR66 knockout silkworms.
Silkworm ID | Genotype |
GR66 G1 ♀-1 | genotype 4 | homozygous |
GR66 G1 ♀-2 | genotype 1 and 2 | compound heterozygous |
GR66 G1 ♀-3 | genotype 4 and 2 | compound heterozygous |
GR66 G1 ♀-4 | genotype 10 and 11 | compound heterozygous |
GR66 G1 ♀-5 | genotype 4 and 2 | compound heterozygous |
GR66 G1 ♀-7 | genotype 1 and 2 | compound heterozygous |
GR66 G1 ♀-9 | genotype 4 | homozygous |
GR66 G1 ♀-10 | genotype 3 | homozygous |
GR66 G1 ♀-12 | genotype 2 | homozygous |
GR66 G1 ♀-13 | genotype 9 | homozygous |
GR66 G1 ♂-1 | genotype 6 | homozygous |
GR66 G1 ♂-2 | genotype 5 | homozygous |
GR66 G1 ♂-4 | genotype 1 and 2 | compound heterozygous |
GR66 G1 ♂-5 | genotype 2 | homozygous |
GR66 G1 ♂-6 | genotype 7 | homozygous |
GR66 G1 ♂-8 | genotype 8 | homozygous |
GR66 G1 ♂-9 | genotype 1 | homozygous |
The corresponding genotypes are shown in detail in Fig. 5. |
Taken together, these results demonstrate that co-injection of multiple sgRNAs with mismatched 5’GG residues can efficiently generate somatic mutations at targeted regions in injected individuals. Furthermore, mutant alleles can be transmitted through the germline, and null mutants can be readily obtained by sib-mating G0 mosaics.