1. Selection of the cellular receptor for rational design
Several myotrophic AAVs have recently been developed, notably, the insertion into the AAV9 VR-VIII loop of P1 peptide (RGDLGLS) 15, 16, and a series of RGD-containing sequences identified by directed evolution 17. Importantly, these modified capsids shared a common RGD motif, which suggested their affinity to integrin (ITG), cell-surface heterocomplexes that interact with the extracellular matrix 28. Using publicly available datasets, we aimed to select relevant integrin subunits for a subsequent rational AAV design targeting skeletal muscle.
Chemello and colleagues previously performed single-nucleus RNA sequencing, comparing gene expression of all cell types in the skeletal muscle of wild-type (WT) and Duchenne muscular dystrophy mouse models (D51) 29. We extracted RNA levels of all integrin alpha and beta genes from these data (Figure S1A). Among all subunits, only the α- subunits Itgav, Itga7 and the β-subunits Itgb6, Itgb1, and Itgb5 show relatively high expression in the myogenic nuclei. Of interest is the fact that the expression level of Itgb6 is highly enriched in myonuclei, and significantly upregulated in the dystrophic condition, whereas Itgb1 and Itgb5 expression are ubiquitous in all cell types, and significantly lower than the Itgb6 level in all myonuclei. Among the two expressed α-subunits, only Itgav was known to associate with Itgb6 to form αvβ6 heterocomplexes – a member of the RGD-binding integrin family 30. Furthermore, bulk RNA sequencing data from multiple human tissues confirmed high expression of Itgav and Itgb6 in skeletal muscle, and low expression of Itgb6 in the liver and spleen, two preferred targets of natural AAV (Figure S1B, GTEx V8, dbGaP Accession phs000424.v8.p2). We therefore hypothesize that AAV transduction in skeletal muscle can be improved by rationally designing an AAV capsid that specifically binds to αvβ6.
2. Rational design of a hybrid capsid, Cap9rh74, with a high affinity to the ɑVβ6 complex
As we aim to specifically target the skeletal muscle, we selected a hybrid capsid that we previously developed and that has a liver-detargeting property as the parental capsid in our design (Patent Number: EP18305399.0). This hybrid capsid of AAV9 and AAV.rh74 (AAV9rh74) was constructed by replacing the AAV9 sequence of VR4 to VR8 with that of AAV-rh74. The hybrid capsid showed similar infectivity in skeletal and cardiac muscles but was strongly de-targeted from the liver. The latter property is of particular interest in skeletal muscle gene transfer since the majority of administrated viral vector will not accumulate in the liver, as is the case for natural AAVs 31, 32.
After selection of the cellular receptor of interest and capsid backbone, AAV capsids were computationally engineered (Fig. 1A). First, the 3D structure of the parental capsid, of with structure was unknown, was modeled using AlphaFold2 33, 34. The structural prediction of the Cap9rh74 aa 219–737 monomer performed using AlphaFold2 was at a high level of confidence, with predicted local distance difference test (lDDT-Cα), a per-residue measure of local confidence, of 97.04 and low predicted aligned error (PEA) of 4.32 (Fig S1C-D). This structure is thus suitable for the next steps in the design.
Second, we extracted the 3D structure or sequences of binding motifs of the human integrin complex from PDB. Importantly, αVβ6 was previously shown to bind with high affinity to the RGDLXXL/I motif found in the human TGF-β1 and TGF-β3 prodomains 35, 36. Binding peptides with eight amino acid residues, aa214-221 in TGF-β1 (PDB: 5ffo) and aa240-247 in TGF-β3 (PDB: 4um9), were isolated from the corresponding crystal structures before grafting into the Cap9rh74 VR4 loop. Both motifs bind to αVβ6 dimer at a very similar position (Fig S1E).
Third, the defined binding motifs were then grafted into the VR4 loop (residues 453–459) of the capsid protein based on the RosettaRemodel protocol 37. In the grafting-remodel process, many rounds of backbone optimization and sequence design iteratively search for low-energy sequence–structure pairs (Fig. 1B). The lowest-energy designs in grafting experiments of each TGF-β motif showed convergence in both structure and sequence (Fig. 1C-D, S1F-G). The new VR4 loops include the binding peptide and two flanking 2-amino acid linkers and retain the LXXL/I motif as an α-helix, which is important to bind in the β6 subunit’s pocket 36.
Retrospective docking simulations of the two AAV_ITGs with the best scores, namely Cap9rh74_5ffo and Cap9rh74_4um9, on the αVβ6 complex showed highly similar binding positions of the new VR4 loop to its corresponding inserted motifs (Fig. 1E-F). This suggests that the new capsids can bind to αVβ6 thanks to VR4-included RGDLXXL/I motif. Sequences with the best scores, which reflect the thermodynamic stability of one static protein conformation 38, were subjected to experimental validation.
3. All designed AAV_ITGs showed higher productivity and enhanced cellular transduction via αVβ6 binding.
The two AAVs with the best design were then tested for productivity and the effectiveness of using αVβ6 as a cellular receptor. They were produced by tri-transfection with pITR-CMV-GFP-Luciferase as the expression cassette. Thanks to energy optimization, all the designed AAV-ITG variants significantly increase their titers compared to their parental hybrid capsid, to levels similar to those for AAV9 (Fig. 2A, S2A). In addition, all modified AAV-ITG variants retain proportions of VP1, VP2, VP3 capsid proteins with a similar ratio of AAV9 (Fig. 2B). This suggests that the designed sequences result in more stable AAV capsid complexes thanks to their estimated low energy structure, and therefore better production efficacy.
Next, we examined whether these AAV-ITGs can effectively use αVβ6 as a cellular receptor upon infection. First, a HEK293 cell line (293_αVβ6) constitutively overexpressing both integrin subunits, αV and β6, was created using the PiggyBac system (Fig S2B-C). The designed AAVs were then tested for their infectivity in this cell line. As expected, infection of AAV_ITGs in 293_αVβ6 cells, as defined by vector copy numbers (VCN), was higher than for AAV9 and AAV9rh74 (Fig. 2C). Both AAV_ITGs dramatically improved the luciferase activity (FC9rh74_4um9/AAV9=60.50, FC9rh74_5ffo/AAV9=25.99, FC9rh74_4um9/9rh74=63.99, and FC9rh74_4um9/9rh74=27.49, Fig. 2D). To investigate how specific AAV_ITGs used αVβ6 as a cellular receptor, we tested their infectivity under binding competition conditions. The number of AAV_ITG viral vectors entering the cells was significantly reduced when blocked by the recombinant protein αVβ6 before viral infection, but no change occurred with AAV9 or AAV9rh74 (Fig. 2E). This result suggests that efficient transduction of AAV_ITGs requires specific binding to a αVβ6 complex.
During myogenesis, αVβ6 is only expressed in late differentiation, but not in the myoblast stage (Fig S1A, S2D). We therefore hypothesized an enhanced transduction of AAV_ITGs in differentiated myotubes, but not myoblasts. We infected both human myoblasts and myotubes with AAV_ITGs. Low levels of luciferase activity were observed in all AAVs tested in human myoblasts (Fig. 2G,I). On the other hand, in human differentiated myotubes (hMT), VCN and luciferase activities in both AAV9rh74_4um9 and _5ff0 were significantly higher than for AAV9 or AAV9rh74 (Fig. 2F,H,K). In particular, variant AAV9rh74_4um9 showed a 16.56 (p < 0.0001) and 25.02-fold (p < 0.0001) improvement in luciferase activity compared to AAV9 and AAV9rh74, respectively, which is in agreement with its superior transduction efficiency and transgene expression seen in 293_αVβ6 cells.
In summary, the two designed AAV_ITGs were both well-produced and function via αVβ6-specific binding, thus enhancing their transduction efficiency in 293_αVβ6 and human differentiated myotubes.
4. AAV_ITGs enhanced transduction in skeletal muscle following systematic administration
AAV_ITGs, together with AAV9 and AAV9rh74, were administrated systematically via intravenous injection (transgene: CMV_GFP-Luciferase, dose: 1E13 vg/kg, age at injection: 6 weeks, n = 4) in C57Bl6 mice to examine their biodistribution 3 weeks post-injection (Fig. 3A).
In agreement with a previous study, AAV9rh74 slightly reduces transduction in skeletal muscle compared to AAV9 but accumulates much less in the liver (Fig. 3B-D). Thanks to the liver-detargeting capsid and in accordance with the fact that αVβ6 is weakly expressed in the liver, we expected poor entry into the liver for designed AAV_ITGs. Indeed, AAV_ITGs is strongly detargeted from the liver, both at VCN and mRNA levels, even further than the parental capsid (Fig. 3C-D). In contrast, enhanced transduction was observed in all skeletal muscles that were tested, including the tibialis anterior (TA), quadriceps (Qua) and diaphragm (Dia) (Fig. 3B-D). The two AAV_ITGs both showed a substantial increase in VCN and luciferase activity compared to both AAV9 and AAV9rh74. Similar to the results obtained in in vitro models, AAV9rh74_4um9 is the best transducer among the two AAV_ITGs. Compared to AAV9, the variant 9rh74_4um9 significantly increased VCN 5.31/7.21/2.48-fold and increased luciferase activity 15.2/13.2/23.57-fold in Qua, TA, and Dia (p < 0.05), respectively. Compared to the original backbone AAV9rh74, this variant even magnified the difference by increasing VCN 5.53/2.85/7.69-fold and increasing luciferase activity 152.35/106.68/60.43-fold (p < 0.05). Furthermore, AAV9rh74_4um9, but not AAV9rh74_5ffo, significantly increased transduction in the heart (FCVCN=4.15, FCLUC=15.43, p < 0.05). All AAVs that were tested showed poor delivery and transgene expression in the lungs and kidneys. No alteration of TGFβ and integrin signaling was observed at one-month post-injection in all AAVs being tested (Fig S2F-G). Overall, these data indicate that AAV_ITGs, especially the 9rh74_4um9 variant, are strongly liver-detargeted and exhibit enhanced tropism towards skeletal and cardiac muscles.
5. AAV9rh74_4um9 transduced skeletal muscle similarly, but detargeted the liver more strongly than other myotropic AAVs
Several engineered myotropic AAVs (mAAVs), including AAVMYO 15, MYOAAV-1A and − 2A 17, have demonstrated superior efficacy for in vivo delivery of muscle compared to natural AAVs. To evaluate the properties of these AAVs compared to ours, we performed in vitro and in vivo experiments. Viral preparations were produced using the same reporter transgene (CMV_GFP-Luc). All mAAVs were well-produced in 400ml suspension, with higher titers than AAV9rh74. However, MYOAAV productivity was significantly lower than 9rh74_ITGs and MYOAAVs (Fig S3A). Since all investigated mAAVs shared a common integrin-targeting RGD motif, these AAVs were then evaluated for their transduction via integrin complexes in myotubes and in cell lines where integrin complexes were stably overexpressed by the PiggyBac system. In 293_αVβ6 cells as well as in hMT, where αVβ6 is highly expressed, AAV9rh74_4um9 showed the highest transduction among the tested myotropic AAVs, with the sole exception that luciferase activity of MYOAAV2A was higher in hMT (Fig S3B-C). We also tested AAV transduction efficiency in two other cell lines, 293_WT, where αVβ6 expression is low, and 293_α7β1 that stably overexpresses a non-RGD-targeting α7β1 integrin. In both conditions, MYOAAV2A and AAV9rh74_4um9 showed the highest transduction (Fig S3D-E). These results suggest that, as intended with the rational design, AAV9rh74_4um9 uses αVβ6 more preferentially for cellular transduction than others, yet it can also efficiently use other integrin(s) similar to MYOAAV2A.
Following in vivo injection in the same setting as described above (6-week-old WT mice, dose: 1E13 vg/kg, n = 4), the three mAAVs and 9rh74_4um9 all showed strong liver-detargeting, high enrichment in both skeletal and cardiac muscles, and negligible transduction levels in other organs that were tested (kidneys, lungs, and brain) (Fig. 3G-H). No significant difference was observed in either VCN or luciferase activity between all three mAAVs and 9rh74_4um9 in the skeletal muscles that were tested. In heart muscle, MYOAAV2A showed a significant increase in VCN compared to other myotropic vectors, but no difference in luciferase activity, in agreement with the original observation 17. The most striking difference is the level of liver-detargeting between these vectors. The VCN for 9rh74_4um9 in liver is 3.34/22.05/13.85 times lower than for AAVMYO (p = 0.0022), MYOAAV-1A (p = 0.0013) and − 2A (p = 0.033), respectively (Fig. 3G), and is therefore the only vector that accumulates less in liver than skeletal muscles (Fig S3F-G). These data indicate higher muscle specificity for the 9rh74_4um9 variant compared to other myotropic vectors that have been investigated to date.
In summary, the 9rh74_4um9 variant, hereafter referred to as LICA1 (linked-integrin-complex AAV), consistently showed enhanced transduction and strongest liver-detargeting. Therefore, we then attempted to evaluate LICA1 as a delivery vector for muscular dystrophies, in comparison with AAV9. Two different setups will be investigated: the transfer of microdystrophin (µDys) – an incomplete transgene - in mdx, a mild mouse model of Duchenne muscular dystrophy (DMD) and of the full-length human α-sarcoglycan (SGCA) in a severe mouse model of limb-girdle muscular dystrophy R3 (LGMD-R3).
6. Low-dose LICA1-µDys gene transfer is effective in specifically overexpressing microdystrophin in dystrophic muscle but not sufficient to fully correct the underlying pathology
DMD is caused by mutations in the DMD gene, which encodes for dystrophin protein – a key player in the dystrophin-glycoprotein complex (DGC), which is critical for the structural stability of skeletal muscle fibers 39. Lack of dystrophin can result in progressive loss of muscle function, respiratory defects, and cardiomyopathy. The most commonly used DMD animal model is the mdx mouse, with a lifespan reduced by 25%, milder clinical symptoms than those seen in human patients, with the exception of the diaphragm muscle 40. Among many therapeutic strategies to restore dystrophin expression, high-dose AAV-based gene transfer of shortened functional forms of the dystrophin ORF provided excellent results in animal models, but unsatisfactory conflicting data in current clinical trials 6. Severe toxicities, even patient death, have been reported from these trials (NCT03368742, NCT04281485), assumed to be related to the dose of ≥ 1E14 vg/kg. We therefore explored the possibility of low-dose µDys gene transfer 41 in mdx mice using LICA1 in comparison to AAV9 (Fig S4A, age at injection: 4 weeks, dose: 5E12 vg/kg, treatment duration: 4 weeks, n = 5). Three muscles with increasing levels of severity – TA, Qua, and Dia – were used to study AAV transduction and treatment efficacy.
LICA1 showed better µDys gene transfer than AAV9 in this model. LICA1-treated mice exhibited a significantly higher VCN in all 3 muscles that were tested, 1.85/2.02/1.07 times higher in TA (p < 0.0001), Qua (p < 0.0001), and Dia (p = 0.020), respectively (Fig. 4A). RNA levels indicated even greater differences and were 4.56–7.57 times higher in the LICA1-treated group (Fig. 4B; TA: FC = 4.56, p < 0.0001; Qua: FC = 5.46, p = 0.0001; Dia: 7.57, p = 0.05). Consequently, LICA1 can transduce almost 100% in TA and Qua, and 49.98% in Dia, while substantially lower numbers were seen in AAV9-treated muscles, at 73.22% (p = 0.0001), 57.8% (p < 0.0001), 10.34% (p < 0.0001) in TA, Qua, Dia, respectively (Fig. 4C, Fig S4B). Furthermore, while infection levels and expression of the transgene in liver were high for the AAV9 vector (despite the use of muscle-specific promoter), the VCN and mRNA levels in LICA1-treated liver were extremely low (Fig. 4A-B, FCVCN:AAV9/LICA1=36.8, p = 0.0002; FCmRNA:AAV9/LICA1=64.7, p < 0.0001). These data again confirmed the transduction efficiency and specificity towards skeletal muscle for the LICA1 vector, even with low-dose treatment.
The histological features and muscle functionality after AAV treatment were restored accordingly. The centronucleation index (percentage of centronucleated fibers) – an indicator of the regeneration/degeneration process – did not change with AAV9 (except in TA) but was significantly reduced upon LICA1 treatment (reduction of 21.68%, 19.05%, 22.88% in TA, Qua, Dia, respectively) (Fig. 4D, Fig S4C). Similarly, the fibrosis level in two severely affected muscles, Qua and Dia, only exhibited a significant reduction with LICA1, but not AAV9 (Fig. 4E, Fig S4D). The serum biomarker MYOM3 level, an indicator of muscle damage 42, showed a reduction for both AAV treatments, with a considerable further reduction seen in the LICA1-treated group (Fig. 4F, FCAAV9/KO=0.75, FCLICA/KO=0.43, pAAV9−LICA1>0.0001). More importantly, AAV9 treatment did not affect any muscle functionality being tested (Fig. 4G-I), while significant improvements with LICA1-µDys treatment were observed in escape test – a measure of global force (Fig. 4G, FCLICA1/mdx=1.19, pLICA1/mdx=0.02) and in situ TA mechanical force measurement (Fig. 4H, FCLICA1/mdx=1.14, pLICA1/mdx=0.0006). However, none of the treatment normalized to the WT functional levels. These data indicate that LICA1 is better than AAV9 at restoring dystrophic histological features and muscle functions.
We also investigated the molecular alteration in Qua upon AAV treatment using RNA-seq. On the two first principal components (PCs) of the PCA, a clear distinction between four transcriptome groups (WT, mdx, AAV9, LICA1) was observed, while LICA1-treated muscles were clustered closer to the WTs than others (Fig S4E). To our surprise, despite excellent transgene expression by LICA1, global transcriptomic restoration was relatively modest (Fig. 4K). Nevertheless, a substantial improvement can still be seen for LICA1 compared to AAV9. Among 4216 down- and 4501 upregulated differentially expressed genes (DEGs) identified in mdx muscle, 1515 (35.9%) and 1728 (38.4%) were restored by AAV9, while LICA1 was able to correct 1736 (41.2%) and 1980 (44.0%), respectively (Fig. 4L-M). In addition, a greater number of genes were either not or insufficiently corrected by AAV9 than by LICA1 (Fig. 4N). A total of 2572 genes were downregulated (61.0%) and 2620 (58.2%) incompletely restored, while significantly lower numbers were seen for LICA, with 2094 (49.67%) down- and 2019 (44.86%) upregulated. Interestingly, some known dysregulated pathways, including α- and ϒ-interferon responses and oxidative phosphorylation, were significantly better normalized by LICA1 than by AAV9 (Fig S4F).
In summary, at 5E12 vg/kg, LICA1-µDys, but not AAV9, was efficient in transducing close to 100% myofibers, except in the diaphragm. This effective improvement in transduction can significantly reduce some dystrophic features in all muscles that were tested, yet restoration in the global transcriptome remains modest. However, greater improvements in functional, histological, and transcriptomic restoration were achieved with LICA1 compared to AAV9.
7. Low-dose LICA1-SGCA treatment restored the muscle functionality, dystrophic phenotypes, and transcriptomic dysregulation in a severe SGCA mouse model.
LGMDR3 is caused by mutations in the SGCA gene 43 – another component of the DGC complex. Defects in the SGCA protein therefore lead to muscle weakness and wasting. A LGMDR3 mouse model has been established, which closely represents patient’s clinical phenotypes 44. Similar to the setting in mdx mice, low-dose AAV treatment with 5E12 vg/kg was investigated in this mouse model. AAV9 or LICA1 encoding human SGCA (hSGCA) under control of a muscle-specific human Acta1 promoter were injected into 4-week-old SGCA-KO mice (Fig. 5A). Analysis was performed 4 weeks post-treatment.
In all three muscles that were tested, TA, Qua, Dia (in order of increasing severity), transduction in various measures, VCN, mRNA level, and percentage of SGCA + myofibers, was significantly greater in the LICA1-treated group than for AAV9 (Fig. 5B-D, Fig S5A). Of note is the fact that the differences in transduction efficacy (%SGCA + myofibers) between LICA1 and AAV9 are greater in more severely affected muscles (Fig. 5D). At such a low dose, AAV9 was able to transduce > 80% myofibers in TA while LICA1 can reach close to 100% (p < 0.0001). While LICA1 still transduced almost 100% of fibers in Qua (the muscle affected with intermediate severity), only 58.1% fibers were transduced by AAV9 on average (p < 0.0001). In the most severely affected muscle, Dia, both vectors displayed reduced efficiency; however, LICA1 continued to demonstrate much better transduction (µAAV9 = 22.1%, µLICA1 = 59.5%, p < 0.0001).
The differences in transgene delivery and expression positively correlated with levels of histological and functional restoration. Different dystrophic histological features, including percentage of centronucleated fibers (Fig. 5E, Fig S5B), percentage of fibrosis area (Fig. 5F, Fig S5C), and fiber size distribution (Fig. 5G), were all significantly better normalized by LICA1 than AAV9, especially in more severely affected muscles. Importantly, no significant improvement was observed in the AAV9-treated group in centronucleation index and fibrosis level in Dia, while LICA1 reduced these parameters by half (Fig. 5E-F). Fiber sizes were also restored to near-WT distribution by LICA1 in this muscle (Fig. 5G). No difference in body weight was seen between groups with or without AAV treatment (Fig S5D). At the functional level, however, the escape test – a measure of global force - showed a significant increase in AAV9-treated mice (FC = 1.42, p = 0.0072) and was even higher in LICA1-treated group (FC = 1.72, p < 0.0001) (Fig. 5H). On the other hand, in situ TA mechanical forces were both improved in the two AAV groups at similar levels (Fig. 5I), possibly due to > 80% transduction rate by both vectors. Similar to the global force, the serum MYOM3 level was greatly reduced in the LICA1-treated group but not for AAV9, indicating less muscle damage (Fig. 5K). No difference was seen in the anti-capsid antibody between the two AAV treatments (Fig S5E). These results indicate that better and significant functional and histological restoration in the LICA1-treated mice was achieved, even at low-dose treatment, thanks to superior transduction efficacy.
We further investigated the molecular alterations following AAV treatment by transcriptomic profiling of the quadriceps muscle. The first principal component (PCs) of the PCA was able to separate a group including WT and LICA1 with a group including SGCA-KO and AAV9, suggesting close proximity between elements within these 2 groups (Fig S5F). A heatmap of all 8591 significant DEGs (4035 downregulated and 4556 upregulated) further highlighted the restorative effect of LICA1 on gene expression levels (Fig. 5L). LICA1-treated muscles, in particular, demonstrated a significant correction of 69.9% (2821/4035) and 66.5% (3028/4556) of down- and upregulated DEGs, respectively, compared to 12.4% (500/4035) and 9.21% (420/4556) corrected by AAV9 treatment (Fig. 5M-N). Conversely, not all DEGs were significantly restored or returned to WT levels. The number of such transcripts in AAV9-treated muscles was much higher than in the LICA1-treated group (Fig. 5O): 2541 (63.0%) downregulated DEGs and 3045 (66.8%) upregulated DEGs for AAV9, with only 483 (12.0%) downregulated DEGs and 1038 (22.8%) upregulated DEGs in the LICA1-treated group. These data illustrate that low-dose LICA1 treatment can effectively normalize the majority of the dysregulated transcriptome and is much more efficient in correcting gene expression dysregulation than AAV9 at the same dose.
In summary, low-dose (5E12 vg/kg) AAV gene transfer using LICA1 in the LGMDR3 mouse model is effective in restoring muscle function, dystrophic histology, and the dysregulated transcriptome. The efficacy was much greater than for AAV9 at the same dose due to enhanced transduction.