Somatic ERBB3 mutations in cancer
ERBB3 has a somatic mutation frequency of 2.1% in pan-cancer analysis with relatively high frequencies observed e.g. in urothelial (6.3-9.7%), endometrial (8.2-12.8%), lung (1.4-8.8%), and colorectal (4.9-6.2%) cancer (cbioportal.org) [23,24]. Among the 831 observed unique genetic ERBB3 alterations, the cBioPortal database lists only eight missense mutations as oncogenic and 41 variants affecting 11 additional amino acid residues as likely oncogenic (Supplementary Fig. 1).
Modified cell model for screening of activating variants of ERBB3 pseudokinase
Murine lymphoid Ba/F3 cells were selected as the cellular background for screening activating ERBB3 mutations as we have previously used the model to screen for activating EGFR [25] and ERBB4 [26] variants. The fact that the Ba/F3 cells are critically dependent on exogenous IL-3 for survival and that this dependency can be substituted by ectopic expression of an active RTK [34], allows the use of the cells for a functional readout of RTK-driven growth. However, overexpression of even the well-characterized oncogenic E928G variant of ERBB3 was not sufficient to promote IL-3-independent Ba/F3 cell growth (Fig. 1A), as expected for a pseudokinase in the context devoid of endogenously expressed heterodimeric partners. These observations indicated that Ba/F3 background could not be used to address differential transforming potential of ERBB3 variants in the absence of a kinase-competent partner to heterodimerize with ERBB3.
To modify the model with simultaneous co-expression of ERBB2, but to limit the role of ERBB2 to only serve as the receiver kinase in the ERBB3/ERBB2 dimer, a ERBB2 V956R mutant variant [29–31] was introduced to the Ba/F3 cells (Fig. 1B-C; Supplementary Fig. 2A-B). We had previously demonstrated that co-expression of wild-type ERBB2 together with wild-type ERBB3 in the same model was sufficient to readily promote IL-3-independent growth [27], precluding the use of wild-type ERBB2 as the heterodimer partner. The V956R variant with a valine-to-arginine mutation in the C-lobe of the ERBB2 kinase domain disrupts the capability of the receptor to serve as an activator but does not interfere with its function as a receiver in the transactivation process between two ERBB kinases (Fig. 1C). The “activator-incompetent” V956R was therefore also expected to be incapable of transactivating ERBB2 in a homodimeric ERBB2/ERBB2 complex. Consistent with reduced potential of the ERBB2 V956R mutant in transforming the Ba/F3 cells, it did not promote IL-3-independent growth even in the context of co-expression with wild-type ERBB3 when the complex was not activated by the NRG-1 ligand (Fig. 1B). However, indicating a window for a functional read-out, introducing the oncogenic mutant ERBB3 E928G together with the activator-incompetent ERBB2 V956R did result in IL-3-independent growth (Fig. 1B). The approach also seemed to differentiate between ERBB3 variants with variable transactivation potencies, as the other known oncogenic variants G284R and V104M were clearly less potent as compared to E928G in promoting growth and ERBB3 phosphorylation (Supplementary Fig. 2A-B).
An expression library of randomly mutated ERBB3 variants
To perform an unbiased screen to study thousands of ERBB3 mutations in a high-throughput assay, a randomly mutated ERBB3 cDNA library was created with error-prone PCR as previously described for the iSCREAM pipeline [25,26]. As a result, the average mutation frequency in the cDNA library (pDONR221-ERBB3 library) was estimated to be 1.3 mutations per a 4 029 bp ERBB3 cDNA insert by Sanger sequencing. The cDNA library was subsequently cloned into a pBABE-gateway retroviral mammalian expression vector. To characterize the distribution of ERBB3 variants in the library, the ERBB3 insert from the pBABE-puro-gateway-ERBB3 library was PCR amplified and deep sequenced using Illumina NovaSeq6000 platform. The analysis indicated that the library was comprised of 9 013 unique ERBB3 single nucleotide variants out of the 12 088 theoretically possible (74.6%). The resulting amino acid alterations indicated the presence of 8 055 out of the 8 276 theoretically possible amino acid changes (derived by altering a single nucleotide in a codon), resulting in the coverage of 97.3% of all possible ERBB3 missense or nonsense mutations. Specific missense or nonsense mutation distribution and different transition and transversion mutation distributions are shown in Supplementary Fig. 3.
Functional genetics screen with the modified iSCREAM pipeline
The ERBB3 mutation library was transduced into Ba/F3 cells expressing ERBB2 V956R. Cells expressing the mutant expression library or wild-type ERBB3 were cultured in the presence or absence of IL-3, in the presence of 20 ng/ml of the ERBB3 ligand NRG-1 without IL-3, or in the presence of 20 ng/ml of NRG-1 without IL-3 for 48 hours prior to complete depletion of both IL-3 and NRG-1 from the culture medium. The cells expressing the ERBB3 mutation library were able to survive in the complete absence of both IL-3 and NRG-1, while all cells expressing wild-type ERBB3 died (Supplementary Fig. 2C). These observations indicated that the ERBB3 cDNA library included variants that promoted Ba/F3 cell growth by transactivating ERBB2 V956R.
To identify ERBB3 variants enabling Ba/F3 cell transformation, genomic DNA was extracted from the surviving cell populations and the ERBB3 cDNA inserts were PCR amplified and deep sequenced (>1 300 000 X) on Illumina NovaSeq6000 platform. The read counts specific for each ERBB3 coding sequence variant (normalized to total number of reads at each ERBB3 locus) in the surviving cell populations were compared to the original transduced IL-3-dependent cell populations and the enrichment of each specific mutation (fold change) as well as the relative variant frequency of each mutation in the surviving cell pool was estimated (Supplementary Information).
ERBB3 variants promoting Ba/F3 cell transformation
The combined results from three independent samples demonstrated that cDNAs encoding 18 ERBB3 missense variants were enriched by at least 25 fold in the Ba/F3 cell populations surviving in the absence of all exogenous ligands (Fig. 2). Seven of the mutations (P212L, Y265C, L361P, L482P, A676T, D797V, and E928G) were enriched by over >100 fold (Fig. 2). The known oncogenic variant E928G was one of the main hits in the screen, validating the approach. In addition, the 18 enriched variants included the missense mutations K329R and E332K annotated as likely oncogenic by cBioPortal (Supplementary Fig. 1).
To validate the hits, 12 individual ERBB3 mutations, each reaching the enrichment level of at least 25 fold, were selected and transduced into Ba/F3 cells either alone or together with ERBB2 V956R. The 12 variants included all seven with enrichment of over 100 fold, as well as five others from the group of 11 reaching the enrichment level of 25-100 fold.
When expressed in Ba/F3 cells alone, none of the 12 mutants was able to promote IL-3-independent growth (Supplementary Fig. 4A-B), consistent with the lack of dimerization partner to serve as target for transactivation by ERBB3. However, when co-expressed with the activator-incompetent ERBB2 construct (Supplementary Fig. 4B), expression of E332K, A676T, or E928G induced emergence of IL-3-independent clones (Fig. 3A). When ERBB3 expression was first primed by a 48-hour incubation in the presence of 20 ng/ml of NRG-1, three additional ERBB3 mutants, K279N, N353T, and D797V, reproducibly also promoted IL-3-independence (Fig. 3A). Importantly, however, in none of the experimental repeats did an IL-3-independent clone emerge from vector- or wild-type ERBB3-transduced clones, regardless of the presence or absence of the heterodimerizing partner ERBB2 V956R.
The L361P mutant seemed to have a slightly aberrant molecular weight in western analyses (Supplementary Fig. 4B), and flow cytometry using an antibody against the extracellular domain of ERBB3 demonstrated lack of the epitope at the cell surface (Supplementary Fig. 4C), consistent with defective maturation and translocation of the variant to the cell surface.
Functional validation of the ERBB3 variants in MCF-10A and NIH 3T3 cells
The 12 ERBB3 mutations selected for further functional characterization were also tested for activity in human MCF-10A mammary epithelial cells and mouse NIH 3T3 fibroblasts. Again, when expressed alone, none of the variants were able to promote survival of serum-starved MCF-10A cells (Supplementary Fig. 5A-B). However, when analyzed in MCF-10A cells co-expressing ERBB2 V956R (Supplementary Fig. 5C), the presence of 50 ng/ml of NRG-1 significantly enhanced the survival and proliferation of cells expressing the ERBB3 variants K279N, E332K, N353T, L482P, A676T, and E928G (Fig. 3B). No significant ligand-independent growth advantage was observed for any of the ERBB3 variants in MCF-10A background (Fig. 3B).
To address the ability of the ERBB3 variants to promote focus formation in NIH 3T3 fibroblasts, the cells infected with retroviruses encoding the different ERBB proteins were seeded in 6-well plates and cultured in 3% serum for two weeks. When expressed alone, the ERBB3 variants K279N, E332K, and D797V were able to promote formation of foci that covered significantly more area than foci formed by cells expressing wild-type ERBB3 (Supplementary Fig. 6A-C). However, more potent focus formation was observed when ERBB3 variants were analyzed in cells co-expressing ERBB2 V956R (Fig. 3C-D; Supplementary Fig. 6D), and also a greater number of ERBB3 variants demonstrated activity. Indeed, 9 out of the 12 hits selected for validation showed transforming activity in the focus formation assay, including P212L, Y265C, K279N, K329R, E332K, N353T, L482P, A676T, and E928G (Fig. 3D).
ERBB3 mutations demonstrating activity across different cell backgrounds
Statistical analysis of the growth assays repeated for 3 to 8 times demonstrated that five of the 12 hit variants consistently promoted growth when co-expressed with ERBB2 V956R in all the three cell backgrounds studied (Fig. 4). These were K279N, E332K, N353T, A676T, and E928G. The ERBB3 variant L482P also showed enhanced growth compared to the wild-type cell line in MCF-10A and NIH 3T3 cells but promoted emergence of an IL-3-independent Ba/F3 clone in only one out of eight experiments, not reaching statistical significance. In addition, four variants, P212L, Y265C, K329R, and D797V promoted some activity in two cell backgrounds but reached significance only in one. Only the ERBB3 variants L361P and D1259Y were unable to stimulate growth across all the three cell line models. The mutations were relatively ineffective when expressed alone, with the exception of K279N, E332K, and D797V that induced statistically more colony formation in NIH 3T3 cells also without concomitant ERBB2 V956R overexpression (Supplementary Fig. 6A-B), suggesting that ERBB3 pseudokinase mutants can harbor gain-of-function properties in cells that, unlike the Ba/F3 cells, express other ERBBs endogenously.
Long-read sequencing identifies co-occurring mutations
As the mutation frequency in the mutation library was higher than 1 change per ERBB3 cDNA (i.e. 1.3), the clonal evolution of Ba/F3 cells could favor enrichment of cDNAs that harbor multiple ERBB3 mutations in cis. Relatively weak mutations could also function synergistically to achieve transforming activities. To address whether composite ERBB3 mutations occurred in cis during the clonal evolution in the Ba/F3 screen, the samples analyzed with Illumina NGS – producing reads with the length of approximately 100 bp – were reanalyzed with PacBio High-Fidelity Circular Consensus Sequencing [35] producing reads covering the whole ERBB3 inserts.
Indeed, when sequencing reads coding for any of the 18 variants enriched by ≥25 fold in the Ba/F3 screen were analyzed, only 28.0% of the full-length reads included only a single ERBB3 mutation (Fig. 5A; Supplementary Information). Instead, 40.0% of the samples included another missense ERBB3 mutation, and 32.0% three or more co-occurring mutations. This distribution of single vs. multiple mutations was significantly different from those ERBB3 mutations that were only enriched by 1 to 25 fold in the Illumina NGS analysis (P = 0.015; Pearson’s Chi-squared test) that more frequently occurred in contexts of single variants (Fig. 5A). Of the 12 variants that were functionally validated, only ERBB3 K279N, N353T, and E928G were predominantly found to be present alone, suggesting that the activity of these mutations did not benefit from additional mutations (Fig. 5B-C). Interestingly, these three mutations composed three out of the five mutations with most consistent functional activity in the validation analyses carried out by expression constructs verified to include only single ERBB3 mutations (Fig. 4). The rest of the mutations co-occurred along with one or several other ERBB3 mutations in cis in the same sequencing reads (Fig. 5B-C).
Functional significance of mutation co-occurrence
To test the hypothesis that some of the ERBB3 variants with weak individual transforming potential functioned synergistically with another alteration to gain activity, the following two co-occurring mutation pairs were selected for further analysis: K329R+E332K and L361P+L482P (Fig. 5B-C). To express the double mutations in cis in the same receptor, the mutation pairs were cloned into the same cDNA molecules using 1- or 2-step site-directed mutagenesis. The activity of the double mutations in promoting cell survival and growth was then analyzed in Ba/F3 and MCF-10A cells as above and compared to the respective single mutations. In both models, the double mutant ERBB3 K329R+E332K demonstrated significantly enhanced growth as compared to cells expressing either K329R or E332K alone (Fig. 6). Similar to the analyses of single mutants, the activity was restricted to the context of simultaneous overexpression of ERBB2 V956R (Supplementary Fig. 7 and 8). However, no enhanced growth was observed with cells expressing the double mutant L361P+L482P that, resembling the cells expressing the single mutant ERBB3 L361P, seemed to possess less transforming activity as compared to cells expressing L482P alone (Fig. 6). The L361P+L482P double mutant also phenocopied L361P alone in aberrant migration of the ERBB3 band in western analyses and low or no ligand-stimulated ERBB3 phosphorylation (Supplementary Fig. 7B; Supplementary Fig. 8B-C), consistent with lack of L361P cell surface expression (Supplementary Fig. 4C).
Structural analysis of the ERBB3 variants
The observed ERBB3 mutations were mainly clustered in the ECD region and the recent cryo-EM structures with PDB codes 7MN5 and 7MN6 [36] were used for structural analysis of the region (Fig. 7A). Here, we focus on the structural analysis of the ERBB3 variants that were consistently transforming in all three cellular backgrounds, with the exception of E928G for which previous analyses are available [37,38].
K279 of ERBB3 is located centrally within the globular extracellular region of the heterodimer, located within domain II just after the dimerization arm (β-hairpin loop), at the beginning of the disulphide containing module 5 of ERBB3 (Fig. 7A). This latter module is in direct contact with the dimerization arm of ERBB2 and, although K279 does not interact directly with residues of ERBB2, its aliphatic part of the side chain does interact with the disulfide bridge of module 5. K279 forms a salt bridge and a hydrogen bond with module 4 of domain II (Supplementary Fig. 9A). Modules 2-7 of domain II can be considered to function like a “spring”: stretching the spring could narrow the ligand-binding site and contracting the spring could make it more open (Fig. 7B). The “springs” are sensitive to alterations as demonstrated by the oncogenic mutation S310F of ERBB2 that stabilizes the dimerization arm of ERBB3 (PDB entry 7MN6 [36]). With K279N, close interactions with D252 would take place if modules 4 and 5 of domain II approach each other by an Ångstrom or more. On the other hand, K279N packs against P260 and stabilizing interactions with the main-chain carbonyl atoms of T278 (at the end of the dimerization arm) and V287 (located in module 5) may also take place. Thus, because domain II and structures surrounding K279 are so critical to heterodimer formation and receptor activation, it is certainly likely that the mutant K279N, by maintaining a strong interaction with D252, would affect the function of the ERBB3/ERBB2 complex and any stabilization of the complex that results would accentuate the normal enzymatic and signaling functions.
K329 is located at the beginning of extracellular globular domain III of ERBB3 but is not in direct contact with the closest domains (II and III) of ERBB2 (Fig. 7A, C; Supplementary Fig. 9B). The mutation K329R alone, would increase the hydrogen bonding possibilities due to the long length of the side chain and the guanidinium group but conserves the positive charge. The orientation in ERBB3 suggests that K329R could further stabilize the domain in ERBB3, the guanidinium group having the potential to form two hydrogen bonds to the main-chain oxygen atom of V352 (Fig. 7C).
E332, adjacent to the S-S bridge, is located near K329 but points in the opposite direction and towards the domain II of ERBB3 (Fig. 7A, C). In the native structure, E332 forms a salt bridge with R339. With the E332K mutant, several options are possible: e.g. (1) K332 could form strong hydrogen bonds with S338 (2.5 Å) and G333 (3.6 Å) or (2) a strong electrostatic inter-subunit link connecting K332 of ERBB3 and E280 of ERBB2 could be possible but would need changes in backbone conformations or a likely rigid body movement of the domain III half of the clamp (Fig. 7C). Moreover, in the presence of K329R (double mutant), added strength would further support the structural changes and explain why K329R exerts an effect in concert with the E332K mutation but not on its own (Fig. 7C).
The N353 is one of the ten N-glycosylation sites in ERBB3 [39]. There has been evidence that site-specific glycosylation can have distinct effects [40]. Western blot analysis of cells expressing the ERBB3 variant N353T demonstrates a small size shift of the protein (see e.g. Supplementary Fig. 5B; Supplementary Fig. 6D). The structures (PDB codes 7MN5 and 7MN6 [36]) of the ECD of the ERBB2/ERBB3 heterodimer shows that N353 is glycosylated and the identified mutation to threonine (N353T) would eliminate N-glycosylation (Supplementary Fig. 9C).
The JM region is susceptible to threonine phosphorylation [41]. T678 in EGFR is a conserved site in other ERBB receptors except ERBB3 (A676 for ERBB3) [42] and a mutation in T678 of EGFR has been studied to have substantial effects [43]. The ERBB3 A676T mutant very likely increases the dimer stability based on the analysis of the EGFR and ERBB2 structures and derived models: for example, 1) an intrasubunit hydrogen bond may form between the side-chain oxygen of A676T and the main-chain nitrogen of R679, respectively equivalent to that seen for T678 and R681 in the X-ray structure of EGFR (PDB code 3GOP [41]; Fig. 7D) or 2) an intersubunit hydrogen bond may form between the side chains of A676T and R669 equivalent to that seen between T686 (chain B) and R678 (chain A) in the NMR structure of ERBB2 (PDB code 2N2A [44]; Fig. 7E). Either of these interactions, located close to the plasma membrane in a helix of the juxtamembrane region, have the potential to support the kinase domain dimerization.
Sensitivity of mutant ERBB3 transactivation to ERBB inhibitors
The potential to pharmacologically target the identified ERBB3 variants and to use them as predictive biomarkers for ERBB targeting drugs was assessed in Ba/F3 cells. As ERBB3 variants alone did not promote ERBB3-dependent growth, and as there are no clinically relevant drugs available that would directly block ERBB3, the analyses were carried out in cells co-expressing ERBB2 V956R. The cells were cultured for 72 hours in the presence of different concentrations of the ERBB2 antibodies trastuzumab or pertuzumab, or the second-generation ERBB tyrosine kinase inhibitor neratinib. The two antibodies did not show an effect for any of the cell lines regardless of the ERBB3 variant expressed when the cells were stimulated with 20 ng/ml NRG-1 for the duration of the experiment (Fig. 8A-B; Supplementary Fig. 10A). This was demonstrated by IC50 values remaining at the >10 µg/ml level, similar to treatment of cells without any ERBB3 expression (vector control) that were cultured in the presence of IL-3, consistent with the well-documented role of NRG-1 in promoting resistance to ERBB drugs [19,45–47]. When the cells were cultured in the absence of NRG-1, significantly lower IC50’s were observed (in the 1–100 ng/ml range) but the sensitivity was similar between all ERBB3 variants.
The sensitivity to the TKI neratinib was similar between all ERBB3 variants (Fig. 8C; Supplementary Fig. 10). Interestingly, however, two of the single ERBB3 mutations, P212L and L482P, were completely resistant to neratinib, even when the cells were cultured in the absence of NRG-1 (Fig. 8C; Supplementary Fig. 10B). The P212L and L482P mutations are located respectively on domains II and III, on either side of the NRG-1–binding site, and hence may result in a receptor conformation mimicking the NRG-1 bound state and leading to resistance to neratinib even in the absence of NRG-1. Cells expressing the double mutation L361P+L482P were also an outlier as they responded similarly – and with relatively high IC50 – to neratinib both in the presence and absence of NRG-1, in accordance with the very low level of ERBB3 or ERBB2 phosphorylation (Supplementary Fig. 7B; Supplementary Fig. 8C). Taken together, these observations suggest that all the ERBB3 variants can be targeted in ERBB3/ERBB2 heterodimeric complexes with anti-ERBB2 antibodies, but that the sensitivity to ERBB TKI varies.