Transcriptome-(Phospho)proteome Characterization of the Brain of a Constitutional Model of Cytoplasmic-predominant Pten Expression With Autism-Like Phenotypes

PTEN, a well-studied tumor suppressor, has one of the strongest Mendelian associations with autism spectrum disorder (ASD), representing a special case in autism’s complex genetic architecture. Animal modeling for constitutional Pten mutation creates an opportunity to study how disruption of Pten affects neurobiology, providing insights that may be generalizable or at least inform our understanding of ASD. Although the neural transcriptome has been well characterized in Pten models, little has been done concerning the proteome and phosphoproteome. This is a critical gap in knowledge given that these –omic landscapes are more proximal to the actively observed biology than the transcriptome. analyses of a germline Pten mutation model point to ASD risk genes like Pten and Psd-95 as major hubs in the protein networks, highlighting their important regulatory inuence. Our observations here suggest Pten and Psd-95, known interactors in biological networks in the brain, are critical to either initiation or maintenance of cellular and perhaps organismal phenotypes related to ASD. Future research should explore rescuing Pten and Psd-95 function in attempts to ameliorate neurological pathologies and behavioral abnormalities. the implicated molecules appear to be distinct, which suggests differential modes of regulation across the different biological tiers and/or the fact that multiple seemingly disparate molecules converge on nal common pathways. As such, the molecules that do emerge from our study as major regulatory players, Pten and Psd-95, are strongly associated with autism biology. Our work suggests that a renewed focus on these molecules, specically developing an understanding that would allow for rescue of their important functions in the nervous system, is critical to tackle the clinical challenges presented by ASD.

Background PTEN (Phosphatase and Tensin homolog on chromosome Ten, OMIM #601728), a well-studied, tumor suppressor gene, is known for its canonical role as the major antagonist of PI3K/AKT/mTOR signaling, a crucial growth and survival intracellular cascade, via the lipid modality of PTEN's dual-speci c phosphatase function [1][2][3]. Germline PTEN mutations are considered a rare cancer predisposition mechanism, while also representing one of the strongest Mendelian associations with autism spectrum disorder (ASD) [4,5]. Ostensibly, individuals with germline PTEN mutations present with a diverse array of clinical phenotypes, ranging from high risk of benign and malignant neoplasia to ASD [6].
Next-Generation Sequencing (NGS) technology has gifted biomedical science a tool for exploration of big data, especially for complex diseases like ASD. While there have been several rigorous efforts to explore the transcriptome of ASD [7][8][9][10][11][12][13][14], there has been a limited exploration of the proteome of ASD [15][16][17][18][19]. Little has been done to understand the proteome, let alone characterize the functional effectors of the phosphoproteome in syndromic ASD contexts, such as those arising from Mendelian gene variation, here, germline PTEN mutation. Thus, we believe that animal modeling for PTEN mutation-associated ASD followed by the deployment of proteomic/phosphoproteomic technologies could represent an excellent approach for disentangling the complexity associated with autism pathophysiology.
Given the prior ndings from the Pten m3m4 mouse and the lack of neural (phospho)proteomic data available to interrogate PTEN pathophysiology in the context of ASD, we performed proteomic, phospho-serine/threonine (pS/pT) proteomic, and phospho-tyrosine (pY) proteomic surveys of two-week-old (P14) hemibrains and sixweek-old (P40) cortices of Pten m3m4/m3m4 mice compared to littermate wildtype samples. We hypothesized that disruption of Pten, which has poorly characterized but likely promiscuous protein phosphatase function, would likely perturb the phosphoproteome in ways that contribute to pathophysiology that disrupts synaptic functions and promotes stress or neuroin ammatory processes. Furthermore, we anticipated disruption of the proteome given the previously documented changes in the transcriptome and anticipated relationships between the differential expression in the transcriptome and proteome. Ultimately, our transcriptome-proteomephosphoproteome exploration of the molecular complexity of the Pten m3m4 brain has illuminated the variation within and between different -omic landscapes, identifying important candidate regulators and effectors of PTEN dysfunction in human ASD.

Methods
The Pten m3m4 murine model Our planned experimental procedures were approved by the Cleveland Clinic's Institutional Animal Care and Use Committee (IACUC) under protocol numbers 2018-1952 and 2017-1879 and guided by the Principles of Laboratory Animal Care formulated by the National Society for Medical Research. We created the Pten m3m4 mouse on an outbred CD1 background, which has been studied extensively, shown to have de cits in social behavior, changes in neuron and glia populations, and changes in gene expression in many known autism risk genes [21][22][23]25,26,29]. The Pten m3m4 mutation, located within exon 7 of mouse Pten, consists of ve nucleotide substitutions that results in four nonsynonymous and one synonymous codon changes. The four missense mutations disrupt the third (m3) and fourth (m4) putative nuclear localization-like sequences of Pten [20,21]. We performed mouse genotyping on genomic DNA extracted from toe clippings following Jackson Laboratory's protocol using in-house PCR primers. The wildtype allele primers are mPten-F5, 5′-TGGCAGACTCTTCATTTCTGTGGC-3′, and mPten-R6, 5′-ACTTCTTCACAACCACTTCTTTCAAC-3′, and mutant allele primers are mPten-F3, 5′-TACCCGGTAGAATTTCGACGACCT-3′, and mPten-R6, 5′-ACTTCTTCACAACCACTTCTTTCAAC-3′. Our mice were maintained on a 14:10 light:dark cycle, where access to food and water was ad libitum. We maintained room temperature between 18 and 26°C. Mice were euthanized via CO2 asphyxiation with additional cervical dislocation. None of the experiments were blinded but all were randomized and conducted under protocols approved by the Institutional Animal Care and Use Committee (IACUC) at Cleveland Clinic. In our experiments, we utilized only male mice because the Pten m3m4/m3m4 behavioral phenotypes are most pronounced in males, and the previous RNA-Seq experiment used only male mice [25,26]. A sample size of three biological replicates (N = 3) was used for each genotype and each time point of the study.

Transcriptomic data analysis
The hemibrain and cortical transcriptome of two-week-old and six-week-old Pten m3m4 mice (GSE59318), respectively, shares a similar differential expression (DE) pro le with that of idiopathic autism, as described in detail by Tilot et al [25]. We deployed Ingenuity Pathway Analysis (IPA) software (Qiagen, Redwood City, California) to perform a core analysis on the DE genes from the hemibrain and cortex of Pten homozygous mutant and wild-type mice at both time points (i.e. P14 and P40). We also con rmed and extended this analysis with STRING software (version 11.0; https://string-db.org/). We performed STRING analysis on the same list of DE. These comparisons were of the homozygous mutant compared to the wildtype (i.e., MUT versus WT = MUT/WT Fold Change).

Proteomic scan
Three wildtype and three homozygous mutant two-week-old mouse hemibrains and three wildtype and three homozygous mutant six-week-old cortices were submitted to the Lerner Research Institute (LRI) Proteomics Core. These samples were homogenized in urea lysis buffer (20mM HEPES, 9M urea, 1X HALT phosphatase inhibitor, pH 8.0) with lysis Matrix D beads using FastPrep-24 from MP Biomedical. An aliquot from each tissue lysate was taken out for in-solution tryptic digestion. These samples were solubilized in 50 μL of in-solution digestion buffer, 6M urea, 100 mM Tris pH 8.0, reduced with DTT, alkylated with iodoacetamide, and then diluted to give a nal urea concentration of less than 1 M for digestion. All samples were digested in-solution by adding 0.5 μg trypsin and incubating overnight at room temperature. A second aliquot of trypsin was added and digestion was continued for an additional 6 hours. After digestion, the peptides were desalted and quanti ed using Thermo formic acid gradient at a ow rate of 0.25 μL/min were introduced into the source of the mass spectrometer online. The microelectrospray ion source is operated at 2.5 kV. The digest was analyzed using the data dependent multitask capability of the instrument acquiring full scan mass spectra to determine peptide molecular weights and product ion spectra to determine amino acid sequence in successive instrument scans. The data was analyzed by using all CID spectra collected in the experiment to search the mouse UniProtKB databases with the search program Mascot and Sequest. These search results were then uploaded into the program Scaffold. The quantitation was performed by comparing the normalized spectral counts for these samples. The experimental comparison is between homozygous mutant and wildtype (i.e., MUT versus WT = MUT/WT Fold-Change).
Phospho-serine/threonine proteomic scan Three wildtype and three homozygous mutant two-week-old mouse hemibrain and three wildtype and three homozygous mutant six-week-old cortices were submitted to the Lerner Research Institute (LRI) Proteomics Core.
These samples were homogenized in urea lysis buffer (20mM HEPES, 9M urea, 1X HALT phosphatase inhibitor, pH 8.0) with lysis Matrix D beads using FastPrep-24 from MP Biomedical (Solon, OH). The protein concentration was measured using a Pierce BCA assay kit. The proteins samples were reduced with DTT and alkylated with iodoacetamide. These proteins were then digested with trypsin overnight at room temperature. After digestion, the peptides were puri ed using Sep-Pak C18 spin columns (Waters Corporation, WAT 051910). Prior to enrichment, an 8 pmol aliquot of phosphopeptide standard mix was spiked into each sample. Phosphopeptide enrichment was performed using Pierce TiO2 Phosphopeptide Enrichment Spin Tips according to the manufacturer's instructions. The enriched peptide samples were subjected to C18 clean-up prior to LC-MS/MS analysis. The LC-MS system was a ThermoScienti c Fusion Lumos mass spectrometry system. The HPLC column was a Dionex 15 cm x 75 μm id Acclaim Pepmap C18, 2μm, 100 Å reversed-phase capillary chromatography column. Five μL volumes of the extract were injected and the peptides eluted from the column by an acetonitrile/0.1% formic acid gradient at a ow rate of 0.25 μL/min were introduced into the source of the mass spectrometer on-line. The microelectrospray ion source is operated at 1.9 kV. The digest was analyzed using the data dependent multitask capability of the instrument acquiring full scan mass spectra to determine peptide molecular weights and product ion spectra to determine amino acid sequence in successive instrument scans. The data were analyzed by using all CID spectra collected in the experiment to search the mouse UniProtKB databases with the programs Proteome Discoverer 2.2. The experimental comparison is between homozygous mutant and wildtype (i.e., MUT versus WT = MUT/WT Fold Change).

Phospho-tyrosine proteomic scan
Three wildtype and three homozygous mutant hemibrain and cortical tissue samples from P14 and P40 mice was delivered to the Proteomics Services Group, Cell Signaling Technology (Danvers, MA). These samples were homogenized and subject to protease digestion yielding peptides. These extracted peptides were loaded directly onto a 50 cm x 100 μm PicoFrit capillary column packed with C18 reversed-phase resin. The column was developed with a 90-minute linear gradient of acetonitrile in 0.125% formic acid delivered at 280 nL/min. After

Bioinformatic analysis
In order to assess the biology impacted by the differential expression or differential phosphorylation observed in our various -omic datasets, we employed Ingenuity Pathway Analysis (IPA, www.qiagen.com/ingenuity) to perform core analysis, which provides enrichment scores as p-values for their various outputs (e.g. canonical pathways) determined by one-tailed Fisher's exact test using Ingenuity Knowledge Base as a reference (restricted to only experimentally observed ndings on the mus musculus background). Reported p-values were adjusted for multiple-testing using Benjamini-Hochberg (BH) procedure. We also used STRING software (https://stringdb.org/) to obtain gene ontology, Reactome (https://reactome.org), and KEGG results. The STRING analysis served to both con rm the observation in IPA, which are based on a proprietary database, and to obtain the raw data to pass along to network analysis.

Network analysis
We performed network analysis using Cytoscape 3.7.0 (https://cytoscape.org/) on the interaction data generated by STRING software (https://string-db.org/) on all of our -omic datasets. The STRING analysis generates an association network from an input gene list based on their database of biological relationships, which included several types of evidence: co-expression, text-mining, biochemical/genetic data, and previously curated pathway and protein-complex knowledge. Cytoscape 3.7.0 was utilized to generate the network gures and statistics, such as degree connectivity and betweenness centrality.

Results
Differential expression and phosphorylation in Pten m3m4 brain In order to assess how the m3m4 mutation affects the landscape of gene and protein expression and protein phosphorylation, we performed parallel transcriptomic [25], proteomic, and phosphoproteomic experiments on Pten m3m4/m3m4 hemibrains and cortices at two-weeks-(P14) and six-weeks-of-age (P40), respectively (Fig. 1a). At P14, a total of 3,207 unique proteins, with an average of 2,345 unique proteins per sample, were identi ed (Additional le 1: Table S1), whereas at P40, a total of 6,635 unique proteins, with an average of 5,381 unique proteins per sample, were identi ed (Additional le 1:  Fig. S1A, B). At P14, a total of 4,080 unique peptides were identi ed, 97% of which contained phosphorylated residues in the phosphoserine/threonine scan, an average of 2,239 unique phosphopeptides per sample (Additional le 1: Table S3). At P40, a total of 8,468 unique peptides were identi ed, 90% of which contained phosphorylated residues, with an average of 5,783 unique phosphopeptides per sample (Additional le 1: Table S4). PCA, based on the LFQ abundance values for a given phosphopeptide, at both time points revealed non-overlapping clusters of homozygous mutant and wildtype samples (Additional le 2: Fig. S1C, D). These ndings illustrate robust sampling of the proteome and phosphoproteome, and the captured variability differentiated homozygous mutant and wildtype brain samples well.
In assessing differentially expressed proteins, we found 24 over-expressed and 27 under-expressed proteins in the P14 mutants compared to wildtype controls (Table 1; Fig. 1b). At P40, we identi ed 150 over-expressed and 102 under-expressed proteins in the mutant brains compared to the wildtype brains (Table 1; Fig. 1b). When assessing differential phosphorylation of phospho-serine/threonine phosphopeptides, we found 1 increased and 99 decreased in the P14 mutant hemibrain (Table 1; Fig. 1c). Moreover, we found 113 increased and 185 decreased at P40 in the mutant cortex (Table 1; Fig. 1c). In the special phospho-tyrosine-speci c phospho-scan, we identi ed 39 and 29 relatively increased phosphopeptides at P14 and P40, respectively (Table 1). In the same phospho-tyrosine scan, we found 29 and 26 decreased phosphopeptides at P14 and P40, respectively (Table 1).
Overall, the over-expression/under-expression of proteins or enrichment/depletion of phosphopeptides were distributed roughly equally except in the P14 phospho-serine/threonine scan, where phosphopeptide depletion was heavily favored in the mutant by 99-fold (Table 1). Moreover, there is a general increase in differential expression and phosphorylation as the Pten m3m4/m3m4 mice age, a trend not observed in the phospho-tyrosine scan (Table 1).
Dissimilarity across -omic datasets describing the Pten m3m4 brain Given the eight different -omic datasets spanning two time points, we sought to understand what molecules, showing differential expression/phosphorylation in the homozygous mutant brain, were shared among these datasets. Thus, we performed pairwise comparisons of the molecule lists of each separate -omics approach that compared Pten m3m4/m3m4 to Pten +/+ mice for both P14 and P40 time points. Surprisingly, we found no intersection between the P14 and P40 proteome (Fig. 2a). For the phospho-serine/threonine scan, we found 21 molecules or roughly 7% of all input molecules shared between time points (Fig. 2b). For the phospho-tyrosine scan, we found 25 molecules or roughly 32% of all input molecules shared between time points (Fig. 2c).
Furthermore, 49 molecules were shared between the P14 and P40 transcriptome datasets or roughly 4% of all input molecules (Fig. 2d). These analyses indicate little to marginal overlap within the various -omic scans over developmental time.
Additionally, we assessed overlap among the proteomic and phosphoproteomic datasets within their respective time points. Surprisingly, we found almost no intersection across these datasets (Fig. 2e). We proceeded to assess the overlap between the transcriptome and proteome within each time point and found 1 and 30 intersecting molecules at P14 and P40, respectively (Fig. 2f). Although we expected to nd more shared genes between the transcriptome and proteome, these observations suggest that the mechanisms governing changes in protein expression are separate from those governing gene expression in our model. Moreover, the general lack of overlap among the various datasets suggests that in the context of Pten disruption, there may be little redundancy in modes of dysregulation, meaning that molecules are either subject to dysregulation via expression or phosphorylation but not both.
Differentially expressed/phosphorylated molecules associate with major intracellular signaling cascades, neurological processes, and cancer To understand the biological consequences of the observed changes in protein expression and phosphorylation in the Pten m3m4/m3m4 cortex, we performed an IPA analysis on the proteomic and phosphoproteomic (pooled phospho-serine/threonine and phospho-tyrosine datasets) ndings. The comprehensive summaries for all IPA analyses can be found in the supplemental information (Additional le 3). The top ve most enriched canonical pathways at P14 and P40 for the proteome and phosphoproteome are shown in Fig. 3a and b, respectively. In the Pten m3m4/m3m4 proteome, the top implicated canonical pathways broadly involve phosphoinositide metabolism (i.e., 3'-phosphoinositide Biosynthesis) or neurological pathways, such as GABA Receptor Signaling (Fig. 3a). In the Pten m3m4/m3m4 phosphoproteome, the top implicated canonical pathways generally involve neurological and cancer processes or major cellular signaling cascades: Molecular Mechanisms of Cancer, Synaptic Long Term Potentiation, and cAMP-mediated Signaling (Fig. 3b). Moreover, we extracted the top two salient disease and bio function networks (i.e., the networks comprised of the largest number of differentially expressed/phosphorylated molecules) from the proteome and phosphoproteome at P40. Cancer and Development of Head are the two largest disease and bio function networks identi ed by IPA in the Pten m3m4/m3m4 P40 cortical proteome (Fig. 3c), and Synaptic Transmission and Seizure Disorder are two of the largest salient networks from the Pten m3m4/m3m4 P40 cortical phosphoproteome (Fig. 3d). These enrichment analyses highlight the broad yet related biological perturbations in the neural (phospho)proteome of Pten m3m4/m3m4 mice.
In an effort to understand more about how these biological changes are initiated and maintained, we sought to identify important upstream regulators via network analysis of the differentially expressed proteins. Strikingly, the top protein interaction network identi ed by IPA from the Pten m3m4/m3m4 P40 cortical proteome agnostically positions Pten as the pre-eminent regulatory node via hierarchical ordering. The position of Pten at the top of this network's regulatory architecture establishes its importance and in uence as a regulator of the molecules positioned below (Fig. 3e). When IPA's biological predictions are overlaid on the network using the Molecule Activity Predictor (MAP) tool, the observed decrease in Pten expression predicts an increase in the activity of Pi3k, Akt, Erk, p70 S6, Gfap, and C1qa, while predicting a decrease in P38 Mapk and Creb (Fig. 3e). These predictions are largely consistent with previous observations in the Pten m3m4/m3m4 cortex [21,23,24]. The network analysis enables a biological understanding of the changes in protein expression and phosphorylation, which unequivocally point to disrupted Pten function in the brain and highlight its importance in propagating dysfunction to downstream effector molecules.
Pten and Psd-95 are major regulatory nodes in the Pten m3m4/m3m4 cortex In an effort to expand our biological understanding of the results from the proteomic and phosphoproteomic Pten m3m4/m3m4 versus Pten +/+ comparisons, we performed STRING analyses on each to identify the relationships among the differentially expressed/phosphorylated molecules. We extracted the largest STRING networks from each -omic dataset at each time points and analyzed the network statistics using Cytoscape (Additional le 2: Fig. S2). The most striking network ndings were observed in the P40 Pten m3m4/m3m4 cortical proteome and phosphoproteome. In the largest network (83 nodes) constructed from the P40 Pten m3m4/m3m4 cortical proteome ndings, we found that Pten exhibited the greatest degree of connectivity, 12, and betweenness centrality, 0.66, relative to all other nodes (Fig. 4a). These data independently identify Pten as the most connected (i.e., degree connectivity) and most important node for transmitting information across the network (i.e., betweenness centrality), suggesting that Pten is likely to be the dominant regulatory node affecting the network of differentially expressed proteins in the P40 mutant cortex. Additionally, in the largest network (92 nodes) constructed from the P40 Pten m3m4/m3m4 cortical phospho-serine/threonine scan, we found Psd-95 (also known as Dlg4) exhibited the greatest degree connectivity, 19, and betweenness centrality, 0.68, relative to all other nodes (Fig. 4b). These data identify Psd-95 as the most connected (i.e., highest degree connectivity) and most important node for transmitting information across the network (i.e., highest betweenness centrality), suggesting that Psd-95 is likely to be the dominant regulatory node affecting the network of differentially phosphorylated proteins in the P40 mutant cortex. The Psd-95 nding also implicates Pten given the known and well-described protein-protein relationship between the two [30]. Ultimately, the STRING network analysis implicates Pten and Psd-95 as the likely perpetrators of the proteomic and phosphoproteomic dysregulation observed in the cortex of the Pten m3m4/m3m4 model and possibly responsible for some of the pathological cellular, physiological, and behavioral phenotypes.
Meaningful overlap between differentially expressed or phosphorylated molecules and known ASD risk genes We sought to assess whether the differentially expressed/phosphorylated molecules identi ed by our -omic surveys demonstrate signi cant overlap with known autism risk genes as curated by the Simons Foundation Autism Research Initiative (SFARI). Accordingly, we compared the SFARI catalogue of ASD risk genes with the gene lists of the signi cant results of -omic surveys (time points and phospho-serine/threonine/tyrosine pooled). We found that 41 molecules intersected between the phosphoproteome and SFARI genes, and 12 molecules intersected between the proteome and SFARI (Fig. 5a). This was modest overlap given the number of genes curated by SFARI, but again, STRING network analysis implicated Pten and Psd-95 as central nodes in association networks derived from the intersecting molecules (Fig. 5b, c). The re-emergence of Pten and Psd-95 in a separate network analysis predicated on known ASD risk genes strengthens the evidence that implicates them as potential drivers of the phenotypes observed in Pten m3m4/m3m4 mouse model. The overlap that does exist with the known ASD risk genes underscores the importance of Pten biology to ASD pathophysiology overall.

Discussion
This study represents a proteomic and phosphoproteomic exploration of the brain of a Pten model with strong neurological phenotypes. These experiments demonstrate a distinct developmental difference in protein expression and phosphorylation with differences of greater number, magnitude, and signi cance at P40 versus P14 (Fig. 1). We found little to marginal overlap among the various -omic datasets; interestingly, however, the affected biological pathways across the different datasets were similar, if not overlapping (Fig. 2, 3), suggesting potential convergence of non-overlapping, non-redundant regulatory mechanisms. Furthermore, network analysis highlighted Pten as a critical regulator of the differential expression observed in the neural proteome (Fig. 3e,  4a), and Psd-95 as a critical regulator of differential phosphorylation observed in the neural phosphoproteome ( Fig. 4b). Ultimately, we observed overlap between our agnostic survey's ndings and known ASD risk genes, which when that overlap was subject to network analysis, we found Pten and Psd-95 to again be prominent nodes (Fig. 5).
The lack of overlap among the -omic datasets, especially the Pten m3m4/m3m4 neural proteome and transcriptome, is likely our most surprising nding. Importantly, despite this, we found that they affected nal common pathways. As this is the rst exploration of a transcriptome and proteome simultaneously in a Pten model, it is di cult to comment whether this nding would be typical. The "knee-jerk" conclusion may be that multiple molecules can feed the same pathway. However, it is edifying to note that in mice, the average correlation between the transcriptome and proteome is modest, 27%, which has been shown to be partially explained by alternative splicing [31]. However, our ndings still show less overlap than would be expected based on the aforementioned average. Subsequently, these ndings appear to indicate that gene versus protein expression, at least in our model, are regulated by different mechanisms, and perhaps re ecting no need for redundancy. It is possible that dysregulated transcription and splicing factor expression may be responsible in part for changes in gene expression [25,26], whereas translational mechanisms subsequent to changes in the phosphoproteome, including signaling changes downstream of Pten (e.g. dysregulation of translational control from changes in mTOR activity) may be responsible for the changes in protein expression. The possibility of signi cant changes in translational control in the Pten m3m4/m3m4 brain are real and tantalizing; disruption of proteins involved in translational control, including many in the PI3K/AKT/mTOR cascade, have been recently shown to mediate ASD phenotypes [32]. It is also possible that changes in alternative splicing may affect the differential protein expression that has been observed in the Pten m3m4/m3m4 brain [26]. Furthermore, the lack of overlap between the proteome and phosphoproteome suggests differential modes of regulation at these levels with minimal redundancy. Thus, mechanisms that regulate protein expression appear to be distinct from the mechanisms that regulate phosphorylation the Pten m3m4 brain, and a phenomenon previously observed and described in other models [33]. This differential regulation seems like a reasonable possibility in our model given that it is probable that a portion of the observed changes in the Pten m3m4/m3m4 neural phosphoproteome are more proximal to the dysfunction introduced by the m3m4 mutation than the differential gene/protein expression. This is likely due to the primary functional role of Pten as a dual-speci city phosphatase with possibly a large array of diverse protein substrates [34][35][36][37][38][39][40][41][42]. Additional biochemical experiments and expanded -omic surveys on this model and other various neural Pten models will be required to disentangle the complexity at the various biological levels.
Beyond the lack of overlap between transcriptome and proteomes, we found the continued emergence of Pten and Psd-95 in the proteome and phosphoproteome pathway and network analysis to be a crucial indicator of their importance to the underlying pathology of the Pten m3m4/m3m4 brain and potentially to ASD in general. This postulate appears to be supported by one of the only two proteomic analyses of post-mortem ASD brains that identi ed Psd-95 as a major hub protein in the Brodmann area 19 (BA19) [17]. Although their nding was in the proteome, whereas ours was in the phosphoproteome, the shared nding is striking, and it is also possible that the effect of the perturbations to Psd-95 have similar biological effects in both contexts. There were other hits that our study shared with that of Abraham and colleagues (e.g. MAPT), suggesting proteomic approaches to ASD may uncover shared pathological hallmarks.
The picture of the broader changes in the (phospho)proteome of the Pten m3m4/m3m4 brain was largely aligned with our hypotheses, and in many ways, consistent with the overall pathophysiological changes we observed previously in the Pten m3m4/m3m4 neural transcriptome [25]. Although many of the speci c molecules and pathways found to be dysregulated or enriched, respectively, are different across the various datasets, there is a convergence on biological pathways downstream of Pten functions and disease processes that manifest in individuals with germline PTEN mutations, such as neurodevelopmental abnormalities and cancer (Additional le 3). Interestingly, the co-occurrence of neurodevelopmental and cancer pathways arising from study of Pten m3m4/m3m4 brain underscores how closely the biology of these two seemingly disparate disease processes are and that an important nexus between them is PTEN. Our detailed description of the affected pathways and processes at the (phospho)proteomic levels appear to suggest that there may be convergent mechanisms at play in autism and that the apparent complexity might be reduced to a few central molecules and pathways despite a complex phenotype.

Limitations
The Pten m3m4 model is excellent for the purposes of this study, but it is quite unique, and our study is limited to this model. Thus, our observations cannot necessarily be extended to all mutations or disruptions of Pten despite some of the evidence here and in the literature that support generalizability. In fact, it is quite likely that at least part of the expression and phosphorylation changes are speci c to the m3m4 model due to the homozygosity of the mutation and cytoplasmic predominance of Pten expression -both features that are unique to our model and rarely, if ever, modeled in other studies. Furthermore, our study would have ostensibly been more sensitive to changes in the proteome, phosphoproteome, and transcriptome with a larger sample size.
Although the PCA ndings cluster Pten m3m4/m3m4 and Pten +/+ separately, there is still enough variation within each genotype to indicate that our study would likely nd more signi cant hits, if sample size expanded. Subsequently, we have likely only uncovered the most differentially expressed/phosphorylated molecules, but there may remain important subtle changes in major regulators that were missed.

Conclusions
In sum, our study represents an important entry into understanding the proteome and phosphoproteome in a Pten model with well described neurological pathologies and behavioral de cits. Given the current lack of knowledge of what the proteomic and phosphoproteomic landscapes should look like in the context PTEN-ASD or even idiopathic ASD, this study is a step toward lling this critical knowledge gap. The observed changes in protein expression and phosphorylation appear to affect pathways and processes important in the nervous system (e.g. neurodevelopment and synaptic function) and neuroin ammation. Despite the shared biological signature across the various -omic datasets, the implicated molecules appear to be distinct, which suggests differential modes of regulation across the different biological tiers and/or the fact that multiple seemingly disparate molecules converge on nal common pathways. As such, the molecules that do emerge from our study as major regulatory players, Pten and Psd-95, are strongly associated with autism biology. Our work suggests that a renewed focus on these molecules, speci cally developing an understanding that would allow for rescue of their important functions in the nervous system, is critical to tackle the clinical challenges presented by ASD.