Shared genetic architecture and causality between autism spectrum disorder and irritable bowel syndrome, pain, and fatigue

doi:10.21203/rs.3.rs-3223927/v1

Download PDF

Article

Shared genetic architecture and causality between autism spectrum disorder and irritable bowel syndrome, pain, and fatigue

https://doi.org/10.21203/rs.3.rs-3223927/v1

This work is licensed under a CC BY 4.0 License

You are reading this latest preprint version

Autism spectrum disorder (ASD) often co-occurs with functional somatic syndromes (FSS), such as irritable bowel syndrome (IBS), pain, and fatigue. However, the underlying genetic mechanisms and causality have not been well studied. Using large-scale genome-wide association study (GWAS) data, we investigated the shared genetic architecture and causality between ASD and FSS. Specifically, we first estimated genetic correlations and then conducted a multi-trait analysis of GWAS (MTAG) to detect potential novel genetic variants for single traits. Afterwards, polygenic risk scores (PRS) of ASD were derived from GWAS and MTAG to examine the associations with phenotypes in the large Dutch Lifelines cohort. Finally, we performed Mendelian randomization (MR) to evaluate the causality. We observed positive genetic correlations between ASD and FSS (IBS: r_g = 0.27, adjusted p = 2.04×10^− 7; pain: r_g = 0.13, adjusted p = 1.10×10^− 3; fatigue: r_g = 0.33, adjusted p = 5.21×10^− 9). Leveraging these genetic correlations, we identified 4 novel genome-wide significant independent loci for ASD by conducting MTAG, including NEDD4L, MFHAS1, RP11-10A14.4, and C8orf74. PRS of ASD derived from both GWAS and MTAG were associated with ASD and FSS symptoms in Lifelines, and MTAG-derived PRS showed a bigger effect size, larger explained variance, and smaller p-values. We did not observe significant causality using MR. Our study provided new evidence of shared genetic architecture between ASD and FSS, specifically with IBS, pain, and fatigue. The findings confirm the genetic associations between ASD and FSS, and advance our understanding of the mechanisms underlying co-occurrence.

Health sciences/Diseases/Psychiatric disorders/Autism spectrum disorders

Biological sciences/Neuroscience/Molecular neuroscience

Biological sciences/Genetics/Genomics

Autism spectrum disorder (ASD) is a common neurodevelopmental disorder affecting 1-1.5% of individuals ^{1, 2}. It is characterized by social-communication difficulties and restricted repetitive patterns of interests or behaviors. The symptoms of ASD usually appear in childhood and persist throughout life ³. Genetic factors may explain most of the risk for ASD with a heritability of higher than 80% for ASD estimated by twin and family studies ⁴. The most recent genome-wide association study (GWAS) of ASD (N = 46,350) has identified 3 susceptibility loci, explaining 2.45% of the variance in ASD ².

ASD is often comorbid with functional somatic syndromes (FSS), which exacerbates disparities in quality of life ⁵. FSS are characterized by the presence of one or multiple chronic symptoms that cannot be fully attributed to a known somatic disease, such as irritable bowel syndrome (IBS), pain, and fatigue ⁶. Similar to ASD, IBS, pain, and fatigue are also complex traits and genetic factors make substantial contributions to the etiology ^7–11 with a heritability of up to 57% ^12–14. Recent GWASs have identified 89, 1748, and 1 single-nucleotide polymorphisms (SNPs) for IBS, pain and fatigue, respectively ^15–17.

Epidemiological studies demonstrate associations between ASD and FSS ¹⁸. Previously, we investigated comorbid somatic problems of ASD among 35,048 adults from the Netherlands. The results showed that adults with more ASD symptoms had more FSS including IBS, pain, and fatigue. Besides, a meta-analysis reported that the odds ratio (OR) for ASD individuals having abdominal pain was 2.45 compared with the non-ASD group ¹⁹. As ASD and FSS are moderately to highly heritable, their comorbidity observed from epidemiological studies may suggest genetic overlap between ASD and FSS. Indeed, previous studies showed significant genetic correlations between ASD and some FSS. For instance, a genetic correlation (r_g) of 0.33 between ASD and tiredness has been reported ². As for the genetic correlation between ASD and pain, mixed results exist probably due to different measurements of pain. A positive genetic correlation was found between abdominal pain and ASD ²⁰, while a negative genetic correlation of − 0.1 was reported between ASD and multisite chronic pain ¹⁶.

Two mechanisms may explain genetic correlations between ASD and FSS. One mechanism is horizontal pleiotropy, i.e. the genetic variants contribute directly to risk of both ASD and FSS or indirectly through an intermediate phenotype, indicating shared biological processes between ASD and FSS. The other mechanism is vertical pleiotropy, i.e. there is a causal relationship between ASD and FSS where ASD itself leads to an increased risk of FSS or vice versa ²¹.

Leveraging genetic correlations between ASD and FSS, analyses can be conducted that could improve power to potentially detect new genetic variants for ASD and FSS, improve genetic risk prediction, and make inferences on causality with the advances in genetic methods using GWAS summary statistics. For example, multi-trait analysis of GWAS (MTAG) enables joint analysis of multiple related traits, therefore boosting statistical power to detect novel genetic loci for each trait and increase risk prediction ²². Also, a polygenic risk score (PRS) of a trait could be calculated in independent genotyped samples to explore its associations with genetically correlated phenotypes ²³. Besides, Mendelian randomization (MR) analyses are considered a promising way to explore potential causality of one phenotype on the other (and vice versa) using genetic variants as instrument variables (IV) ²⁴.

Some of these methods have been applied previously to investigate the genetic overlap between ASD and pain. For example, one study investigated causality between pain and neuropsychiatric disorders including ASD using MR, but did not find a causal relationship between pain and ASD ²⁰. Another study found that the PRS of ASD was associated with low pain tolerance, suggesting that a higher genetic risk of ASD might be associated with more severe pain symptoms ²⁵. However, the underlying shared genetic architecture between ASD and FSS has yet to be well elucidated and little is known about causality between ASD, IBS, and fatigue.

To address these gaps, we aimed to probe into the shared genetic etiology between ASD and FSS, specifically with respect to IBS, pain, and fatigue. Specifically, we first estimated the genetic correlations of ASD with IBS, pain, and fatigue, and then conducted a MTAG leveraging the correlations to detect potentially novel genetic variants for each single trait. Afterwards, PRS of ASD derived from GWAS and MTAG were calculated to examine the associations with FSS phenotypes in the large Dutch Lifelines cohort. Finally, we explored causal relationships between ASD and FSS using MR. The results may help to unravel the shared genetic architecture and better understand shared biological pathways and/or causality underlying comorbid ASD and FSS conditions.

1.1 Study design and data summary

The overall study design is shown in Fig. 1. We used summary statistics retrieved from publicly available GWASs, ASD from the Psychiatric Genomics Consortium (N_case/N_control = 18,381/27,969) ², IBS from the UK BioBank (N_case/N_control = 53,400/433,201) ¹⁵, multisite chronic pain from the UK BioBank (N = 387,649) ¹⁶, and fatigue from the UK BioBank (N = 108,976) ¹⁷. Polygenic risk score analyses were conducted in the large Dutch Lifelines cohort.

1.2 Estimation of genetic correlations

We performed genome-wide genetic correlation analyses for ASD and FSS including IBS, pain, and fatigue, respectively. First, global genetic correlations (r_g) were estimated from GWAS summary statistics by linkage disequilibrium score regression (LDSC) ²⁶. Global genetic correlation refers to the correlation of genetic effects between two related traits throughout the genome, highlighting the degree of pleiotropy and genetic overlap ^{26, 27}. We filtered included genetic variants using the following criteria: imputation score (INFO) > 0.9 and minor allele frequency (MAF) > 0.01. Indels, strand-ambiguous SNPs, and SNPs with duplicated rs numbers were removed. We adjusted p-values for multiple testing by the false discovery rate (FDR) using the Benjamini-Hochberg procedure and a p < 0.05 was considered statistically significant.

The global genetic correlation describes the average effect of pleiotropy across all causal loci, but the underlying architecture of correlations at individual loci can vary ²¹. To gain better insight into genetic correlation at the locus level (within genome-wide partitioned LD blocks), local genetic correlations between ASD and FSS were calculated by the Local Analysis of coVariant Association (LAVA) tool. Unlike the global correlation analysis, the local approach enables us to identify and estimate genetic correlation in specific genomic regions, thereby providing insights into their local effects and shared genetic basis. The method has been detailed elsewhere ²⁸. Briefly, 2495 semi-independent LD blocks were defined using European-ancestry 1000 Genome data ²⁸. Then we conducted univariate association analyses on the local genetic signal of each trait. Regions that showed significant univariate associations for more than one trait were selected to test for local genetic correlation. For this purpose, we used a relatively lenient p-value threshold of 0.05. Finally, pairwise bivariate local genetic correlation analyses across the selected regions were estimated between ASD, IBS, pain, and fatigue. For the correlation analyses, we adjusted p-values for multiple testing by FDR using the Benjamini-Hochberg procedure and a p < 0.05 was considered statistically significant.

1.3 Multi-trait analysis of GWAS

Because of the putative genetic correlations between ASD and FSS, the multi-trait analysis of GWAS (MTAG) was conducted to perform the meta-analysis of the four traits ²². MTAG leverages the genetic correlations between multiple traits to generate trait-specific effect estimates for each SNP using GWAS summary statistics. As the authors describe, MTAG is robust to potential sample overlap between different GWASs. It improves the effect estimates of each trait and boosts statistical power to detect genetic associations Therefore, it can also improve the power of PRS ²². We filtered included genetic variants using default MTAG parameters ²⁹. Briefly, variants were restricted to those common to all four traits of the GWAS, with a MAF > 0.01. For each trait, we calculated the 90th percentile of the SNP sample-size distribution. All SNPs with a sample size below 75% of this calculated value were removed.

To identify genomic risk loci from the MTAG results, functional Mapping and Annotation (FUMA) analyses were conducted ³⁰. SNPs with p-value < 5 × 10^− 8 were genome-wide significant. Lead SNPs were LD-pruned based on a 250 kilobase (kb) window and r² < 0.1 using the 1000 Genome European population as a LD reference panel. Lead SNPs which were closer than 250 kb were merged into one genomic risk locus. ANNOVAR employed in FUMA was used to map SNPs to genes ³¹. A locus that was not found to be significant in previous GWASs and was not in LD with a previous loci in GWASs (r² < 0.1 within 1 megabase (Mb) window) for the phenotype was labeled as ‘novel.’

1.4 PRS analyses

To validate the shared architecture between ASD and FSS, PRS analyses were conducted in the Lifelines Cohort, a large and multi-generational population-based cohort study starting in 2006 ^{32, 33}. Lifelines assesses biomedical, socio-demographic, behavioural, physical, and psychological factors related to the health and disease of over 167,000 residents living in the three northern provinces of the Netherlands. This cohort is broadly representative of socioeconomic characteristics, diseases, and general health of the population in the north of the Netherlands ³⁴. The participants were recruited between 2006 and 2013 at baseline and followed up from 2014 onwards. The Lifelines Cohort Study was conducted according to the principles of the Declaration of Helsinki and approved by the ethics committee of the University Medical Centre Groningen. All participants signed an informed consent form ³⁵.

For the current study, we included 14,134 participants with European ancestry from the Lifelines cohort with genetic data and our phenotypic data of interest. DNA samples were genotyped using the Illumina Global Screening Array (N = 10,758) and Illumina CytoSNP12v2 array (N = 3,376). After quality control, both genotyping datasets were then imputed at the Sanger imputation server using the Haplotype Reference Consortium panel r1.1 ³⁶. Details of genotyping, quality control, and imputation in Lifelines have been published elsewhere ³⁷. Two PRSs of ASD were calculated in the Lifelines samples based on the summary statistics of ASD GWAS and ASD MTAG results, respectively. SNPs of low quality were removed from summary statistics (INFO ≤ 0.8, MAF ≤ 0.01, strand-ambiguous or duplicated SNPs). To obtain an independent set of SNPs, the LD-driven clumping procedure was performed in PLINK version 1.90 (r² < 0.1, kb = 250) using the 503 European samples from 1000 Genome projects as reference. Then weighted PRSs were calculated as the sum of risk allele dosages weighted by the corresponding effect size estimated in the GWAS results. PRSs were constructed across a range of p-value thresholds (5 × 10^− 8, 5 × 10^− 7, 5 × 10^− 6, 5 × 10^− 5, 5 × 10^− 4, 5 × 10^− 3, 0.05, 0.5, 1) and were then standardized using z-score transformations. Thereafter, we conducted a principal component analysis (PCA) on the resulting PRSs and used the first principal component as the final ASD PRS in subsequent association analyses ³⁸. This approach uses PCA to reweight the SNPs included in the PRS to achieve maximum variation across all the p-value thresholds. It avoids choosing one optimal p-value threshold, therefore controlling type-1 error and preventing overfitting.

Linear regression models were applied to examine the associations between ASD PRSs and phenotypes of ASD, IBS, pain, and fatigue, adjusted for age, sex, arrays, and 10 genetic principal components that correct population structure. Measurements of the phenotypes were described in the supplementary material. PRS and phenotype measures were standardized to Z-scores (mean = 0 and standard deviation = 1) to facilitate interpretation. We adjusted p-values for multiple testing by FDR using the Benjamini-Hochberg procedure and a p < 0.05 was considered statistically significant.

1.5 Bidirectional Mendelian randomization analyses

Bidirectional two-sample MR was conducted to infer putative causal relationships between ASD and FSS from GWAS summary statistics. MR estimates the causal effects of the exposure on the outcome using genetic variants as IVs ²⁴. We performed the method of random-effect inverse variance weighted (IVW) as the main analysis, and four complementary methods as sensitivity analyses, including MR Egger, weighted median, and median-based methods (simple and weighted) ³⁹. IVs were selected based on association statistics of the exposure GWAS summary statistics (Stable 1). First, genome-wide significant SNPs (p < 5×10^− 8) were selected. For SNPs that were not available in outcome GWASs, a proxy SNP identified in high-LD (r² > 0.7) was used. If the proxies were not available (e.g., the only one significant SNP in the GWAS of fatigue is 1:64178756_C_T and it does not have an rs id or proxy), we chose SNPs that have a p-value less than 5×10^− 7. Second, independent genetic variants were selected by LD clumping procedure at r² < 0.001 within 10 Mb. SNP alleles were harmonized between exposure and outcome summary statistics and palindromic SNPs with intermediate allele frequencies (0.42–0.58) were discarded. The Steiger filtering method was applied to remove potential invalid IVs that explain more variance in the outcome than the exposure ⁴⁰. To evaluate horizontal pleiotropy, heterogeneity tests by MR-Egger and IVW methods and Egger-intercept tests were applied ⁴¹. MR analyses were performed using the TwoSampleMR (version 0.5.6) package for R (version 4.0.5) ^{40, 42}. We adjusted p-values for multiple testing by FDR using the Benjamini-Hochberg procedure and a p < 0.05 was considered statistically significant.

1.1 Genetic correlation

We found significant genetic correlations between all the phenotype pairs. Regarding ASD, there was a positive genetic correlation with IBS (r_g = 0.27, adjusted p = 2.04 × 10^− 7), pain (r_g = 0.13, adjusted p = 1.10 × 10^− 3), and fatigue (r_g = 0.33, adjusted p = 5.21 × 10^− 9) (Table 1). Additionally, we detected 21 positive local genetic correlations among the four phenotypes (Stable 2). Regarding ASD, one was with pain: chromosome 11: 75.45–76.52 Mb (r_{g, local} = 0.61, adjusted p = 0.044), and two were with IBS: (1) chromosome 13: 53.34–54.68 Mb (r_{g, local} = 0.78, adjusted p = 0.019) and (3) chromosome 16: 53.39–54.87 Mb (r_{g, local} = 0.56, adjusted p = 0.036).

Table 1

Global genetic correlations of ASD and FSS
Phenotype 1	Phenotype 2	r_g	SE	p	Adjusted p^a
ASD	IBS	0.27	0.05	1.70×10^− 7	2.04×10^− 7
	pain	0.13	0.04	1.10×10^− 3	1.10×10^− 3
	fatigue	0.33	0.06	3.47×10^− 9	5.21×10^− 9
IBS	pain	0.59	0.03	4.79×10^− 100	2.87×10^− 99
	fatigue	0.52	0.04	6.53×10^− 31	1.31×10^− 30
pain	fatigue	0.68	0.03	2.33×10^− 86	3.99×10^− 86
Note: ASD: autism spectrum disorder; FSS: functional somatic syndrome; r_g: genetic correlation; SE: standard error; IBS: irritable bowel syndrome.
^aadjusted p: adjust p-values were calculated by false discovery rate using the Benjamini-Hochberg procedure

1.2 Multi-trait analysis of GWAS

Given the significant genetic correlations between ASD and FSS, we improved the overall statistical power to detect novel loci by conducting MTAG analysis. A total of 6,506,975 overlapping SNPs were used in MTAG. After MTAG analysis, the number of lead SNPs in the GWAS result of ASD increased from 3 to 5, and the distribution of locations was more extensive (from chromosomes 8, 20 to chromosomes 5, 8, 18, 20) (Table 2, 3, Fig. 2). Similarly, the number of lead SNPs for FSS phenotypes also increased and the distributions of locations were more extensive (Table 2, Fig. 2).

We identified 3 novel genome-wide significant independent loci for ASD by MTAG (Table 3). Among the 3 novel loci, 2 of them were mapped to protein-coding genes. In these 2 loci, the stronger signal was observed on chromosome 18 at the NEDD4L region (lead SNP rs63615960, p = 1.81 × 10^− 8). The other signal was observed on chromosome 8 mapped to the MFHAS1 and RP11-10A14.4 region (lead SNP rs12547493, p = 3.32 × 10^− 8).

Table 2

Statistical summary of GWAS and MTAG results
	ASD		IBS		pain		fatigue
	GWAS	MTAG	GWAS	MTAG	GWAS	MTAG	GWAS	MTAG
Significant SNPs	93	127	89	793	1748	1908	1	1084
Lead SNPs	3	5	6	19	50	52	0	21
Genomic locus	3	5	6	19	46	48	0	21
Note: GWAS: Genome-wide association studies; MTAG: Multi-Trait Analysis of GWAS; ASD: autism spectrum disorder; IBS: irritable bowel syndrome; SNP: single nucleotide polymorphism

Table 3

Genome-wide significant loci associated with ASD
Genomic locus	rsID	Chromosome	Position	P value in GWAS	P value in MTAG	Start of position	End of position	Mapped genes
GWAS
1	rs910805	20	21248116	2.04×10^− 9	2.21×10^− 9	20118473	22105020	XRN2, NKX2-4
2	rs10099100^a	8	10576775	1.07×10^− 8	NA^a	10571591	10583506	RP1L1, SOX7
3	rs71190156^b	20	14836243	2.75×10^− 8	NA^b	14697386	14836243	MACROD2,
MTAG
1	rs910805^c	20	21248116	2.04×10^− 9	2.21×10^− 9	21117840	21474473	XRN2, NKX2-4
2	rs60410697^d	8	10572617	6.91×10^− 8	2.73×10^− 8	10566591	10802146	RP1L1, C8orf74, SOX7
3	rs416223	5	103991476	3.84×10^− 7	2.44×10^− 8	103791044	104082179	-
4	rs63615960	18	55881401	1.60×10^− 6	1.81×10^− 8	55855025	55898199	NEDD4L
5	rs12547493	8	8661534	4.68×10^− 6	3.32×10^− 8	8310508	9033744	MFHAS1, RP11-10A14.4
Note: ASD: autism spectrum disorder; GWAS: Genome-wide association studies; MTAG: Multi-Trait Analysis of GWAS

^aNot available. The lead SNP in GWAS was removed from the MTAG analysis due to strand ambiguity.

^bNot available. The lead SNP in GWAS was removed from the MTAG analysis because it did not overlap among the four phenotypes.

^cThe same lead SNP as found in GWAS

^dIn linkage disequilibrium with SNP rs10099100 found in GWAS (r² = 0.83)

1.3 Associations between PRSs of ASD and phenotypes

Sample characteristics of the total sample are shown in Stable 3. Table 4 shows the results of the regression models examining the associations between GWAS and MTAG PRSs of ASD and phenotypes of ASD and FSS in Lifelines. We found that the effects of both ASD PRSs on phenotypes of ASD and FSS were positive, indicating that higher PRS of ASD was associated with more severe ASD and FSS. For PRS derived from ASD GWAS, each unit (equal to 1 SD) increase in PRS of ASD was associated with 1.12 times higher odds of having IBS (for IBS: OR = 1.12, 95% CI: (1.03, 1.21). Similarly, each unit (equal to 1 SD) increase in PRS of ASD was associated with 0.02 units increase in ASD, pain, and fatigue (for ASD, b = 0.023, 95% CI: (0.006, 0.039); for pain, b = 0.020, 95% CI: (0.002, 0.038); for fatigue, b = 0.024, 95% CI: (0.006, 0.042)). For PRS derived from MTAG result of ASD, the associations were more pronounced with larger b and explained variance, and smaller p-values, compared with PRS derived from ASD GWAS (Table 4).

Table 4

PRS effect on phenotypes
Outcomes	ASD PRS calculated from GWAS results				ASD PRS calculated from MTAG results
	Effect size [95% CI]	p	Adjusted p^a	Adjusted R² (%)	Effect size [95% CI]	p	Adjusted p^a	Adjusted R² (%)
ASD	0.023	0.007^**	0.009^**	1.9	0.041	< .001^***	< .001^***	2.0
(b [95% CI])	[0.006, 0.039]				[0.025, 0.058]
IBS	1.12	0.006^**	0.009^**	3.0^b	1.2	< .001^***	< .001^***	3.2^b
(OR [95% CI])	[1.03, 1.21]				[1.11, 1.31]
pain	0.020	0.032^*	0.032^*	2.6	0.053	< .001^***	< .001^***	2.8
(b [95% CI])	[0.002, 0.038]				[0.035, 0.071]
fatigue	0.024	0.008^**	0.009^**	2.5	0.059	< .001^***	< .001^***	2.8
(b [95% CI])	[0.006, 0.042]				[0.041, 0.077]
Note: PRS: polygenic risk score; ASD: autism spectrum disorder; GWAS: Genome-wide association studies; MTAG: Multi-Trait Analysis of GWAS; IBS: irritable bowel syndrome; CI: confidence interval; OR: odds ratio
^aadjusted p: adjust p-values were calculated by false discovery rate using the Benjamini-Hochberg procedure
^bMcFadden’s R squared
^p < 0.05, ^p < 0.01, ^**p < 0.001

1.4 Bidirectional MR

We did not observe significant causal relationships between ASD and FSS traits (Table 5). Sensitivity analyses using different MR methods yielded similar non-significant results. In forward MR, we extracted 3 IVs to examine the effect of ASD on FSS; in reverse MR, we extracted 4 IVs for IBS, 33 IVs for pain, and 3 IVs for fatigue to examine the effects of FSS on ASD. The included IVs are presented in the supplementary materials (Stable 3). The results of Egger-intercept tests and almost all heterogeneity analyses were not significant, suggesting a low risk of bias caused by horizontal pleiotropy (Stable 4, 5).

Table 5

The bidirectional MR analysis of ASD and FSS phenotypes
Exposure	Outcome	MR method	Forward					Reverse
Exposure			Number of IVs	beta	SE	p	Adjusted p^a	Number of IVs	beta	SE	p	Adjusted p^a
ASD	IBS	IVW	3	0.066	0.05	0.194	0.470	4	0.279	0.17	0.110	0.470
		MR Egger	3	-0.245	0.45	0.681	0.817	4	-0.006	2.97	0.999	0.999
		Weighted median	3	0.078	0.06	0.205	0.470	4	0.376	0.20	0.065	0.470
		Simple mode	3	0.103	0.08	0.336	0.504	4	0.418	0.30	0.260	0.470
		Weighted mode	3	0.104	0.09	0.356	0.509	4	0.421	0.28	0.232	0.470
	pain	IVW	3	0.002	0.03	0.954	0.987	33	0.253	0.19	0.178	0.470
		MR Egger	3	0.372	0.15	0.245	0.470	33	0.156	0.97	0.874	0.936
		Weighted median	3	-0.008	0.03	0.773	0.882	33	0.390	0.24	0.109	0.470
		Simple mode	3	-0.031	0.04	0.479	0.653	33	0.609	0.54	0.268	0.470
		Weighted mode	3	-0.025	0.04	0.560	0.700	33	0.539	0.53	0.321	0.504
	fatigue	IVW	3	0.017	0.07	0.794	0.882	3	0.816	0.42	0.050	0.470
		MR Egger	3	0.746	0.27	0.218	0.470	3	-1.581	1.69	0.520	0.678
		Weighted median	3	0.063	0.04	0.129	0.470	3	1.023	0.51	0.046	0.470
		Simple mode	3	0.074	0.05	0.282	0.470	3	1.146	0.64	0.214	0.470
		Weighted mode	3	0.074	0.05	0.254	0.470	3	1.146	0.63	0.209	0.470
Note: MR: Mendelian randomization; ASD: autism spectrum disorder; FSS: functional somatic syndrome; IV: instrumental variable; SE: standard error; IBS: irritable bowel syndrome; IVW: inverse variance weighted

^aadjusted p: adjust p-values were calculated by false discovery rate using the Benjamini-Hochberg procedure.

This study comprehensively investigated shared genetic architecture and causality between ASD and FSS using different genetic analyses and findings contribute to a better understanding of mechanisms underlying the co-occurrence between ASD and FSS. The results showed that ASD had positive genetic correlations with IBS, pain, and fatigue. Leveraging these genetic correlations, we identified previously unknown risk loci for ASD by conducting MTAG, together with the creation of a more powerful PRS that improved the prediction of phenotypes of ASD and FSS in an independent European sample. We did not find a significant causal relationship between ASD and FSS by MR.

Converging with previous evidence supporting the phenotypic co-occurrence of ASD and FSS ^{18–20, 25}, we found positive genetic correlations between them. Moreover, we detected 3 positive genetic correlations at locus level, which gives a closer insight into underlying shared genetic architecture between ASD and FSS. Based on these genetic correlations, we identified novel genetic variants and mapped genes of ASD using MTAG, pointing to shared genetic mechanisms of co-occurrence. We highlight the potentially interesting functions of MFHAS1 and NEDD4L. MFHAS1 is known to be involved in the regulation of inflammation ^{43, 44}. Neuroinflammation and altered inflammatory responses have been established as key mechanisms in the development and maintenance of ASD ^{45, 46}. Gut inflammation is also considered a key factor of comorbid gastrointestinal disorders in ASD ¹⁸. Additionally, an animal study suggested that MFHAS1 was related to cognitive impairment, which is strongly linked to ASD ^{47, 48}. Another novel gene is NEDD4L, which is known as an epilepsy-associated gene by mediating neuronal circuit activity ⁴⁹. It was also identified from single-cell sequencing of autism cortical tissue, suggesting it may be modulated as a downstream target in ASD ^{50, 51}. Besides, NEDD4L is associated with sensitivity to inflammatory pain by regulating nociceptive sensations ^{52, 53}. Dysfunction of NEDD4L can cause the occurrence of neuropathic pain ⁵⁴. Our findings show that the MTAG analysis improves the power to identify previously unknown risk genes involved in the shared genetic basis between ASD and FSS, and advances our understanding of the co-occurrence mechanisms.

Our results showed that PRSs of ASD were associated with ASD and FSS severity, confirming its predictive ability. One previous study reported the association between PRS of ASD and ASD phenotype at age 6 years ^{7, 8}. The proportion of explained variation was 2.4%, which was comparable with our results (adjusted R² = 1.9%). We added to the literature that PRS of ASD is also useful for evaluating individuals’ risk or severity of FSS conditions. Consistent with our results, Louis et.al. found that PRS of ASD was associated with sensory processing issues ²⁵. Their findings showed that higher polygenic loading of ASD was related to lower pain tolerance, thus possibly leading to more severe pain symptoms ²⁵. These previous studies calculated ASD PRS using GWAS of ASD, but our study showed that the MTAG-derived PRS of ASD had better predictive ability for both ASD and FSS compared with GWAS-derived PRS. Integrating genetic risk variants of both ASD and FSS by MTAG yielded more powerful PRS, thus improving genetic risk prediction and stratification.

Our study did not find any evidence supporting a causal relationship between ASD and FSS. This is in line with an earlier study that did not find a causal relationship between pain and ASD using the MR method and the same GWASs ²⁰. To the best of our knowledge, causality of ASD in relation to IBS and fatigue have not been explored using GWAS data in the literature. A potential reason for the lack of evidence for causality in our study could be the limited power of the GWASs of particularly ASD and fatigue, given insufficient independent significant SNPs in the GWAS (7 for ASD and 0 for fatigue). To solve the problem, we used a relatively lenient p-value threshold of 5 × 10^− 7 for the IV of ASD and fatigue. However, it still resulted in limited IVs (3 for ASD and 3 for fatigue). Clearly, additional, more strongly powered GWASs of ASD and FSS with more IVs are warranted for future studies to examine the causality using GWAS.

Our results should be interpreted in light of several limitations. First, although the novel genetic variants identified by MTAG revealed some potential mechanisms of co-occurrence of ASD and FSS, they still need to be replicated in future studies to avoid false positives. Second, although we used GWAS summary statistics with the largest sample sizes up to date, the sample sizes were still relatively small, which limits the power of PRS and MR. Third, our study focused on individuals of European ancestry. The genetic relationship between ASD and FSS in other populations remain to be studied.

Our study provided new evidence for shared genetic etiology between ASD and FSS, specifically with respect to IBS, pain, and fatigue. The findings confirm the genetic associations between ASD and FSS, and advance our understanding of the co-occurrence mechanisms.

Conflict of Interest

None

Acknowledgements

We acknowledge the services of the Lifelines Biobank.

Lyall K, Croen L, Daniels J, Fallin MD, Ladd-Acosta C, Lee BK et al. The Changing Epidemiology of Autism Spectrum Disorders. Annu Rev Public Health 2017; 38: 81–102.
Grove J, Ripke S, Als TD, Mattheisen M, Walters RK, Won H et al. Identification of common genetic risk variants for autism spectrum disorder. Nature Genetics 2019; 51(3): 431-+.
American Psychiatric Association D, Association AP. Diagnostic and statistical manual of mental disorders: DSM-5, vol. 5. American psychiatric association Washington, DC2013.
Sandin S, Lichtenstein P, Kuja-Halkola R, Hultman C, Larsson H, Reichenberg A. The Heritability of Autism Spectrum Disorder. JAMA 2017; 318(12): 1182–1184.
Kuhlthau KA, McDonnell E, Coury DL, Payakachat N, Macklin E. Associations of quality of life with health-related characteristics among children with autism. Autism 2018; 22(7): 804–813.
Donnachie E, Schneider A, Enck P. Comorbidities of Patients with Functional Somatic Syndromes Before, During and After First Diagnosis: A Population-based Study using Bavarian Routine Data. Scientific Reports 2020; 10(1).
Takahashi N, Harada T, Nishimura T, Okumura A, Choi D, Iwabuchi T et al. Association of Genetic Risks With Autism Spectrum Disorder and Early Neurodevelopmental Delays Among Children Without Intellectual Disability. Jama Netw Open 2020; 3(2).
Schendel D, Laursen TM, Albinana C, Vilhjalmsson B, Ladd-Acosta C, Fallin MD et al. Evaluating the interrelations between the autism polygenic score and psychiatric family history in risk for autism. Autism Research 2022; 15(1): 171–182.
Wojczynski MK, North KE, Pedersen NL, Sullivan PF. Irritable Bowel syndrome: A co-twin control analysis. Am J Gastroenterol 2007; 102(10): 2220–2229.
Diatchenko L, Nackley AG, Tchivileva IE, Shabalina SA, Maixner W. Genetic architecture of human pain perception. Trends Genet 2007; 23(12): 605–613.
Sullivan PF, Evengard B, Jack A, Pedersen NL. Twin analyses of chronic fatigue in a Swedish National Sample. Psychological Medicine 2005; 35(9): 1327–1336.
Saito YA. The role of genetics in IBS. Gastroenterol Clin North Am 2011; 40(1): 45–67.
Norbury TA, MacGregor AJ, Urwin J, Spector TD, McMahon SB. Heritability of responses to painful stimuli in women: a classical twin study. Brain 2007; 130(Pt 11): 3041–3049.
Buchwald D, Herrell R, Ashton S, Belcourt M, Schmaling K, Sullivan P et al. A twin study of chronic fatigue. Psychosom Med 2001; 63(6): 936–943.
Eijsbouts C, Zhen TH, Kennedy NA, Bonfiglio F, Anderson CA, Moutsianas L et al. Genome-wide analysis of 53,400 people with irritable bowel syndrome highlights shared genetic pathways with mood and anxiety disorders. Nature Genetics 2021; 53(11): 1543–1552.
Johnston KJA, Adams MJ, Nicholl BI, Ward J, Strawbridge RJ, Ferguson A et al. Genome-wide association study of multisite chronic pain in UK Biobank. Plos Genet 2019; 15(6).
Deary V, Hagenaars SP, Harris SE, Hill WD, Davies G, Liewald DCM et al. Genetic contributions to self-reported tiredness. Molecular Psychiatry 2018; 23(3): 609–620.
Al-Beltagi M. Autism medical comorbidities. World journal of clinical pediatrics 2021; 10(3): 15.
McElhanon BO, McCracken C, Karpen S, Sharp WG. Gastrointestinal symptoms in autism spectrum disorder: a meta-analysis. Pediatrics 2014; 133(5): 872–883.
Chen MY, Li S, Zhu ZW, Dai CG, Hao XJ. Investigating the shared genetic architecture and causal relationship between pain and neuropsychiatric disorders. Hum Genet 2022.
van Rheenen W, Peyrot WJ, Schork AJ, Lee SH, Wray NR. Genetic correlations of polygenic disease traits: from theory to practice. Nat Rev Genet 2019; 20(10): 567–581.
Turley P, Walters RK, Maghzian O, Okbay A, Lee JJ, Fontana MA et al. Multi-trait analysis of genome-wide association summary statistics using MTAG. Nature Genetics 2018; 50(2): 229-+.
Dudbridge F. Power and Predictive Accuracy of Polygenic Risk Scores. Plos Genet 2013; 9(3).
Davey Smith G, Ebrahim S. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? International journal of epidemiology 2003; 32(1): 1–22.
Klein L, D'Urso S, Eapen V, Hwang LD, Lin P. Exploring polygenic contributors to subgroups of comorbid conditions in autism spectrum disorder. Scientific Reports 2022; 12(1).
Bulik-Sullivan BK, Loh PR, Finucane HK, Ripke S, Yang J, Schizophrenia Working Group of the Psychiatric Genomics C et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat Genet 2015; 47(3): 291–295.
Bulik-Sullivan B, Finucane HK, Anttila V, Gusev A, Day FR, Loh P-R et al. An atlas of genetic correlations across human diseases and traits. Nature genetics 2015; 47(11): 1236–1241.
Werme J, van der Sluis S, Posthuma D, de Leeuw CA. An integrated framework for local genetic correlation analysis. Nature genetics 2022; 54(3): 274–282.
Turley P, Walters RK, Maghzian O, Okbay A, Lee JJ, Fontana MA et al. Multi-trait analysis of genome-wide association summary statistics using MTAG. Nature genetics 2018; 50(2): 229–237.
Watanabe K, Taskesen E, Van Bochoven A, Posthuma D. Functional mapping and annotation of genetic associations with FUMA. Nature communications 2017; 8(1): 1–11.
Wang K, Li MY, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res 2010; 38(16).
Sijtsma A, Rienks J, van der Harst P, Navis G, Rosmalen JGM, Dotinga A. Cohort Profile Update: Lifelines, a three-generation cohort study and biobank. International Journal of Epidemiology 2021.
Scholtens S, Smidt N, Swertz MA, Bakker SJL, Dotinga A, Vonk JM et al. Cohort Profile: LifeLines, a three-generation cohort study and biobank. International Journal of Epidemiology 2015; 44(4): 1172–1180.
Klijs B, Scholtens S, Mandemakers JJ, Snieder H, Stolk RP, Smidt N. Representativeness of the LifeLines Cohort Study. Plos One 2015; 10(9): e0137203.
Stolk RP, Rosmalen JG, Postma DS, de Boer RA, Navis G, Slaets JP et al. Universal risk factors for multifactorial diseases: LifeLines: a three-generation population-based study. Eur J Epidemiol 2008; 23(1): 67–74.
McCarthy S, Das S, Kretzschmar W, Delaneau O, Wood AR, Teumer A et al. A reference panel of 64,976 haplotypes for genotype imputation. Nature Genetics 2016; 48(10): 1279–1283.
Neustaeter A, Nolte I, Snieder H, Jansonius NM. Genetic pre-screening for glaucoma in population-based epidemiology: protocol for a double-blind prospective screening study within Lifelines (EyeLife). Bmc Ophthalmol 2021; 21(1).
Coombes BJ, Ploner A, Bergen SE, Biernacka JM. A principal component approach to improve association testing with polygenic risk scores. Genetic epidemiology 2020; 44(7): 676–686.
Burgess S, Bowden J, Fall T, Ingelsson E, Thompson SG. Sensitivity analyses for robust causal inference from Mendelian randomization analyses with multiple genetic variants. Epidemiology (Cambridge, Mass) 2017; 28(1): 30.
Hemani G, Tilling K, Davey Smith G. Orienting the causal relationship between imprecisely measured traits using GWAS summary data. Plos Genet 2017; 13(11): e1007081.
Hemani G, Bowden J, Davey Smith G. Evaluating the potential role of pleiotropy in Mendelian randomization studies. Human molecular genetics 2018; 27(R2): R195-R208.
Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D et al. The MR-Base platform supports systematic causal inference across the human phenome. elife 2018; 7.
Wang HH, Sun PF, Chen WK, Zhong J, Shi QQ, Weng ML et al. High Glucose Stimulates Expression of MFHAS1 to Mitigate Inflammation via Akt/HO-1 Pathway in Human Umbilical Vein Endothelial Cells. Inflammation 2018; 41(2): 400–408.
Zhong J, Wang HH, Chen WK, Sun ZR, Chen JW, Xu YJ et al. Ubiquitylation of MFHAS1 by the ubiquitin ligase praja2 promotes M1 macrophage polarization by activating JNK and p38 pathways (vol 8, 2017). Cell Death Dis 2018; 9.
Siniscalco D, Schultz S, Brigida AL, Antonucci N. Inflammation and Neuro-Immune Dysregulations in Autism Spectrum Disorders. Pharmaceuticals-Base 2018; 11(2).
Arenella M, Cadby G, De Witte W, Jones RM, Whitehouse AJ, Moses EK et al. Potential role for immune-related genes in autism spectrum disorders: Evidence from genome-wide association meta-analysis of autistic traits. Autism 2022; 26(2): 361–372.
Zhong J, Guo C, Hou W, Shen N, Miao C. Effects of MFHAS1 on cognitive impairment and dendritic pathology in the hippocampus of septic rats. Life Sci 2019; 235: 116822.
Hajri M, Abbes Z, Yahia HB, Jelili S, Halayem S, Mrabet A et al. Cognitive deficits in children with autism spectrum disorders: Toward an integrative approach combining social and non-social cognition. Front Psychiatry 2022; 13: 917121.
Zhu JH, Lee KY, Jewett KA, Man HY, Chung HJ, Tsai NP. Epilepsy-associated gene Nedd4-2 mediates neuronal activity and seizure susceptibility through AMPA receptors. Plos Genet 2017; 13(2).
Velmeshev D, Schirmer L, Jung D, Haeussler M, Perez Y, Mayer S et al. Single-cell genomics identifies cell type-specific molecular changes in autism. Science 2019; 364(6441): 685–689.
Reilly J, Gallagher L, Leader G, Shen S. Coupling of autism genes to tissue-wide expression and dysfunction of synapse, calcium signalling and transcriptional regulation. Plos One 2020; 15(12): e0242773.
Cheng J, Deng Y, Zhou J. Role of the Ubiquitin System in Chronic Pain. Front Mol Neurosci 2021; 14: 674914.
Yanpallewar S, Wang T, Koh DC, Quarta E, Fulgenzi G, Tessarollo L. Nedd4-2 haploinsufficiency causes hyperactivity and increased sensitivity to inflammatory stimuli. Sci Rep 2016; 6: 32957.
Laedermann CJ, Cachemaille M, Kirschmann G, Pertin M, Gosselin RD, Chang I et al. Dysregulation of voltage-gated sodium channels by ubiquitin ligase NEDD4-2 in neuropathic pain. J Clin Invest 2013; 123(7): 3002–3013.

The authors have declared there is NO conflict of interest to disclose

SupplementalMaterials.docx

Download PDF

Editorial decision: revise
12 Jul, 2024
Review #3 received at journal
26 Jun, 2024
Reviewer #3 agreed at journal
26 Jun, 2024
Review #1 received at journal
20 Jan, 2024
Reviewer #2 agreed at journal
15 Jan, 2024
Reviewer #1 agreed at journal
11 Jan, 2024
Reviewers invited by journal
25 Sep, 2023
Submission checks completed at journal
02 Aug, 2023
First submitted to journal
02 Aug, 2023
Unknown event
02 Aug, 2023
Editor assigned by journal
01 Aug, 2023

You are reading this latest preprint version

Shared genetic architecture and causality between autism spectrum disorder and irritable bowel syndrome, pain, and fatigue

Status:

Version 1

Abstract

Figures

Introduction

Methods

Results

Discussion

Conclusion

Declarations

Conflict of Interest

Acknowledgements

References

Additional Declarations

Supplementary Files

Status:

Version 1