PCOS is an endocrine disease with a high incidence in women of childbearing age. Its basic pathophysiological features include HA and insulin resistance caused by disturbances in the ovarian environment, cytokine expression, and dysfunction. At present, the aetiology of PCOS is still not completely clear; however, based on existing studies, it is caused by environmental and genetic factors. As a disease with complex aetiology, PCOS may be caused by the joint action of multiple genes and their mutations or polymorphisms (6).
4.1 Single nucleotide variation in susceptibility genes in the PCOS family
In this study, the susceptibility genes of the family of PCOS patients were screened using whole-exon sequencing technology for the first time. Seven susceptibility genes were predicted: TGF-β1, DGAT1, EHMT1, KDR, LAMTOR1, SETD2, and SEC13.
In the past, genome-wide association studies (GWAS) have often been used to study the genetics of complex diseases. However, this method has several significant shortcomings. First, due to the inclusion of the whole genome, the identified gene loci are often located in non-coding or non-functional regions, making it difficult to explain the specific mechanisms underlying the occurrence of diseases. Second, the time taken limits the use of GWAS. The expressed region is the part of the eukaryotic DNA expressed as a protein. Exons account for approximately 1% of the human genome; however, 85% of disease-causing mutations occur in this region (32). Therefore, whole-exome sequencing, through the capture, enrichment, and high-throughput sequencing of exons, provides a new, economical, and efficient method for identifying susceptibility genes.
Traditional case-control studies are prone to bias due to factors such as race, environment, and genetic background affecting the results. As PCOS is a genetically predisposed disease, Studies on the correlation between susceptibility gene loci and disease based on family pedigree can ensure consistency in terms of genetic background, living habits, and environment, eliminating the interference of confounding factors. Therefore, we selected a small sample from this family to study the susceptibility genes of PCOS, which can guide subsequent research to a certain extent.
In view of the wide range of biological functions of TGF-β1 and the diverse clinical manifestations of PCOS, research has explored their relationship. Many studies have shown that abnormal TGF-β1 expression is involved in multiple pathological PCOS changes and has foetal origins (33, 34). TGF-β1 is associated with the pathological manifestations of ovarian fibrosis, HA, ovulation disorders, and insulin resistance equal to that in PCOS (9, 14–19, 22, 23, 35, 36). However, most of these studies have focused only on the level of correlation without studying its mechanism from the perspective of the pathway.
TGF-β intracellular signal transduction relies on the Smads protein family, which directly or indirectly acts on target genes and affects the transcription and expression of downstream genes (37). This series of regulatory processes has gradually become a focus of research. the TGF-β1 protein and a variety of cytokines communicate in the TGF-β/Smads pathway, which is involved in the growth and development of the body and a variety of diseases in the process of bidirectional regulation (38).
In the screening of susceptibility genes in families by exon sequencing, it was found that DGAT1, EHMT1, KDR, LAMTOR1, SETD2, and SEC13 may be susceptibility genes for PCOS. DGAT1 encodes diacylglycerol o-acyltransferase 1, a multichannel transmembrane protein and key metabolic enzyme possibly associated with obesity and other metabolic diseases. EHMTI is a histone methyltransferase regulating the monomethylation and dimethylation of histone H3 lysine 9 (H3K9), which forms heteropolymeric complexes with G9a in euchromatin. EHMT1 has various functions and is associated with tumour development, obesity, embryo growth, and cardiac hypertrophy. The brown adipose-specific loss of EHMT1 results in a significant reduction in tissue-mediated adaptive thermogenesis, obesity, and systemic IR (39). KDR encodes a type 2 receptor for vascular endothelial growth factor (VEGF). VEGF is highly specific and can promote the growth of vascular endothelial cells, increase vascular permeability, and promote the degeneration of the extracellular matrix. The VEGF system has been linked to ovarian diseases, including PCOS, in which follicles are prevented from developing (40–44). LAMTOR1, a late endosomal/lysosomal connector, which acts as a MAPK and MTOR activator 1, plays important roles in energy and glucose metabolism. Huang et al.(45) found that mouse models with β-cell LAMTOR1 gene-specific defects have higher glucose tolerance and glucose-stimulated insulin secretion during hyperglycaemic clamp and islet perfusion than control models. LAMTOR1 loss increases the amplification pathway induced by glutamic acid and acetyl-CoA carboxylase 1, ultimately leading to increased insulin secretion. The histone lysine methyltransferase SETD2 regulates the trimethylation of lysine 36 (H3K36) in histone H3 and is involved in the maintenance of chromatin structure, transcriptional extension, and genome stability. The proteins encoded by SEC13 belong to the WD(Trp-Asp) repeat protein family, a component of the endoplasmic reticulum and nuclear pore complex and are required for endoplasmic reticulum vesicle biogenesis during transport (46). SEC13 has also been linked to inflammation in the body (47), Macrophages of SEC13 mutant mice expressed low levels of MHC I and II and had high levels of soluble and membrane-bound TGF-β and serum immunoglobulin production. TGF-β expression remained high after stimulation or immunisation, suggesting that SEC13 is a key influencing factor in TGF-β production.
In the family included in this study, both SEC13 and TGF-β1 genes were mutated, and it is speculated that the genes interact through their independent changes, with noteworthy changes in the transport of substances in vivo and immune function. At present, studies on PCOS and the above-mentioned genes are limited. Subsequent studies can serve as a bridge between insulin resistance and abnormal lipid metabolism and further elucidate the aetiology and pathogenesis of PCOS by focusing on the mechanism of action of related genes.
4.2 TGF-β1 gene promoter methylation and PCOS
Regarding the aetiology and pathogenesis of PCOS, epigenetics can better explain the comprehensive effects of heredity, environment, nutrient metabolism, and neuroendocrine regulation, a new consensus reached by the academic community (48). Epigenetic modifications are reversible and adjustable. Compared with the direct intervention of gene expression, it has greater development potential and is likely a means to study the pathogenesis and intervention of diseases; it has gradually been applied in diabetes, hypertension, obesity, and other metabolism-related diseases (7). Based on existing studies, TGF-β1 is involved in the regulation of follicular growth, ovarian fibrosis, and insulin resistance, affecting the occurrence and development of PCOS. Therefore, TGF-β1 was selected as the susceptibility gene to further investigate the influence of methylation of this gene on the occurrence and development of PCOS.
The DNA methylation epigenetic modification of TGF-β1 has been studied in many fibrosis-related diseases such as lung, liver, and heart diseases (49–51). Nour et al.(52) reported for the first time that neuroendocrine and metabolic abnormalities in PCOS mice could be significantly improved by applying the methylation-promoting agent S-adenosyl methionine to their progeny.
In the PCOS rat model, our team found that the DNA methylation of the anti-Mullerian hormone and insulin receptor genes in blood lymphocytes was closely related to ovarian pathological changes, ovulation disorders, and insulin resistance (53, 54), However, no studies on TGF-β1 methylation and PCOS pathological manifestations have been reported. Tian et al.(55) used letrozole to induce PCOS in a rat model; the results showed that TGF-β1 expression levels in the follicular membrane and interstitial cells were significantly higher than those in the control group. However, the specific mechanism affecting its expression was not clear, and there is no study on whether the same is true in peripheral blood.
In this study, the TGF-β1 gene promoters in the 59 subjects showed significant CpG4 hypomethylation in PCOS patients compared with the normal controls (p = 0.001). The methylation rates of the CpG4 (p = 0.004) and CpG7 (p = 0.012) sites in the IR group were significantly lower than those in the normal control group. In addition, the CpG4 methylation level in the ELSE group of PCOS patients was significantly lower than that in the control group (p = 0.012). The total methylation rate in the IR group was significantly lower than that in the normal group (P = 0.005), and the methylation rate was positively correlated with age (R = 0.38, p = 0.0032) and negatively correlated with fasting insulin and HOMA-IR (R = -0.32, p = 0.012; R = -0.28, p = 0.029). There was no significant difference in mRNA and protein expression levels between the different groups of TGF-β1 methylation level, and no correlation with methylation rate. Subgroup analysis of the methylation levels showed that the testosterone level in the high methylation rate PCOS group was significantly higher than that in the low methylation rate PCOS group. Under the high methylation rate, the BMI, testosterone level, fasting insulin level, and HOMA-IR of the normal control group were significantly lower than those of PCOS patients, but their age was significantly higher than that of patients with PCOS. There was no significant difference in mRNA and TGF-β1 protein expression between these groups.
In this study, we found that methylation at the CpG4 site of the TGF-β1 promoter may affect PCOS pathogenesis and is more likely to cause insulin resistance than HA; the methylation rate is negatively correlated with fasting insulin level and HOMA-IR in the population. It is generally believed that hypermethylation inhibits transcription and hypomethylation promotes transcription (56), In the insulin resistance group, there are two low methylation CpG loci, the ELSE group have a CpG locus with low methylation, but did not make corresponding mRNA, and protein expression was significantly higher, on the one hand, may be associated with overall degree of methylation of the promoter, two loci methylation differences do not affect the transcription of the promoter area; this may be because the mRNA expression level is also affected by other factors, such as non-coding RNA and gene copy number variation. Additionally, BSP was used to measure methylation. Differences in each independent CpG locus were studied, easily leading to significant but small differences in the degree of methylation at the overall gene level, which failed to have a significant impact on the subsequent mRNA expression level. According to the results of this study, we speculate that abnormal methylation of TGF-β1 in peripheral blood may not be the direct cause of PCOS. However, due to the tissue specificity of DNA methylation and the interaction of genes, the epigenetic role of the TGF-β1 gene in PCOS cannot be completely denied. This study found that in women of childbearing age, the higher the methylation rate of the TGF-β1 gene, the lower the fasting insulin level and insulin resistance index, which also reflected the effect of the methylation rate on the IR phenotype.
Subgroup analysis of the case and control groups according to the methylation level showed that in patients with PCOS, the testosterone level in the high methylation rate group was significantly higher than that in the low methylation rate group. The reason for this result in our study is probably that the PCOS patients in the hypomethylation group did not have HA symptoms; all came from the IR and ELSE groups. This may indicate that the degree of methylation of the TGF-β1 gene promoter may not be directly associated with HA.
We also noted that methylation rates positively correlated with age in women of childbearing age (R = 0.38, p = 0.0032). This suggests that the lower the age, the lower the methylation rate of the TGF-β1 gene in peripheral blood during the reproductive age, which may increase the incidence of PCOS, especially IR PCOS. This is consistent with PCOS epidemiology in that the high incidence is concentrated in 18–30-year-olds (57). This also suggests that early PCOS prevention and intervention is particularly important as the methylation degree of TGF-β1 in peripheral blood may increase with age, and the role of TGF-β1 in PCOS pathogenesis may gradually decrease. However, this requires further verification in population cohort studies.
Inevitably, there are some drawbacks in this study. Firstly, we only included women in one PCOS family for sequencing, which lacks a large population study on the susceptibility genes. Secondly, PCOS susceptibility sites may also be distributed in the intron region of genes, producing regulatory non-coding RNA that affect protein expression. This is also a disadvantage of WES. In addition, although we found an association between TGF-β1 methylation and PCOS phenotypes, the mechanism has not been further explored and further studies are needed.