Acquisition of high-quality guinea pig preimplantation embryos
To acquire high-quality single-cell data from guinea pig preimplantation embryos, we employed morphological observation to stage these embryos based on post-fertilization time and microscopic morphological features. Embryos were harvested from guinea pigs at various embryonic days ranging from E0.5 to E5.5, encompassing stages from Zygote to Blastocyst, including 2-cell, 4-cell, 8-cell, early morula, late morula, early blastocyst, and blastocyst stages. Some blastocysts were further dissected into Inner Cell Mass (ICM) plus polar Trophoblast Cells (pTE) and mural Trophoblast Cells (mTE) groups using microdissection. These embryos were then enzymatically dissociated into single cells. A total of 599 single-cell samples were collected (Fig. 1A,B). Following quality control (QC), we obtained final data from 551 high-quality single-cell samples derived from guinea pig preimplantation embryos. The morphology of each stage of embryos and the dissociated single cells was documented (Fig. 1C,Fig.S1A,B).
Single-cell analysis of guinea pig preimplantation embryos
The post-QC single-cell data, were subjected to UMAP dimensionality reduction and classified into a total of 9 clusters. Given the limited research on guinea pig preimplantation embryos, we relied on cell markers from human and mouse preimplantation embryos to characterize the single-cell data of guinea pig embryos (Fig. 2B; Fig. S1D) [37–40]. The joint morphological pre-staging of sample information facilitated accurate cell annotation of clusters (Fig. S1C). We defined Totipotency (Toti) to include Zygote, 2-cell, and 4-cell stages, as well as 8-cell, morula, Trophoblastic Ectoderm (TE), and Primitive Endoderm (PrE) based on morphological observations and corresponding markers (Fig. 2A). However, the definitions of the Inner Cell Mass (ICM) and Epiblast (EPI) were more ambiguous.The regulation of the pre-implantation Epiblast (EPI) in various species has been shown to be regulated by Nanog[41–43]. Consequently, we aimed to use Nanog as a specific marker for EPI in cell cluster characterization. However, unexpectedly, Nanog expression was not detected in the available single-cell data (Fig. 2B). Given that guinea pig embryos initiate implantation at E6 [21], and theoretically, Nanog should be expressed in both ICM and EPI at E5.5, which was not observed. Further investigation revealed a gene named LOC100731016, which, upon inquiry in the NCBI database, was identified as the Nanog2 gene of the guinea pig. Nanog belongs to a gene family that includes NANOG1, its tandem duplication gene NANOG2, and the pseudogenes NANOGP2-P11. NANOG and NANOG2 have been demonstrated to be functionally equivalent and to activate specific stem cell regulatory pathways[44]. Thus, we adopted LOC100731016 (Nanog2) as an EPI-specific marker for cell cluster characterization (Fig. 2B).
Additionally, Dnmt3b was highly expressed in EPI, indicating a hypomethylation state, akin to findings in human embryo studies[45]. By subgrouping based on sampling information and marker expressions such as Tfcp2l1 and LOC100731016, the ICM group was also defined. The final classification comprised 7 cell classes: Totipotency (Toti), 8C, Morula, ICM, TE, EPI, and PrE (Fig. 2A). Multiple marker gene enrichment and specific up- and down-regulated genes were also clearly differentiated (Fig. 2C-E).
Guinea pig ZGA occurs at the 4–8 cell stage
Mammalian embryonic development commences with the formation of a fertilized egg following sperm-egg fusion. The initial stage of embryonic development is predominantly governed by a series of maternal factors supplied by the egg cell's cytoplasm. During this phase, the zygotic genome remains transcriptionally silent, enabling the zygote to maintain a state of totipotency. As the maternally deposited mRNA degrades, the zygotic genome starts to be transcribed, leading to the activation of the genome, the mobilization of embryonic gene products, and the clearance of maternal factors. This process, known as the maternal-to-zygotic transition (MZT), involves two key events: the degradation of maternal mRNA and the production of nascent mRNA, which signifies the activation of the zygotic genome (ZGA)[46, 47]. The coordination of these events is crucial. MZT is a critical phase in developmental studies, as the transitions that occur during this period are essential for embryonic patterning, which is heavily influenced by the actions of transcriptional activators [48–50].
Advances in histological sequencing have enabled the development of a sensitive and accurate method for sequencing in continuous time to assess the activation of the zygotic genome. We analyzed differences in gene expression between adjacent time periods for samples ranging from the zygote to the morula stage. The onset of ZGA is marked by significant changes in gene expression patterns and transcript levels, transitioning the embryo from a state of minimal transcription to one where thousands of genes are actively transcribed. This shift triggers a cascade of gene expression changes that influence subsequent cell fates [51].
Analysis of the differentially expressed genes (DEGs) revealed that the transcriptomic differences in guinea pig embryos at the zygote, 2C, and 4C stages are minimal, suggesting a transcriptionally quiescent state. In stark contrast, the transition from 4C to 8C stages exhibited significant transcriptomic alterations (Fig. 3A), aligning with the anticipated dynamics associated with the onset of ZGA. We delved into the gene expression patterns during the 4C to 8C developmental stages (Fig. 3B; Fig. S2,B). Gene Ontology (GO) enrichment analysis indicated that the up-regulated genes were predominantly involved in protein synthesis and transport (Fig. 3C). Conversely, Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis highlighted the pentose phosphate pathway (Fig. 3D). which is crucial for providing precursors for nucleic acid synthesis. The enrichment of this pathway during the 4C to 8C stages suggests vigorous DNA replication and synthesis. Additionally, Gene Set Enrichment Analysis (GSEA) and GO confirmed that numerous synthesis reactions are actively occurring at this stage. (Fig. S2).
Pseudotime analysis of preimplantation embryonic development in guinea pigs provides insights into lineage specification, a process extensively investigated in humans and mice. In mice, lineage specification typically unfolds in two stages. The initial stage occurs at the morula stage, delineating the inner cell mass (ICM) and trophectoderm (TE) cell fates, with TE later giving rise to extraembryonic tissues like the placenta. The subsequent stage transpires in the blastocyst phase, where the ICM differentiates into epiblast (EPI) and primitive endoderm (PrE) lineages, with EPI eventually forming the individual and PrE contributing to the yolk sac [3, 52]. In human studies, however, two contrasting views have emerged. One perspective posits that TE, EPI, and PrE are specified concurrently during human preimplantation development, suggesting a one-step process [41]. The alternative view advocates for an intermediate ICM state during lineage specification, akin to the two-step model observed in mice [53–55].
To investigate the lineage specification of preimplantation embryonic development in guinea pigs, we analyzed embryonic data spanning from the 8C to the blastocyst stages. The pseudotime analysis revealed a bifurcating trajectory with two primary branches emerging from an initial branch at the first branching point. Subsequently, one of these branches further diverged into two additional branches at the second branching point. Mapping cell types onto these branches indicated that the initial branch was predominantly composed of 8C and 16C morula cells. The first branching point delineated the trophectoderm (TE) and inner cell mass (ICM) branches. Following the proposed developmental timeline, the ICM branch subsequently differentiated into epiblast (EPI) and primitive endoderm (PrE) branches at the second branching point (Fig. 4A). Additionally, we overlaid the sampling time and pseudotime values onto the branches, identifying the first lineage separation in guinea pig preimplantation embryonic development around day E4.5, with the second lineage separation occurring around day E5.0 (Fig. 4B). To elucidate the key drivers of lineage specification, we visualized the top five genes with the highest contribution to the trajectory construction. Among these, Argfx, a gene homologous to a frame gene frequently observed in mammalian embryonic development studies from 8C to morula stages and known to influence developmental regulation [56], exhibited a comparable expression pattern in guinea pigs. The projection of specific marker genes further validated the accuracy of the trajectory construction (Fig. S3; Fig. S4). Notably, the ICM branch in guinea pigs was found to contain a mixture of EPI cells, differing from mice where distinct ICM cell clusters can be identified based on significant molecular features. Similar to humans, guinea pigs exhibit a high degree of similarity between ICM and EPI cell clusters. Overall, the pseudotime analysis of our data offers a comprehensive view of the three-lineage specification in guinea pig preimplantation embryos. Furthermore, we depicted the cell proportion dynamics at various time points, aligning with the developmental trajectory (Fig. 4C).
To explore the gene expression dynamics during lineage specification, we ranked the top 1000 genes by ascending q-values and generated a heatmap depicting differential gene expression across pseudotime, aiming to pinpoint genes with significant expression shifts over time. These genes were grouped into six clusters based on their expression patterns (Fig. 4D). The heatmap clusters illustrated distinct gene expression profiles that evolved with pseudotime. To delve deeper into pathway expression trends, we utilized Ucell to score relevant pathways, focusing on key processes such as oxidative phosphorylation, purine metabolism, and the VEGF pathway (Fig. 4E-G). Notably, the oxidative phosphorylation pathway exhibited elevated scores in ICM and EPI cells, aligning with observations of enrichment in preimplantation EPI cells across various species [57, 58]. These findings are congruent with prior research. Furthermore, the VEGF pathway showed higher scores in TE and PrE cells, which is consistent with its role in angiogenesis, as TE and PrE later contribute to the vascular-rich placenta and yolk sac, respectively. The observed expression patterns were congruent with the pathway enrichment analysis, underscoring the relevance of these pathways in early embryonic development.
Functional characteristics of ICM, EPI and PrE
To delineate the molecular characteristics of each lineage, we employed hdWGCNA (high-dimensional Weighted Gene Co-expression Network Analysis) [34] to uncover gene expression patterns associated with lineage specification (Fig. 5A; Fig. S5B). We selected a soft threshold power of 7 to construct the co-expression network (Fig. S5A). Following high-dimensional gene co-expression analysis, we identified 19 functional modules (Fig. S5A-B; Fig. S5D), with particular attention to those associated with the second lineage segregation. Utilizing correlation-assisted enrichment module screening, we identified the most significant modules enriched for EPI, ICM, and PrE as 3, 6, and 13 modules, respectively (Fig. 5B; Fig. S5C). The hub genes of these modules were determined by selecting the top 25 hub genes for each. Visualization revealed that Nanog2 was the central regulator in the 3 modules most significantly enriched for EPI, whereas Sox17 was the predominant core regulator enriched in PrE (Fig. 5C), aligning with previous findings [59].
To understand the gene expression patterns associated with the second cell fate decision, we identified 1209 genes by intersecting the markers identified by hdWGCNA and the findmarker function. We then analyzed these genes using KEGG to plot and examine a heatmap. The analysis revealed that EPI cells primarily involve pathways such as Hippo, Notch, oxidative phosphorylation, embryonic development, and cell adhesion. In contrast, ICM cells were associated with embryonic development and oxidative phosphorylation, while PrE cells were enriched in pathways like Ras, Rap1, and VEGF. These cell types exhibited partially overlapping yet distinct pathway expression profiles (Fig. 5D).
Comparison of human and guinea pig EPI characteristics
It is now generally accepted that preimplantation EPI is characterized by a naïve state, with the potential to produce a complete individual (but not to develop extra-embryonic tissues) and the ability to form chimeras [60, 61]. Also similar to humans, after implantation into the endometrium, both guinea pig and human EPIs show developmental events of cavitation and formation of a double-layered intervertebral disc[62]. EPI plays an important role in understanding the developmental mechanisms of pluripotent stem cells as well as the development of regenerative medicine. To compare the characteristics of EPI in human and guinea pig preimplantation embryos, we combined previously published human data with our guinea pig data. We focused on the dynamics of naïve genes and the EPI pluripotency transition process (EPST), which have been documented in humans [35]. Our analysis revealed that, compared to human EPST, guinea pigs lacked expression of Il6r and Nr0b1, showed reduced expression of Utf1, and exhibited increased expression of Lifr (Fig. 6C -D).
Furthermore, we performed PCA on the integrated data. The analysis indicated a clear separation of human and guinea pig cell data along the PC1 axis, highlighting species differences as the most significant factor (Fig. 6A). We identified 13,915 genes expressed in both species. Along the PC1 axis, we classified these genes into significantly positive and significantly negative categories. KEGG enrichment analysis revealed notable differences in pathways such as phosphatidylinositol metabolism, pyrimidine and purine metabolism, and oxidative phosphorylation (Fig. 6B). Along the PC2 and PC3 axes, the cellular projections of human and guinea pig overlapped, suggesting similar developmental trajectories during the EPI pluripotency transition process (Fig. 6A). These findings suggest both conservation and some differences in the development of human and guinea pig preimplantation EPI.