Characteristics of the preterm and term infants
Of the 19 preterm infants studied, two were lost-to-follow-up due to early death. The remaining preterm infants included three sets of twins and one set of triplets. For comparison, a total of 20 healthy full-term infants born at a gestational age of more than 37 weeks were included in this study. Characteristics of the preterm and term infants are reported in Table 1. Term infants were mainly born by spontaneous vaginal delivery and no exposure to antibiotics while preterm infants were mainly born by caesarean delivery (94.7%, p < 0.0001) and exposed to antibiotics (57.9%, p < 0.0001). Clinical characteristics of the preterm infants during admission are reported in Table 2. Eleven preterm infants had respiratory distress syndrome (RDS) and five patients received invasive mechanical ventilation. Eleven patients received parental nutrition (PN) and three of them had peripherally inserted central catheter (PICC). Eleven patients were treated with antibiotics for three days to two week with various combinations of penicillin (8 patients), gentamicin (10 patients), vancomycin (4 patients) and meropenem (4 patients).
Preterm infants’ stool samples collected during NICU admission were subjected for isolation for multidrug resistant Enterobacteriaceae, as published in Yap et al., (22). Antibiotic susceptibility profile of the isolates was correlated with patient clinical data for further statistical analyses.
Stool samples were collected at: day 1 (meconium), week 1, week 2, month 6 and month 12 of life from all infants. However, some time points were missed for selected infants due to the lack of adherence to the sampling schedule. In total, we collected 141 samples, for an average of 3.05 samples per preterm infants and 4.15 samples per term infants.
Microbial composition comparison between term and preterm infants
The alpha diversity of the faecal microbial community between term and preterm infants and time points were evaluated using Simpson, Shannon and Pielou’s evenness indices (Supplementary Figure 1). Overall, no significant difference in bacterial richness and evenness were observed between term and preterm samples. However, among the five sampling time points, statistically significant differences (p < 0.01) were achieved for all three indices between samples from week 2 with month 12. Separately, beta diversity was assessed using PERMANOVA and PLS-DA. Overall, significant differences in faecal bacterial composition between the term and preterm infants were detected in PERMANOVA (pseudo-F = 2.4834, P(perm) = 0.001). From the first PLS-DA plot (Figure 1), it is apparent that the faecal bacterial compositions from both groups are highly variable. When time points were added as factor, it was observed that samples from week 2 were overlapped with meconium and week 1. For month 6 and month 12, however, both groups formed two tight clusters with little overlapped. Consistent observation was also obtained using pairwise PERMANOVA analysis (Supplementary Table 2).
The overall distributions of the phyla and genera were provided in Supplementary Figure 2. Differentially abundance OTUs were identified based on negative binomial model. A total of 55 OTUs from 3 phyla (Proteobacteria, Firmicutes and Bacteroidetes) whose abundance differed between two groups across the five time points were identified. The species which matched highest sequence homology with the input were included in the Supplementary Table 3. When the comparison was made at phylum level, the dominant phyla for term infants were Firmicutes and Bacteroidetes; while enriched levels of Proteobacteria were observed in preterm infants’ stool at the first two weeks of life. At 12 months of life, significant abundance of Bacteroidetes (OTU0012, OTU0029, OTU0051 and OTU0110) were observed among the preterm group. Conversely, term infants were observed with abundant species of Proteobacteria at month 12. The preterm infants’ stool samples had significantly enriched Klebsiella OTUs (mainly K. pneumoniae) during the first two weeks of life while the level of Klebsiella OTUs started to elevate in term infant stools only after 6 months of life. It was observed that OTU0029 (Bacteroides fragilis) was elevated in week 1 and week 2 stools of term infants while the significant elevation only observed at month 12 for preterm.
Temporal differences in metabolomics profiles between term and preterm infants
PLS-DA plot inferred using metabolomics profiles showed less apparent separation in comparison to the microbial composition, although distinct clusters specific to term and preterm are still discernible (Figure 1). When time points were added as factor, samples from meconium, week 1 and week 2 were loosely clustered especially among the preterm infants (Figure 2). Consistent with bacterial composition, samples from month 6 were significantly different (P(perm) = 0.005, P(MC) = 0.005). Nonetheless, both sample groups (i.e. term and preterm) from month 12 did not show significant difference with most of the data points overlapped with month 6. This observation was also reflected in the significant metabolites detected consistently in month 6 and month 12 in the term infants. Significantly expressed metabolites according to time points were further identified using permutation test (number of permutations = 1000). Only metabolites with a P-value of 0.01 and below were selected and summarised in Table 3. The corresponding covariance plots of preterm vs term derived from stools samples obtained from meconium, week 1, week 2, month 6 and month 12 of life were included in Supplementary Figure 3. Meconium collected from preterm infants showed elevated glycerol. At week 1, preterm group showed elevated faecal valine, leucine, isoleucine, tyrosine and phenylalanine, whereas α-glucose and methylmalonic acid (MMA) were elevated in the term group. However, no significant metabolic changes in preterm and term infants were observed at week 2. Changes in stool metabolites were more pronounced at month 6 with the preterm group showing higher levels of faecal succinate, citrate and trimethylamine-N-oxide (TMAO), whereas the term infants showed higher levels of faecal β-hydroxybutyric acid (BHBA), fucose and pyruvatoxime. Additionally, the latter consistently showed elevated levels of BHBA and fucose up to month 12, whereas the preterm infants exhibited higher levels of faecal tyrosine and phenylalanine at month 12 as well as week 1.
Association between differentially expressed OTUs and significant metabolites
Network analysis was performed based on sparse least square model (sPLS-DA) (Figure 3). Three Erysipelatoclostridium-related OTUs (OTU0035, OTU0090 and OTU0112) were positively correlated to the branched-chain amino acids (BCAAs): valine, leucine and isoleucine. In addition, OTU0035 (Erysipelatoclostridium ramosum) was negatively correlated to BHBA and pyruvatoxime while OTU0090 (Clostridium cocleatum) and OTU0112 (Clostridium spiroforme) were negatively correlated to pyruvatoxime and α-glucose. Bacteroides fragilis (OTU0029) which was differentially expressed in term group at week 1 and week 2 and subsequently in preterm group at month 12, showed moderate to strong negative correlations to BHBA, fucose, MMA and pyruvatoxime. Lactobacillus mucosae (OTU0048) showed moderate positive correlations with the three BCAAs and strong negative correlations with pyruvatoxime. Veillonella seminalis (OTU0017) also showed strong correlations with the three BCAAs. OTU0050 which belongs to the family Comamonadaceae, showed distinct clustering with negative correlations with BHBA and fucose. OTU0050 was consistently significantly expressed in preterm group at week 1, month 6 and month 12, while term group, conversely, was enriched with the same OTU at week 2.
Correlations between omics data with demographic and clinical parameters
Distance based linear modelling was carried out to identify the demographic and clinical predictors for the differentially expressed 16S metagenomic and NMR metabolomic profiles. We modelled the faecal microbial and metabolic composition by splitting the data into each respective time point. Birth weight was consistently selected as the best explanatory variable for the total variation in the metabolic profiles all infants at birth and at month 12 (Table 4). On the other hand, gestational age was selected to explain the elevated 16S metagenomics profile for all infants at month 12. When the data was analysed by considering clinical parameters, the step-wise selection algorithm selected “PICC line insertion” and “isolation of bacteria resistant to 3rd generation cephalosporins” as the best explanatory parameters for the faecal metabolic composition of the preterm group at month 6 and month 12 respectively.