There are limited studies that investigate the molecular characterization of VP6 and NSP4 genes in human RVA strains worldwide as the interest has mainly focused on G and P distribution. This 15-year study focusses on the genotyping of VP6 and NSP4 genes and molecular characterization of the NSP4 gene of unusual G and P RVA strains isolated from children hospitalized with AGE.
Genotyping revealed three different I (I1, I2, I3) and E (E1, E2, E3) genotypes in unusual RVA strains, I2 and E2 being the most common. According to the Rotavirus Classification Working Group, 32 I and E genotypes are known so far, with I1, I2 and E1, E2 being the most commonly detected genotypes among humans [8, 20–22]. I1-E1 are strongly associated with G1/G3/G4/G5/G9-P[8] and follow the Wa-like genotype constellation, I2-E2 are associated with G2-P[4] typical of the DS-1 like genotype constellation and I3-E3 are associated with G3-P[9] typical of the AU-1 like constellation [21, 23].
Similarly, in a 10-year study (1996–2006) conducted in Brazil, which included both common and unusual strains, they found the same three I and E genotypes [23]. ] In their study the most prevalent I and E genotypes were I1 (82.7%) and E1 (81.5%), respectively. However, among strains with an unusual G (G6, G8, G10) and/or P (P[6], P[9], P[10], P[11], P[14]) genotype, I2 and E2 were the most prevalent I and E genotype (n = 7/13) [23], as in the present study. In other epidemiological studies such as a 4-year study conducted in the Democratic Republic of Congo, although the number of unusual G and/or P strains recorded was substantial, only two I (I1, I2) and E (E1, E2) genotypes were recorded [24].
In this study, I3 and E3 were detected in 2019 and 2017 onwards, respectively. Strains carrying the E3 genotype showed a significant increase between 2019–2021, during the COVID-19 pandemic period and they were mostly found in combination with G3-P[9]-I2 (n = 5/21, 23.8%). G3-P[9]-I2-E3 was the most prevalent G-P-I-E genotype combination throughout this study (n = 7/34, 20.6%). The increase in E3 was observed in the same period with the increase of P[9] strains in Greece, as recorded by Tatsi et al [15]. The first record of the G3-P[9]-I2-E3 genotype in humans was in 2012 in Korea, where it was isolated from a 9 year old female [25]. However, a similar strain (G3-P[9]-I2-R2-C2-M2-A3-N2-T3-E3-H3) was recently identified in 2021 in Thailand, and was originated from a feline with diarrhoea [26].
The rare combination of G3-P[9]-I2-E3 that was detected in this study is possibly derived from a reassortment event, but further investigation should be performed. Reassortment is common among RVs and is a crucial mechanism for the evolution of the virus. Molecular characterization of multiple RVA genes is important, as it may contribute to detect strains that do not fit into any of the major constellations (Wa, DS-1 and AU-1) and are probably products of reassortment events. Furthermore, this finding supports that VP6 and NSP4 can segregate independently, contradicting a study in 2003 that reported a genetic linkage among these two proteins in common, unusual and reassortant human strains [21]. Similarly, to our observation, many studies reported such reassortment events at VP6 and NSP4, but at a lower rate. In an 11-year study (1996–2006) in Brazil, the I1-E2 unusual I-E genotype combination was found in 1.2% of circulating strains [23]. The combinations I2-E1 and I1-E2 were detected in 15.4% of RVA strains in India during 1990–2000[27] and in 6.5% in Iran during 2021–2022 [28].
NSP4 is an essential protein for virus morphogenesis and pathogenesis. In the present study, nine possibly novel substitutions were found in the NSP4 gene. Most substitutions were detected in VP4-binding domain which also contains the toxic peptide and the interspecies variable domain (ISVD). According to other studies characterizing the nucleotide sequence of the NSP4 gene, the ISVD region shows great heterogenicity and the amino acid vary according to genotype [17, 29–32]. Limited functional studies exist and therefore the effects of these variants on the functionality and immunogenicity of the corresponding protein remain unknown.
Of interest are the substitutions in amino acid 131 in the region of the toxic peptide, in which the majority of the strains of this study carried the H131 and E2 strains mainly carried Y131. Ball et al. conducted functional study for this amino acid on infant mice and they found that substitutions in amino acid 131 has an effect on the enterotoxin properties of NSP4.[33] Specifically, they reported that the Y131K substitutions resulted in the absence of diarrhoea. Studies from Brazil between 1990–2000 and 1987–2003 have reported that Y131 was detected only in E2 strains, while E1 strains had H131, and there was no data regarding E3 strains.[34, 35] Srivastava et al. showed that patients infected with a strain carrying Y131 experienced more severe diarrhoea [29]. Even though the severity of symptoms was not evaluated in the present study, statistical analysis showed that children infected with an unusual strain carrying the E2 genotype had a higher chance to exhibit dehydration, which may indicate more severe diarrhoea. This result may also be related to the fact that Y131 was detected more in E2 strains.
Limitation of the present study included the moderate detection rates of both VP6 and NSP4 genes in RVA-positive fecal samples. However, similar detection rates have also been reported in other studies, possibly due to poor sample storage conditions or the presence of RNases resulting in fragmentation of the viral RNA genome, presence of PCR inhibitors or inability of primers to hybridize [8, 36]. Another limitation of our study was that the analysis was based only in four genes (VP7, VP4, VP6 and NSP4) and not in the complete genotype constellation, which would provide more information about the genetic evolution of the strains.
This is the first study of VP6 and NSP4 epidemiology and molecular characterization of NSP4 of unusual RVA strains in Greece, in which the unusual I3 and E3 genotypes, the reassortant I2-E3 human strains and many substitutions in significant domains of NSP4 gene were detected. Furthermore, a significant clinical association between dehydration and E2 genotype was described.
Continuous surveillance of the distribution of RVA genotypes based on the whole genome, the molecular characterization and their association with epidemiological and clinical data is important for the better knowledge of the virus’ evolution, the disease prognosis and upgrading RVA vaccines.