In the current study we took a genomic nucleotide distance-based method previously used for identifying phylogenetic clades and applied it to detect viral species. The results suggest that herpes simplex viruses isolated from Lion and Pig-tailed macaques should be designated as separate species. To our knowledge this is the first time this technique has been applied to virus species, and it may be useful in detecting cryptic viral species.
Host-Virus Co-speciation
Herpesviruses have been shown to cospeciate with their hosts [46]; however, they can cross species barriers [47], especially in captivity [48-55]. These captive transmissions, especially between macaque species, can complicate phylogenetic analysis. In particular, cross-species transmission appears to be fairly common among the core herpes B strains and has been discussed previously in depth by Eberle et al. [12]. For some of the herpes B strains, the original source of the virus is unclear. For instance, the cynomolgus (Cyno) macaque-derived strain E90-136 is phylogenetically separated from the core herpes B strains (Figure 1); however, it was not sufficiently distant (Figure 2) to be considered a separate species. Interestingly, strain E90-136 was isolated from a Cyno macaque that died of a disseminated infection caused by the virus [56]. Herpes B infections are generally asymptomatic in the natural host, which may suggest that Cyno macaques are not the natural reservoir for this particular strain. For other OWM strains, interspecies spread is well documented. Isolate 8100812 was originally obtained from a DeBrazza monkey; however, restriction digest patterns showed that the Lion-tailed macaque was the natural host [51]. Phylogenetically, this appears appropriate, as strain 8100812 forms a node with the two Pig-tailed macaque isolates (Figures 1A and B) and, importantly, matches the phylogenetic profile of the macaque species themselves (Figure 1C). The correlation between Lion and Pig-tailed virus and macaque phylogeny strongly suggests host-virus co-speciation. Additionally, while natural cross-species viral transmissions between animals do occur [47, 57-59], natural cross-species transmissions among the animals and viruses in this study are fairly unlikely given the natural host ranges of the monkeys (Figure 5).
Exceptions are the Lion-tailed and Bonnet macaques, as well as the Pig-tailed and Cyno macaques, whose host ranges overlap considerably (Figure 5).
Viral Species Concept
Standard definitions of what constitutes a biological species, such as a reproductively isolated population [16], are insufficient for viruses because viruses replicate but do not reproduce like other organisms. Originally, viruses were simply classified according to the host that was infected, i.e., bacterial, plant or animal [60]. It was not until 1950 that official principles of animal virus classification were established, with categories such as morphology, chemical composition, method of transmission, tropism and symptomatology [60]. In 1963 the International Committee on Nomenclature of Viruses (ICNV) was established, and in 1966 the body proposed a taxonomic framework and classification rules that included class, order and family. This organization is now known as the International Committee on Taxonomy of Viruses (ICTV) [60, 61]. In 1990 the ICTV established an official definition of viral species, stated as “a virus species is a polythetic class of viruses that constitutes a replicating lineage and occupies a particular ecological niche” [62]. The definition has since evolved to state “a monophyletic group of viruses whose properties can be distinguished from those of other species by multiple criteria….not limited to natural and experimental host range, cell and tissue tropism, pathogenicity, vector specificity, antigenicity, and the degree of relatedness of their genomes or genes” [63]. While this statement recommends distinguishing properties for determining species, the process is still ambiguous.
We chose to focus our efforts on genomic distance in order to apply a quantitative measure to delimit viral species. Several species delimitation methods have been used in bacteria and eukaryotes. One of the most common recent methods is the generalized mixed Yule coalescent, in which the branching pattern of a single tree transitions from a Yule process (inter-species branching) to a coalescent process (intra-species branching) [33]. Single loci can be used for this method; more recently, multiple genes and morphological characters have also been used [64]. Previously, a distance method based on gene homology and sharing was used to reevaluate viral family classifications [65]. A relatively simple genomic distance cutoff method has been used to validate viral clades [34-37] and was applied to delimit species in the current study. A caveat is that the distance cutoff value used in the current study is not universal but dataset dependent. Consequently, as the genomes of additional viruses are sequenced, the species cutoff value could shift over time. A general complication of the method used in the current study, and of other delimitation techniques based on genetic data, is that the methods may be delimiting populations and not necessarily species [66]. We cannot eliminate this possibility in our analysis; however, it is unlikely given the large distance values between species in the dataset. To determine whether the Lion and Pig-tailed derived simplex viruses were species separate from herpes B, we included all sequenced Old-World monkey strains in an effort not to bias the results and to establish a general cutoff for the Old-World monkey group.
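The logic of a distance cutoff can be illustrated with a minimal sketch. This is not the pipeline used in the study: the uncorrected p-distance, the single-linkage grouping rule, the function names and the toy sequences are all assumptions made for illustration. Only the 8.94% cutoff is taken from this dataset, and, as noted above, it should not be treated as universal.

```python
from itertools import combinations


def p_distance(a: str, b: str) -> float:
    """Uncorrected proportion of differing sites between two aligned sequences.

    Columns containing a gap ('-') or ambiguous base ('N') in either
    sequence are skipped. Assumption: the study may use another metric.
    """
    if len(a) != len(b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for x, y in zip(a.upper(), b.upper()):
        if x in "-N" or y in "-N":
            continue
        compared += 1
        if x != y:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


def delimit_by_cutoff(aligned: dict[str, str], cutoff: float) -> list[set[str]]:
    """Group genomes so that any pair closer than `cutoff` shares a group.

    Single-linkage grouping via union-find (an illustrative assumption).
    """
    parent = {name: name for name in aligned}

    def find(name: str) -> str:
        # Follow parent links to the group representative.
        while parent[name] != name:
            parent[name] = parent[parent[name]]
            name = parent[name]
        return name

    for a, b in combinations(aligned, 2):
        if p_distance(aligned[a], aligned[b]) < cutoff:
            parent[find(a)] = find(b)

    groups: dict[str, set[str]] = {}
    for name in aligned:
        groups.setdefault(find(name), set()).add(name)
    return list(groups.values())


# Hypothetical toy alignment (20 sites); names and sequences are invented.
toy_alignment = {
    "strain_A1": "ACGTACGTACGTACGTACGT",
    "strain_A2": "ACGTACGTACGTACGTACGA",  # 1 difference from A1 (5%)
    "strain_B1": "TCGAACGTTCGTACGAACGT",  # 4 differences from A1 (20%)
}

# Dataset-dependent cutoff reported in this study (8.94%).
groups = delimit_by_cutoff(toy_alignment, cutoff=0.0894)
```

In this toy case the two A strains fall below the cutoff and are grouped together, while strain_B1 exceeds it against both and is returned as its own candidate species. Adding genomes to the alignment can change which pairs link up, which mirrors the dataset dependence of the cutoff discussed above.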
The results of our study showed that the genome-based genetic distances between the Lion-tailed and Pig-tailed macaque derived viruses and the core herpes B strains were both approximately 14%, which was greater than the distance (~10%) observed between SA8 and herpesvirus papio 2 (PaHV-2; Figure 2B), both previously established viral species. Both the ~14% and ~10% values were well above the species cutoff value (8.94%; Figure 2B). The recovery of SA8 and PaHV-2 as separate species helps to validate the method. The genetic distance data, together with the data supporting co-speciation of the Lion and Pig-tailed macaque viruses, reinforce the idea that these viruses should be designated as species separate from herpes B and from each other.
Cryptic Viral Species
The term cryptic species is related to similar concepts such as sibling species, species complex, and superspecies, and the boundaries between these definitions are often blurred. Cryptic species are generally defined as species that appear virtually identical phenotypically but belong to different taxa, and were thus “hidden”. Cryptic species were originally described three centuries ago [28, 29], and with modern molecular techniques they have been increasingly identified across multiple organisms [67-71]. To our knowledge, the concept of cryptic species has not been applied to viruses, although the term species complex occasionally has [72, 73]. Based on the phylogenetic network of the Old-World monkey simplex viruses (Figure 1A), these viruses could be described as a series of species complexes (i.e., groups of closely related viruses that are difficult to separate), one comprising the macaque viruses and a second encompassing the baboon simplex viruses. The genetic distance cutoff method may be useful in establishing species boundaries in these complexes, as the method confirmed species status for the baboon-derived PaHV-2 and SA8. Importantly, the method identified the Lion and Pig-tailed simplex viruses as separate species (Figure 2), essentially defining these viruses as cryptic species. The genetic distance cutoff method provides a quantitative threshold for determining species status and could be another tool for establishing species boundaries within viral cryptic species complexes.
Summary
In summary, genome-based phylogenetic and genetic distance cutoff techniques were applied to the available Old-World monkey simplex virus genome sequences. The results showed that the Lion and Pig-tailed macaque simplex viruses were approximately 14% distant from the core herpes B strains, a greater distance than that between PaHV-2 strains and SA8, which are previously established viral species. The genomic distance cutoff method recovered PaHV-2 and SA8 as separate species, and the Lion and Pig-tailed macaque simplex viruses as separate species, effectively identifying these macaque viruses as cryptic species. Based on the genetic distance analysis, the fact that the OWM hosts are designated as separate species, and the co-evolution of herpesviruses with their hosts, we propose establishing the Lion and Pig-tailed macaque simplex viruses as separate species. This may be the first identification of cryptic viral species.