Open reading frame (ORF) prediction in assembled contigs was carried out in the GeneMark suite. The functional annotation of proteins encoded by these ORFs were performed using BLASTp, NCBI Conserved Domain Search, HHpred Interactive Server and Inter-ProScan. The phage genomes were also analyzed by tRNAscan-SE to detect tRNA genes. Phage PZL-Ah1 and Phage PZL-Ah8 included with 44 ORFs and 52 ORFs, respectively. The size of proteins encoded by 44 ORFs of PZL-Ah1 ranged from 4.39 kDa (ORF 19) to 143.40 kDa (ORF 06). For the phage PZL-Ah8, the protein sizes ranged from 4.50 kDa (ORF 08) to 150.07 kDa (ORF 01). During the annotation process, 28 (63.6%) ORFs in PZL-Ah1 and 29 (55.8%) ORFs in PZL-Ah8 were homologous to the sequences of known functional proteins in NCBI database. In addition, 16 (36.4 %) ORFs in PZL-Ah1 and 23 (44.2%) ORFs in PZL-Ah8 were classified as “hypothetical proteins” with yet unknown functions (Table S1 and S2).
Like most phages, the coding region of Aeromonas phage genome was closely arranged, and the genes related to the function were clustered. In terms of gene functions, except for genes with unknown functions, the genome of Aeromonas phage could be divided into three functional categories, so as to the PZL-Ah1 and PZL-Ah8: nucleic acid metabolism and replication module; phage structure and packaging; host lysis. The gene sequence of each module was relatively conservative. PZL-Ah1 and PZL-Ah8 belonged to the Podoviridae family. A major difference between Podoviridae and Siphoviridae was the small number of ORFs, from about 20 to 50. But none of them contain lysogenic modules [11]. The lysogenic module consists of integrase and repressor proteins that were key to getting the phage into the lysogenic or lytic cycle, no ORFs were associated with drug resistance or lysogeny (e.g., site-specific integrases and repressors) in both genomes (Fig. 3). The nucleic acid metabolism and replication module contains genes related to DNA replication and gene regulation respectively. There were 8 ORFs and 12 ORFs related to nucleic acid metabolism and replication module in PZL-Ah1 and PZL-Ah8. The phage structure and packaging module contains terminal enzymes related to DNA packaging and genes in head and tail morphogenesis. 15 ORFs and 14 ORFs related to the structure and packaging module existed in the PZL-Ah1 and PZL-Ah8 genomes. Endolysin (lysin) proteins, which were found to be encoded in the two phage genomes (ORF 02, ORF 29 and ORF 04) in PZL-Ah1, ORF 24 in PZL-Ah8), were responsible for host cell lysis and release of phage progeny. Due to their ability to kill bacteria by degrading their cell wall, lysins had recently emerged as novel anti-microbial molecules, that could potentially be used to tackle emerging antimicrobial resistance challenges in bacteria [12, 13]. The receptor binding protein (RBP) interaction determined the range and specificity of the phage host. By comparing the ORFs of the two phages, we found the two phages encoded host range proteins (ORF16 in PZL-Ah1, ORF12 in PZL-Ah8). ORF16 in PZL-Ah1 encoded 89 amino acids with a protein size of about 9.15kDa, while ORF12 in PZL-Ah8 encoded 95 amino acids with a protein size of about 10.10 kDa. As for the receptor binding protein, we could use genetic engineering technology to expand the host range. So as to effectively solve the problems of strong specificity and narrow host range of phage application. Bardyt [14] found that when used genetic engineering techniques to modify the host decision region genes of phages, which could broaden the host range of phages. Yoichi [15] exchanged the GP37 and GP38 regions of T2 phage with PP01 phage to form the new recombinant T2 phage, which could infect the heterogeneic Escherichia coli, thus effectively expanded the host spectrum of T2 phage.