Pyropia is a typical genus in the rhodophyte order Bangiales and is of fundamental commercial and social importance as the edible seaweed “nori” (Cho and Rhee 2019). Pyropia species, mainly Pyropia yezoensis (also named as Neopyropia yezoensis) and Pyropia haitanensis (also named as Neoporphyra haitanensis), are widely cultivated in Asian countries(Cao et al. 2016, Yang et al. 2020). Genetic breeding of new cultivars with improved economic traits, such as higher productivity and better quality, is critical to the commercial development of Pyropia sp. as a multibillion-dollar, world-wide aquaculture industry(Abdelrahman et al. 2017). Meanwhile, as intertidal red seaweed, Pyropia has attracted numerous research interests in multicellularity, stress adaptation, cellular development and evolution (Wang et al. 2020, Guan et al. 2022). Genome-associated analysis in Pyropia species is required by both genetic dissection of economic traits and basic biological studies. Recently, it was revolutionized by the high throughput and the low cost of next-generation sequencing (NGS) technologies, which have increased the amount of genome sequencing data (McCombie et al. 2019).
A big challenge in Pyropia genome sequencing is bacterial contamination, as Pyropia, like other seaweeds, is colonized by widely diverse bacteria that interact with them throughout their life cycle (Wang et al. 2022). Antibiotic treatments (usually using a mix of several kinds of antibiotics) will affect the growth of seaweed and fail to remove bacteria completely. Researchers used quartz sands polishing to remove bacteria on algal surface before isolating genomic DNA, but still found some contamination remained (Liu et al. 2022). Once the bacterial sequences were included in the data, it is impossible to identify and remove them completely, as the reference genomes for marine bacteria are not sufficient nowadays (Wang et al. 2022). Therefore, technology that can avoid bacterial DNA is in urgent need.
Another problem in Pyropia genome sequencing is that Pyropia species usually have high copies of chloroplast genome and mitochondrion genome. In sequencing data of directly extracted DNA from P. yezoensis thalli, chloroplast and mitochondrion genome data can account for up to 70%-80% (Wang et al. 2013, Wang et al. 2015). Thus, a vast amount of sequencing data is uninformative and more sequencing depth is required. This strongly leads to uncomplete genomic analysis in Pyropia.
Chromatin Immuno-precipitation (ChIP), using specific antibodies to bind to chromatin, is widely applied to identify protein-associated genomic loci (Ueda et al. 2022). As a eukaryote, nuclear DNA is tightly bound to a group of basic histone proteins and packaged into a structure called the nucleosome(Li and Zhu 2015). As a contrary, bacterial DNA, as well as the plastid and mitochondrion DNA, that originated from ancient bacterial endosymbionts, are naked(Burki 2017, Hołówka and Płachetka 2017). Based on the absence/presence of histone proteins associated with nuclear DNA only, we used the histone H3 antibody in ChIP to specifically isolate nucleosome DNA from P. yezoensis thalli for high-throughput sequencing. The effect of diminishing bacterial and plastid DNA, and the genome-wide coverage were examined through comparing to the direct-extraction and nuclei-extraction methods.