Genome-wide Identification of ATP Binding Cassette (ABC) Transporter and Heavy Metal Associated (HMA) Gene Families in Flax (Linum usitatissimum L.)

doi:10.21203/rs.3.rs-17128/v1

Download PDF

Research article

Genome-wide Identification of ATP Binding Cassette (ABC) Transporter and Heavy Metal Associated (HMA) Gene Families in Flax (Linum usitatissimum L.)

https://doi.org/10.21203/rs.3.rs-17128/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 19 Oct, 2020

Read the published version in BMC Genomics →

You are reading this older preprint version

Read the latest preprint version →

Background Flax (Linum usitatissimum L.) is a self-pollinated crop and diversified into two morphotypes for its stem fibre and seed oil. The availability of the flax reference genome sequence, previously assembled into 15 pseudomolecules, enables the characterization of important gene families. The ABC transporter and HMA gene families are considered important gene families in the control of cadmium (Cd) accumulation in crops. To date, the genome-wide analysis of these two gene families has been successfully conducted in several plant species but no systematic study is available for the flax genome.

Results Here we described both gene families in flax to provide a comprehensive overview of its evolution and some support towards the functional annotation of its members. The 198 ABC transporter and 12 HMA genes identified in the flax genome were classified into eight ABC transporter and four HMA subfamilies based on their phylogenetic analysis and domain compositions. Nine of these genes, i.e., LuABCC9, LuABCC10, LuABCG58, LuABCG59, LuABCG71, LuABCG72, LuABCG73, LuHMA3, and LuHMA4, were orthologous with the Cd associated genes in Arabidopsis, rice and maize. Ten motifs were identified from all ABC transporter and HMA genes and several motifs were conserved for all genes with similar gene length, but different subfamilies had their different motif structures. Both the ABC transporter and HMA families were highly conserved among subfamilies of flax and with Arabidopsis. While four types of gene duplication were observed at different frequencies, whole-genome or segmental duplications were the most frequent with 162 genes, followed by 29 dispersed, 14 tandem and 4 proximal, suggesting that segmental duplications contributed substantially to the expansion of both gene families in flax. The rates of non-synonymous to synonymous (Ka/Ks) mutations of paired duplicated genes were mostly less than one, indicative of a predominant purifying selection. Only five pairs of genes clearly exhibited positive selection with a Ka/Ks ratio greater than one. Gene ontology analyses suggested that most flax ABC transporter and HMA genes functioned in ATP binding, transporter, catalytic, ATPase activity, and metal ion binding. The RNA-Seq analysis of eight different organs demonstrated diverse expression profiling patterns of the genes and revealed their functional or subfunctional conservation and neo-functionalization.

Conclusion Characterization of the ABC transporter and HMA genes will help in the functional analysis of candidate genes in flax and other crop species.

Epigenetics & Genomics

flax

ABC transporter

HMA

gene duplication

expression profiling

ATP binding cassette (ABC) transporter genes are ubiquitous across the kingdoms of eukarya, eubacteria and archaea [1, 2]. Plant genomes harbor more than 100 ABC transporters which are involved in a broad range of biological functions [3]. ABC transporters comprise at least four domains: two transmembrane domains (TMDs) embedded in the membrane bilayer, and two nucleotide-binding domains (NBDs) located in the cytoplasm [2]. The structure of the TMDs is highly diverse and varies in the number of transmembrane helices, whereas the NBDs have highly conserved helices [4]. The ABC transporters are further categorized into full size transporters with two NBDs and two TMDs and half size with only one of each domain [5]. Therefore, the half-size transporter must form either homodimer or heterodimer to be functionally active.

In plant genomes, ABC transporters are categorized into eight different subfamilies (ABCA-ABCG and ABCI) [3]. Proteins belonging to the ABCA-ABCD subfamilies have a forward direction for domain organization (TMD-NBD) whereas ABCG and ABCI subfamilies have the inverse domain organization (NBD-TMD) [6]. ABCE and ABCF include only two NBDs and are designated as soluble proteins [6]. In Arabidopsis, 130 ABC transporter genes have been identified but few have been functionally characterized [7]. Previous studies have shown that ABC transporters participate in a wide range of processes including the transport of ions, carbohydrates, lipids, xenobiotics, antibiotics, drugs, and heavy metals [8–10]. The two members of the ABCB gene family in Arabidopsis (AtABCB1 and AtABCB2) are auxin transporters and the overexpression of AtABCB1 caused elongation of hypocotyl cells [11, 12]. Several members of the ABCC subfamily are responsible for phytate transport as exemplified in Arabidopsis (AtABCC5) [13], maize (ZmABCC4) [14] and rice (OsABCC13) [15]. Two other ABCC transporters (AtABCC1 and AtABCC2) mediate tolerance to both Cd and mercury by vacuolar sequestration [16]. The ABCF subfamily member AtABCF3 in Arabidopsis is involved in root growth and development [17]. ABCG subfamily members were reported to be involved in cuticle formation and Cd tolerance such as in Arabidopsis (AtABCG32) [18] and rice (OsABCG31 and OsABCG36) [19, 20]. Also, the ABC transporters AtABCG36 in Arabidopsis were shown to mediate Cd uptake in the epidermal cells of roots [21] and to be up-regulated by a Cd treatment [22].

Heavy metal pollution in food, water and soil is hazardous to human health. One of the systems of heavy metal toxicity includes irreversible in vivo protein auxiliary changes, causing a loss or reduction in protein function. The D-block elements generally exist in regular and rural condition [23]. Few of them, for example, copper, zinc, and calcium are fundamental micronutrients for plant digestion and support, while some are non-required metals, for example, Cd, and lead. These metals can be extremely toxic to plants and animals [24]. Similarly, many HMA genes have shown to play a specific function in different plant species. For example, OsHMA2 is concerned with vascular tissue loading of zinc and tonoplast localization in rice [25]. OsHMA3 is involved in localized tonoplast and translocate Cd into root while OsHMA4 transports Cu into root [26]. HvHMA1 is involved in Zn and Cd translocation into barley grain [27]. In wheat HMA genes also play an important role in Cd translocation and are localized in the plasma membrane [28]. Overexpression of AtHMA3 in Arabidopsis resulted in upregulated expression when compared to wild-type plants and Cd accumulation increased by about 2- to 3-fold [29]. The above studies provide an overview of the importance of both ABC transporter and HMA genes in various plant species yet no systemic studies has been reported in flax.

Flax is a self-pollinated crop and diversified into two morphotypes for its stem fibre and seed oil. The initial draft of the flax genome sequence was produced using a whole-genome shotgun (WGS) sequencing and short reads obtained on the Illumina sequencing platform [30]. A de novo assembly generated 88,384 scaffolds, totaling 318 Mb and representing ~ 81% of the estimated ~ 370 Mb flax reference genome. Thus, the availability of this recent update of the flax genome (version 2.0) provides a genomic resource that allows the identification of gene families, evolutionary relationship and structural analyses. To date, ABC transporter and HMA gene families have been studied in several plant species including Oryza sativa and Arabidopsis thaliana [31], Zea mays [32, 33], Brassica rapa [7], Brassica napus [34], Triticum aestivum [35], and Vitis vinifera [36], but not in flax. In this study, we hypothesized that either whole genome duplications (WGDs) or tandem events contributed to the expansion of the ABC transporter and HMA gene families in flax. Therefore, we studied the phylogenetic relationships, gene annotation, physicochemical properties, chromosomal distribution, gene synteny, protein-protein interaction (PPI), and gene duplication of all predicted ABC transporter and HMA genes of the flax genome to understand their evolution and hypothesize their putative functions. Finally, we examined the gene ontology (GO), and expression profiling of ABC transporter and HMA genes in eight organs, i.e., root, seed, ovary, and embryos five different stages (heart, globular, torpedo, mature and cotyledon embryo). This comprehensive study is the first report on ABC transporter and HMA genes in flax and provides gene candidate information for further Cd trait and marker association study.

ABC Transporter and HMA Genes in Flax and Their Physicochemical Properties

A total of 198 ABC transporter and 12 HMA genes were identified in the flax genome reference sequence of CDC Bethune. The ABC transporter genes were classified into eight and the HMA genes into four subfamilies. These genes were denoted as LuABCA1-LuABCA8, LuABCB1-LuABCB48, LuABCC1-LuABCC19, LuABCD1-LuABCD5, LuABCE1-LuABCE2, LuABCF1-LuABCF9, LuABCG1-LuABCG85, LuABCI1-LuABCI22, and HMA1-12. The basic information of these genes in terms of subfamilies including the protein identifier, coding sequence (CDS) length (bp), and protein properties such as number of amino acid (aa) residues, molecular weight (kDa), isoelectric point (pIs), and grand average of hydropathicity (GRAVY) are listed in Table S1. The CDS length ranged from 663 to 7,313 bp. The protein sizes varied from 220 to 2,438 aa with a molecular weight of 25.54 to 273.69 KDa. The pIs range was from 4.93 to 11.67 while the GRAVY values varied from − 0.606 to 0.619. Most genes (132/210) had positive values of GRAVY, indicating hydrophobic properties.

Predictions confirmed that LuABC and LuHMA proteins can be localized to a wide range of subcellular compartments such as plasma membrane, vacuoles, endoplasmic reticulum, nucleus, cytoplasm, chloroplast, golgi apparatus and mitochondrion. Thus, based on the observed sequence divergence and variations in GRAVY and pIs values among subfamilies of LuABC and LuHMA, we speculate that various members of these two gene families have the ability to respond to a variety of environmental cues at the micro or macro-environment levels.

Gene Annotation and Phylogenetic Analysis of ABC Transporter and HMA Genes in Plant Species

The gene annotation analysis based on the chromosomal positions of LuABC and LuHMA provides putative function(s) for each gene and the detailed information is given in Table S2. In brief, based on the predicted function, almost all the LuABC and LuHMA genes were responsive to ABC transporter, ATP binding and heavy metal ATPase. In addition, the LuABC genes were also involved in other important functions. For example, the LuABCC genes acted as multidrug resistance; the LuABCE genes were responsive to RNAse l inhibitor protein, whereas LuABCG was involved in regulating pleiotropic drug resistance. There were some other functions as well, which are assumed to be assisted by LuABCs (Table S2). Thus, the annotation analysis clearly shows the functional diversity of ABC transporters and HMA genes in flax.

Unrooted phylogenetic trees were constructed using the protein sequences for each of the eight ABC and four HMA subfamilies of Linum usitatissimum, Arabidopsis thaliana, Populus trichocarpa, Vitis vinifera, and Brachypodium distachyon (Fig. 1 and Table S3). The phylogenetic relationships within LuABC, AtABC, PtABC, VvABC, and BdABC were highly conserved. Based on the phylogenetic relationships of flax and other species, the ABC transporter proteins were divided into eight subfamilies: ABCA-ABCG and ABCI (Fig. 1a-h). The same two subfamilies ABCB and ABCG, were the largest across all species, while the two subfamilies ABCD and ABCE had the fewest members of genes among all the species. The ABCG was the largest subfamily containing 81 members and dominant in the ABC transporter genes in flax compared with other species. Similarly, the HMA genes of different species were divided into four subfamilies based on their phylogenetic relationships (Fig. 1i and Table S4). The subfamily II was the largest one consisting of 20 members, of which five belonged to flax. The subfamily I was the smallest, consisting of six members in all the species. In short, the subfamily ABCG and HMA II had the highest number of genes in flax compared to other species except Populus trichocarpa in HMA. The distribution patterns of both ABC and HMA genes in five species in terms of different subfamilies are given in Table 1.

Based on previously reports in Arabidopsis, rice, and maize, several ABC and HMA genes are associated with Cd tolerance, including AtABCC1, AtABCC2, AtABCG32, AtABCG36, AtHMA3 and AtHMA4 in Arabidopsis, OsABCG31, OsABCG36 and OsHMA2 in rice, ZmHMA2 and ZmHMA3 in maize. We identified seven ABC transporter genes (LuABCC9-10, LuABCG58-59 and LuABCG71-73), and two HMA genes (LuHMA3-4) which were orthologous with the above Cd related genes (Fig. 1c, g and h). Those genes were assumed to be potential Cd associated candidates.

Chromosomal Localization, Syntenic Relationships, and Gene Duplication of ABC Transporters and HMAs

A total of 197 LuABC and 11 LuHMA genes were located on the 15 chromosomes of flax and three (LusABCG10, LusABCG15, and LuHMA5) were found on scaffolds which were not sorted onto chromosomes (Table S1). The locations of LuABC genes revealed the uneven patterns across the flax chromosomes and within subfamilies. The highest number of ABC transporter and HMA genes was on Chr3 (23), followed by Chr11 (18), and Chr1 (17). Specifically, the nine predicted Cd candidate genes were located on different flax chromosomes: LuABCG71 on Chr1, LuABCG73 on Chr3, LuABCG59 on Chr6, LuABCC9 and LuHMA3 on Chr7, LuABCC10, LuABCG58 and LuHMA4 on Chr12, and LuABCG72 on Chr14 (Fig. 2). The gene collinearity analysis revealed high conservation among subfamilies of both ABC and HMA with Arabidopsis orthologues (Fig. 2).

Four different types of gene duplications were observed from the identified ABC transporter and HMA genes, including 162 WGD or segmental, 29 dispersed, 14 tandem, and 4 proximal duplications. Only one ABC transporter gene (LuABCI16) was a singleton. Eight of the nine flax Cd candidate genes were of the segmental type and only one gene (LuHMA4) was a tandem duplication. Thus, segmental duplications played a dominant role in the expansion of the ABC transporter and HMA gene families in flax and proved our hypothesis.

Table 1

The distribution patterns of ABC and HMA genes in five different species.
Gene Families	Subfamilies	Lu	Ath	Ptr	Vv	Bd
	ABCA	8	12	5	5	6
	ABCA	48	28	40	30	32
	ABCC	19	15	25	26	19
ABC	ABCD	5	2	3	1	4
	ABCE	2	3	2	1	3
	ABCF	9	5	4	6	6
	ABCG	85	43	74	71	44
	ABCI	22	21	39	41	19
	Sub total	198	129	192	181	133
	I	2	1	1	1	1
	II	5	2	6	3	4
HMA	III	3	2	4	3	2
	IV	2	3	1	1	2
	Sub total	12	8	12	8	9
	Total	210	137	204	189	142

Lu = Linum usitatissimum, Ath = Arabidopsis thaliana, Ptr = Populus trichocarpa, Vv = Vitis vinifera, and Bd = Brachypodium distachyon.

Synonymous and Non-synonymous Substitution Rates, Gene Structure Analysis and Motif Composition

The synonymous (Ks) and non-synonymous (Ka) values were estimated based on the duplicated pairs of genes across the flax genome. The Ka/Ks ratios of five pairs (LuABCG71/LuABCG72, LuABCG61/LuABCG64, LuABCG80/LuABCG69, LuABCG4/LuABCG3, and LuHMA6/LuHMA8) exceeded one, suggesting positive selection. The remaining gene pairs underwent purifying selection with a Ka/Ks ratio of less than one. The estimated duplication time of LuABC and LuHMA gene pairs ranged from 1.53 to 28.27 million years ago (MYA), with an average of 8.59 MYA (Table S5).

Conserved motifs and gene structure organization of LuABCs and LuHMAs were analyzed to better understand the global conservation and diversification of these two gene families. A total of ten distinct conserved motifs were identified. Several motifs were highly conserved; for instance, motifs 2 and 5 commonly occurred among subfamilies LuABCA-LuABCI members and also in HMA proteins (Fig. 3a). Of these 10 motifs, motif 6 were specifically expressed in both ABC and HMA proteins except ABCB, ABCD and ABCF subfamilies. Of the nine flax Cd candidate genes, three (LuABCG71, LuABCG72, and LuABCG73) consistently had 9–10 motifs and similar gene length. However, distinct motif compositions existed among most of the subfamilies.

The gene exon structure analysis based on the coding sequences of LuABCs and LuHMAs showed diversification between different subfamilies as well as within subfamilies. Especially 19 genes of ABC contains varied number of exons ranges from 20–31 including LuABCC7, LuABCC9-11, LuABCD1, LuABCF4, LuABCG59-62, LuABCG70-71, LuABCG75, LuABCG78-81, LuABCG88, and LuABCG85 (Fig. 3b). The consistency of these exon numbers representing their high conservation among ABC. Overall, the highest numbers of exon 38 and 40 were observed in LuABCA6 and LuABCA5, the remaining ABC and HMA ranged from 1–19 numbers of exon among different subfamilies.

Gene Ontology (GO) and Expression Profiling

To predict the regulatory functions of the LuABC and LuHMA genes in flax, we performed GO enrichment analyses. The GO terms were categorized into three subgroups: molecular function (MF), cellular component (CC), and biological process (BP) as described in Table S6. The LuABC and LuHMA proteins were enriched for MF such as ATP binding (GO:0005524), ATPase activity (GO:0016887), transporter activity (GO:0005215), catalytic activity (GO:0003824), kinase activity (GO:0016301), and metal ion binding (GO:0046872). CC GO terms associated with LuABC included integral component of membrane (GO:0016021), intracellular (GO:0005622), membrane (GO:0016020), and integral component membrane (GO:0016021). BP terms comprised transport (GO:0006810), transmembrane transport (GO:0055085), signal transduction (GO:0007165), GTPase-mediated signal transduction (GO:0007264), and cation transport (GO:0006812). Taken together, GO terms indicated central processes involving ATP binding and metal ion transport but also a wide range of other processes and activities in flax.

The expression patterns of the LuABC and LuHMA genes in the root, seed, ovary, and five different stages of embryos including heart, globular, torpedo, mature and cotyledon embryo from RNA-Seq data were presented in a heatmap (Fig. 4). In general, the highest number of genes were up-regulated for both LuABC and LuHMA genes was in seed (84/152), followed by root (75/152), and ovary (70/152). Among the five different embryos stages both LuABC and LuHMA genes showed a relatively weak expression, having 47/152, 41/152, 39/152, 37/152 and 36/152 of the genes up-regulated in mature embryo, cotyledon, torpedo, globular, and heart, respectively. The remaining LuABC and LuHMA genes were not expressed or displayed a low level expression in these different organs (Fig. 4a and b). The results based on the high expression level of LuABC and LuHMA genes in root and seed, suggesting that these genes might play a generic (housekeeping) transport role in the developmental organs of flax.

Eight of the nine flax Cd candidate genes were highly expressed in different organs, such as LuABCG71, LuABCG72, and LuABCG73 in root and seed (Fig. 4c). Additionally, multidimensional scaling (MDS) also revealed overall expression differences of genes among organs and high consistency of expression data between biological replicates (Figure S1).

Functional Evolution of Duplicated Genes and Interaction Network

Expression profiling can be mined to predict the functional fate of genes, and here, investigation of the mode and tempo of duplicated genes was performed to assess their functional evolution. We utilized and took advantage of RNA-Seq data by calculating the Pearson correlation coefficient (r) of the syntenic pairs across eight different organs used in our study. The significance of expression levels were tested based on the r values. If the r values were greater than 0.156 (at a significance level of α = 0.05), positive expression correlation was inferred. Forty-two of the 51 pairs had positive expression correlations, indicative of likely functional conservation or sub-functionalization after duplication. The remaining nine pairs had correlation values below 0.276 or a negative correlation, suggesting putative neo-functionalization of at least one of the syntenic pairs (Table S7).

The interaction network of LuABC and LuHMA was examined and a highly dense network formed among the different protein subfamilies (Figure S2a and b). However, a few of the ABC proteins did not interact: LuABCG6, LuABCG66, LuABCG68, LuABCG80, and LuABCG83 (Figure S2a). When the different subfamilies were compared, the LuABCG genes were more preferentially retained in flax during the process of evolution. The gene dosage hypothesis also predicted that those genes would be retained if they were interacting in networks with other proteins [37, 38].

In the current study, we systematically identified 198 ABC transporter and 12 HMA genes in flax that accounted for 0.484% of the total 43,384 annotated proteins [30]. The observed sequence divergence and variations in the physicochemical properties of LuABC and LuHMA genes may indicate the broad diversity of their biological functions. The domain composition analysis and phylogenetic tree validated the eight subfamilies: LuABCA-LuABCG and LuABCI and four subfamilies of LuHMA proteins. The results of the phylogenetic relationships among the genes were consistent with previous findings in Arabidopsis thaliana, Brassica rapa, and Brassica napus [7, 34]. The analysis based on the number of genes reveals the dominant proportion of flax compared with other species. For example, a total of 210 genes were identified in flax followed by 204 in Populus trichocarpa, 189 in Vitis vinifera, 143 in Brachypodium distachyon, and the least number of genes 137 in Arabidopsis thaliana. In general, the LuABC and LuHMA genes showed scattered distribution based on the phylogenetic tree, suggesting that the expansion of these gene families occurred before evolutionary divergence of the common ancestor.

Previous studies indicated that two ABCC, two ABCG, and two HMA genes in Arabidopsis, rice and maize are responsive to Cd tolerance. The flax genes orthologous with these genes may be also potential candidates related to Cd tolerance. Therefore, we identified nine flax Cd candidate genes, including LuABCC9-10, LuABCG58-59, LuABCG71-73, and LuHMA3-4. A further validation will be performed through genome-wide association study. The expression profiling of genes may provide a possible clue to their function. Evidence from our study and other studies have demonstrated that ABC transporters and HMA participate in various plant growth activities and stress tolerance. Metal transporters are known to play pivotal roles in numerous aspects of plants including essential and toxic metal distributions [39]. Thus, we predicted the possible functions of LuABC transporters and LuHMA by examining their gene annotation, gene ontology, and gene expression data of different organs. Taken together, LuABCs and LuHMAs seem to play particular roles in the ATP binding, transport and metal ion binding. The gene expression results also revealed that the majority of the genes were highly expressed in one or more of the eight various organs reported, thereby confirming tissue-specific expression. Of the nine Cd candidate genes, four (LuABCG71-73 and LuABCC10) showed higher expression patterns in root as well in seed and largely conserved regions in gene structure, suggesting their putative redundant functions in developmental organs of flax. Furthermore, these genes might share a common ancestor with similar biological functions in response to Cd.

Protein sequence analysis of gene families is needed for understanding functional innovation and divergence [40]. Most angiosperms have undergone at least two WGD events [41] which are frequently found to be associated with significant evolutionary switches that can contribute to the adaptability of species to a range of environments [42]. The WGD is strongly associated with the development of distinct plant species and gene duplications are a vital force in the process of genomic evolution and functional divergence [43]. Similarly, in the process of evolutionary history, most of the higher plants underwent polyploidization, a vital event in shaping plant genome [44]. Recent studies also revealed that flax has undergone one palaeopolyploidization (23–44 MYA) and one mesopolyploidization (3.7-9 MYA) event [30, 45]. Our findings suggest an average estimated duplication divergence time of 8.59 MYA for LuABC transporter and LuHMA genes. Therefore these results were consistent with more recent (3.7-9 MYA) WGD of the flax genome. Segmental and tandem duplications are the prominent mode of expansion in Arabidopsis [46]. In flax, a total of four different types of gene duplications were observed from 210 ABC transporter and HMA genes, having 162 WGD or segmental, 29 dispersed, 14 tandem, 4 proximal duplicated genes. Segmental duplications (77.14%) contributed the most to the expansion of the two gene families in flax, a common mode of expansion for many gene families across various plant species [47–49]. The selection pressure analysis of duplicated gene pairs based on three categories (i.e., purifying, positive, and neutral selection) tends to provide valuable information regarding protein-coding genes [50]. Ka/Ks ratio values of less than one indicate purifying selection, while the values equal to one specify neutral selection and values greater than one signify positive selection [51, 52]. We noticed that most of the pairs of LuABC and LuHMA genes underwent purifying selection. Our finding suggests that these pairs of genes largely contributed to maintenance of flax growth and development and the strong positive selection in a few genes showed functional differentiation. The duplicated genes underwent one of several fates such as functional or sub-functionalization, neo-functionalization and pseudogenization [51]. The expression correlation analysis of syntenic pairs of flax across eight different organs revealed their functional roles in evolutionary fates. The results of our study suggested only two fates: functional or sub-functionalization and neo-functionalization. Thus, we can assume that most of the gene pairs maintain the same function by showing functional conservation. However, the remaining nine pairs exhibited neo-functionalization, which indicates each copy requires a new function. The structural diversity mainly contributed to the evolution of the gene families as indicated by evolutionary studies [53]. The observed diversity in a few genes of LuABC and LuHMA may have been lost during the evolutionary process and it might contribute to the functional divergence after their loss and birth.

In short, these analyses suggested that the ABC transporter and HMA gene families in flax were expanded during the process of evolution and gene duplication events. Among the nine Cd responsive genes, we observed a consisted trends among two genes i.e., LuABCG71 and LUABCG72 for various properties. Both of them showed conserved gene structure with almost identical number of introns (20 and 16). The gene duplications analysis for this pair was segmental and underwent positive selection with a Ka/Ks ratio of higher than one. The expression analysis was also higher specifically in root and seed, and their PCC further revealed functional conservation among them (Figure S3). Also, the remaining four positive pairs (i.e., LuABCG61/LuABCG64, LuABCG80/LuABCG69, LuABCG4/LuABCG3, and LuHMA6/LuHMA8) did not show as such consistency when compared with the two Cd genes based on their expression patterns and different motif structures.

However, a further functional validation step is needed to define which syntenic paralog pairs underwent functional changes during evolution either as a consequence of structural changes of the CDS or expression changes. Both epigenetic and structural modifications in cis- or trans-elements have an impact on gene expression.

A comprehensive sequence analysis of the ABC transporter and HMA gene families in flax was performed. We identified 198 LuABC transporter and 12 LuHMA genes that were clustered into eight ABC (ABCA-ABCG and ABCI) and four HMA subfamilies. From them, nine genes were predicted to be potential Cd candidates based on homology of Cd genes in Arabidopsis, rice and maize. Their phylogenetic relationships, gene annotation, motif composition, gene structure, syntenic relationships, cis-regulatory elements, and gene ontology are reported. The gene duplication analysis suggested that four different types of duplications occurred among LuABC and LuHMA genes such as WGD or segmental, tandem, dispersed, and proximal. WGD in flax contributed the most to the expansion of LuABC and LuHMA genes. The estimation divergence indicated that recent duplications (mesoopolyploidization) occurred among these two gene families. Moreover, the expression data illustrated high diversification and the evolutionary fate of syntenic gene pairs showed their functional or subfunctional conservation and neo-functionalization. Our results provide insights into the evolution and divergence of LuABC transporter and LuHMA genes in flax. These analyses will be foundational to future investigation into the biological functions of LuABC transporters and LuHMA genes, especially be helpful for further Cd marker association study in flax.

Identification of ABC Transporter, HMA Genes and Gene Duplication

Two methods were used for identification of ABC transporter genes in flax. First, the HMA genes were identified based on the eight reference gene sequences (Table S4) of Arabidopsis by BLAST search (BLASTP) method. Similarly, we performed the same Blast search method for ABC using the 129 reference sequences [54] from the Arabidopsis genome (version 10.0, http://www.arabidopsis.org/) against the flax genome at an E value of 1.0E-10.

Then, we performed a Hidden Markov Model (HMM) search against the flax genome to further confirm the presence of ABC genes with default option using HMMER (version 3.2.1) [55]. The different ABC transporter domains includes, ABC transporter (PF00005), ABC-2 transporter (PF01061), ABC transporter transmembrane region (PF00664), cytochrome c polymerization (CYT) (PF01458) or mammalian cell entry (mce) related protein (PF02470). These domains were downloaded from the Pfam (version 32.0) database (http://pfam.xfam.org/) [56]. After the results of both methods were merged, ABC and HMA genes were further screened on the basis of domains composition. The duplicated results between two methods were eliminated. A total of 745 ABC transporter and HMA protein were identified among all species except Arabidopsis thaliana.

The Cd associated genes in flax were identified based on previously reports in Arabidopsis, rice, and maize using phylogenetic relationships. Several ABC and HMA genes are responsive to Cd stress including AtABCC1, AtABCC2, AtABCG32, AtABCG36, AtHMA3 and AtHMA4 in Arabidopsis, OsABCG31, OsABCG36 and OsHMA2 in rice, ZmHMA2 and ZmHMA3 in maize.

The flax genome sequences (version 2.0) were obtained from NCBI [45] and protein sequences were downloaded from Phytozome (version 12.1) [57]. Genomic sequences of other species, Populus trichocarpa (version 3.1), Vitis vinifera (version Genoscope12X), and Brachypodium distachyon (version 1.2), were obtained from Phytozome (https://phytozome.jgi.doe.gov/pz/portal.html) [57]. The obtained protein sequences of ABC transporter and HMA genes were further verified for ABC/HMA domain compositions in the NCBI-Conserved Domain database (https://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi) [58] and SMART (http://smart.embl-heidelberg.de/) [59]. The protein sequences with errors, short length (<100 aa), and without ABC and HMA domains were removed before further analysis.

Phylogenetic Characterization of ABC Transporter and HMA Genes, and Synonymous (Ks) and Non-synonymous (Ka) Substitution Rates for Duplicated Genes

Multiple sequence alignment for ABC transporter or HMA protein sequences for five species including Linum usitatissimum, Arabidopsis thaliana, Populus trichocarpa, Vitis vinifera, and Brachypodium distachyon were performed using MUSCLE (version 3.8.1551). Phylogenetic trees were then constructed using the PhyML software [60] with the maximum likelihood (ML) method and the Jones, Taylor, and Thornton amino acid substitution model (JTT model).

The different types of gene duplications in the flax genome were identified by using MCScanX [61]. Synonymous (Ks) and non-synonymous substitution (Ka) rates were also calculated for duplicated gene pairs as previously described [47]. Also, a substitution rate of 1.5 × 10⁻⁸ substitutions per synonymous site per year [62] was used to estimate divergence time of duplicated genes.

Conserved Motifs, Gene Structure, and Physicochemical Parameters

Multiple Em for Motif Elicitation (version 5.1.0) [63] was used for scanning of conserved motifs in LuABC and HMA proteins. The maximum number of motifs 10 with a minimum width of 50 aa and a maximum of 100 aa was set as parameters. Moreover, TBtools (version 0.66) [64] was used to visualize both the motif composition and gene structure. The ExPASY PROTPARAM tool (http://web.expasy.org/protparam/) was accessed for various physicochemical properties, i.e., molecular weight (MW), isoelectronic points (pI), and GRAVY values for each gene. The subcellular localization of genes was predicted using the web application WOLF PSORT (https://wolfpsort.hgc.jp/).

Chromosomal Location, Gene Synteny Analysis, and Protein-Protein Interaction (PPI) Analysis

The gene synteny between flax and Arabidopsis was analyzed based on gene annotation data of both species and illustrated using shinyCircos [65]. The PPI analysis for all the ABC transporter and HMA proteins was carried out using an online server STRING (version 11.0) (https://string-db.org/) [66] with the following parameters: medium score (0.400), the number of K means clustering (3), and default values for the remaining options. The results of the interaction network were further visualized using Cytoscape (version 3.4.0) [67].

Plant Materials, RNA Sequencing and Read Data Analysis

Flax cultivar CDC Bethune was planted in greenhouse under growth conditions previously described [68]. Tissues of five different organs (root, seed, anther, ovary, and embryo) and five different stages of embryo development (heart, globular, torpedo, mature and cotyledon embryo) were collected for RNA extraction with two biological replicates for each tissue. Total RNA was extracted from each collected sample following the RNAqueous kit protocol (Ambion, Catalog# 1912) and RNAqueous-Micro kit protocol (Ambion, Catalog# 1931). The samples were homogenized in lysis buffer with polypropylene pestles in 1.5 ml Eppendorf tubes on ice. For RNA-seq profile analysis, Illumina mRNA-seq libraries were prepared using the TruSeq RNA kit (ver. 1, rev A) according to the manufacturer’s instructions. An Agilent 2100 Bioanalyzer was used for quantification and quality determinations of sample libraries. For Illumina HiSeq 2000 sequencing, four indexed libraries were pooled per sequencing lane and paired-end sequencing were performed.

The raw reads were initially trimmed by trimmomatic [69] and the trimmed reads were aligned to the genomic sequences of the flax using the kallisto [30, 70]. The reads with fewer than 5 rpm in at least one library were filtered out. Normalization was performed at trimmed mean of M-values (TMM) using an R package edgeR [71]. A general linear model (GLM) was used with glmLRT function to identify differentially expressed genes with false discovery rate (FDR) less than 0.05 [71]. The anther of the nine organs was used a reference for expression analysis. Thus, expression results were presented in the remaining eight organs.

Heat maps were drawn based on normalized Log₂ scale read counts using ClustVis [72]. Bidirectional cluster analysis was conducted using maximum distance and complete linkage method. Pearson correlation coefficient (r) based on expression data among the syntenic pairs of flax was calculated using Rstudio (version 3.6.0).

The GO analysis for ABC transporter and HMA genes in flax was conducted using Phytozome database (https://phytozome.jgi.doe.gov/pz/portal.html) with keyword search options against the flax genome.

ABC	ATP Binding Cassette
HMA	Heavy Metal Associated
WGD	Whole Genome Duplication
GO Cd GLM FDR JTT ML Ks Ka HMM MDS BCV RPM TMDs NBDs MYA	Gene Ontology Cadmium General linear Model False Discovery Rate Jones, Taylor, and Thornton Maximum Likelihood Synonymous Non-synonymous Hidden Markov Model Multidimensional Scaling Plot Biological Coefficient of Variation Read Per Million Transmembrane Domains Nucleotide-binding domains Million Years Ago

Acknowledgements

Not applicable.

Author’s Contributions

conceptualization, methodology, formal analysis, investigation, writing—original draft preparation, N.K.; writing—review and editing, supervision, project administration, funding acquisition, FMY and S.C.; provided RNA_seq data, R.D.; software, S.R and B.J. All authors read and approved the final manuscript.

Funding

This research was funded by Agriculture and Agri-Food Canada. The funders had no role in the design of the study; in the collection, analysis or interpretation of the data; in the writing of the manuscript; and in the decision to publish the results.

Availability of data and materials

All the data are available in the manuscript and its supplementary materials.

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Jones PM, George AM: The ABC transporter structure and mechanism: perspectives on recent research. Cell Mol Life Sci 2004, 61(6):682-699.
Rees DC, Johnson E, Lewinson O: ABC transporters: the power to change. NatRev Mol Cell Biol 2009, 10(3):218-227.
Kang J, Park J, Choi H, Burla B, Kretzschmar T, Lee Y, Martinoia E: Plant ABC transporters. The Arabidopsis Book 2011, 9:e0153-e0153.
Schneider E, Hunke S: ATP-binding-cassette (ABC) transport systems: functional and structural aspects of the ATP-hydrolyzing subunits/domains. FEMS Microbiol Rev 1998, 22(1):1-20.
Hollenstein K, Frei DC, Locher KP: Structure of an ABC transporter in complex with its binding protein. Nature 2007, 446(7132):213-216.
Mishra AK, Choi J, Rabbee MF, Baek K-H: In silico genome-wide analysis of the atp-binding cassette transporter gene family in soybean (Glycine max L.) and their expression profiling. BioMed Res Int 2019, 2019:14.
Yan C, Duan W, Lyu S, Li Y, Hou X: Genome-wide identification, evolution, and expression analysis of the ATP-binding cassette transporter gene family in Brassica rapa. Front Plant Sci 2017, 8(349).
Gadsby DC, Vergani P, Csanády L: The ABC protein turned chloride channel whose failure causes cystic fibrosis. Nature 2006, 440(7083):477-483.
Pighin JA, Zheng H, Balakshin LJ, Goodman IP, Western TL, Jetter R, Kunst L, Samuels AL: Plant cuticular lipid export requires an ABC transporter. Science 2004, 306(5696):702-704.
Sipos G, Kuchler K: Fungal ATP-binding cassette (ABC) transporters in drug resistance & detoxification. CurrDrug Targets 2006, 7(4):471-481.
Sidler M, Hassa P, Hasan S, Ringli C, Dudler R: Involvement of an ABC transporter in a developmental pathway regulating hypocotyl cell elongation in the light. Plant Cell 1998, 10(10):1623-1636.
Noh B, Murphy AS, Spalding EP: Multidrug resistance-like genes of Arabidopsis required for auxin transport and auxin-mediated development. Plant Cell 2001, 13(11):2441-2454.
Nagy R, Grob H, Weder B, Green P, Klein M, Frelet-Barrand A, Schjoerring JK, Brearley C, Martinoia E: The Arabidopsis ATP-binding cassette protein AtMRP5/AtABCC5 is a high affinity inositol hexakisphosphate transporter involved in guard cell signaling and phytate storage. JBiolChem 2009, 284(48):33614-33622.
Badone FC, Cassani E, Landoni M, Doria E, Panzeri D, Lago C, Mesiti F, Nielsen E, Pilu R: The low Phytic Acid1-241 (lpa1-241) maize mutation alters the accumulation of anthocyanin pigment in the kernel. Planta 2010, 231(5):1189-1199.
Tagashira Y, Shimizu T, Miyamoto M, Nishida S, Yoshida KT: Overexpression of a gene involved in phytic acid biosynthesis substantially increases phytic acid and total phosphorus in rice seeds. Plants 2015, 4(2):196-208.
Park J, Song W-Y, Ko D, Eom Y, Hansen TH, Schiller M, Lee TG, Martinoia E, Lee Y: The phytochelatin transporters ATABCC1 And ATABCC2 mediate tolerance to cadmium and mercury. Plant J 2012, 69(2):278-288.
Kato T, Tabata S, Sato S: Analyses of expression and phenotypes of knockout lines for Arabidopsis ABCF subfamily members. Plant Biotech 2009, 26(4):409-414.
Bessire M, Borel S, Fabre G, Carraca L, Efremova N, Yephremov A, Cao Y, Jetter R, Jacquat AC, Metraux JP et al: A member of the pleiotropic drug resistance family of ATP binding cassette transporters is required for the formation of a functional cuticle in Arabidopsis. Plant Cell 2011, 23(5):1958-1970.
Chen G, Komatsuda T, Ma JF, Nawrath C, Pourkheirandish M, Tagiri A, Hu YG, Sameri M, Li X, Zhao X et al: An ATP-binding cassette subfamily G full transporter is essential for the retention of leaf water in both wild barley and rice. ProcNatlAcadSci USA 2011, 108(30):12354-12359.
Fu S, Lu Y, Zhang X, Yang G, Chao D, Wang Z, Shi M, Chen J, Chao D-Y, Li R et al: The ABC transporter ABCG36 is required for cadmium tolerance in rice. J Exp Bot 2019, 70(20):5909-5918.
Kim D-Y, Bovet L, Maeshima M, Martinoia E, Lee Y: The ABC transporter ATPDR8 is a cadmium extrusion pump conferring heavy metal resistance. Plant J 2007, 50(2):207-218.
Bovet l, Eggmann t, Meylan-bettex m, Polier j, Kammer p, Marin e, Feller u, Martinoia e: Transcript levels of ATMRPS after cadmium treatment: induction of ATMRP3. Plant Cell Environ 2003, 26(3):371-381.
Nagajyoti PC, Lee KD, Sreekanth TVM: Heavy metals, occurrence and toxicity for plants: a review. Environ Chem Lett 2010, 8(3):199-216.
Williams LE, Pittman JK, Hall JL: Emerging mechanisms for heavy metal transport in plants. Biochim Biophys Acta 2000, 1465(1):104-126.
Yamaji N, Xia J, Mitani-Ueno N, Yokosho K, Feng Ma J: Preferential delivery of zinc to developing tissues in rice is mediated by P-type heavy metal ATPase OsHMA2. Plant Physiol 2013, 162(2):927-939.
Huang X-Y, Deng F, Yamaji N, Pinson SRM, Fujii-Kashino M, Danku J, Douglas A, Guerinot ML, Salt DE, Ma JF: A heavy metal P-type ATPase OsHMA4 prevents copper accumulation in rice grain. Nat Commun 2016, 7(1):12138.
Mikkelsen MD, Pedas P, Schiller M, Vincze E, Mills RF, Borg S, Møller A, Schjoerring JK, Williams LE, Baekgaard L et al: Barley HvHMA1 is a heavy metal pump involved in mobilizing organellar Zn and Cu and plays a role in metal loading into grains. PloS One 2012, 7(11):e49027-e49027.
Tan J, Wang J, Chai T, Zhang Y, Feng S, Li Y, Zhao H, Liu H, Chai X: Functional analyses of TaHMA2, a P1B-type ATPase in wheat. Plant Biotechnol J 2013, 11(4):420-431.
Morel M, Crouzet J, Gravot A, Auroy P, Leonhardt N, Vavasseur A, Richaud P: AtHMA3, a P_1B-ATPase allowing Cd/Zn/Co/Pb vacuolar storage in Arabidopsis. Plant Physiol 2009, 149(2):894-904.
Wang Z, Hobson N, Galindo L, Zhu S, Shi D, McDill J, Yang L, Hawkins S, Neutelings G, Datla R et al: The genome of flax (linum usitatissimum) assembled de novo from short shotgun sequence reads. Plant J 2012, 72(3):461-473.
Jasinski M, Ducos E, Martinoia E, Boutry M: The ATP-binding cassette transporters: structure, function, and gene family comparison between rice and Arabidopsis. Plant Physiol 2003, 131(3):1169-1177.
Pang K, Li Y, Liu M, Meng Z, Yu Y: Inventory and general analysis of the ATP-binding cassette (ABC) gene superfamily in maize (Zea mays L.). Gene 2013, 526(2):411-428.
Cao Y, Zhao X, Liu Y, Wang Y, Wu W, Jiang Y, Liao C, Xu X, Gao S, Shen Y et al: Genome-wide identification of ZmHMAs and association of natural variation in ZmHMA2 and ZmHMA3 with leaf cadmium accumulation in maize. Peer J 2019, 7:e7877.
Li N, Xiao H, Sun J, Wang S, Wang J, Chang P, Zhou X, Lei B, Lu K, Luo F et al: Genome-wide analysis and expression profiling of the HMA gene family in Brassica napus under Cd stress. Plant Soil 2018, 426(1):365-381.
Bhati KK, Sharma S, Aggarwal S, Kaur M, Shukla V, Kaur J, Mantri S, Pandey AK: Genome-wide identification and expression characterization of ABCC-MRP transporters in hexaploid wheat. Front Plant Sci 2015, 6:488-488.
Çakır B, Kılıçkaya O: Whole-genome survey of the putative ATP-binding cassette transporter family genes in Vitis Vinifera. PloS One 2013, 8(11):e78860-e78860.
Birchler JA, Veitia RA: The gene balance hypothesis: from classical genetics to modern genomics. Plant Cell 2007, 19(2):395-402.
Thomas BC, Pedersen B, Freeling M: Following tetraploidy in an Arabidopsis ancestor, genes were removed preferentially from one homeolog leaving clusters enriched in dose-sensitive genes. Genome Res 2006, 16(7):934-946.
Li D, Xu X, Hu X, Liu Q, Wang Z, Zhang H, Wang H, Wei M, Wang H, Liu H et al: Genome-wide analysis and heavy metal-induced expression profiling of the HMA gene family in Populus trichocarpa. Front Plant Sci 2015, 6:1149-1149.
Gu X, Zou Y, Su Z, Huang W, Zhou Z, Arendsee Z, Zeng Y: An update of diverge software for functional divergence analysis of protein family. MolBiolEvol 2013, 30(7):1713-1719.
Jiao Y, Wickett NJ, Ayyampalayam S, Chanderbali AS, Landherr L, Ralph PE, Tomsho LP, Hu Y, Liang H, Soltis PS et al: Ancestral polyploidy in seed plants and angiosperms. Nature 2011, 473(7345):97-100.
Clark JW, Donoghue PCJ: Whole-genome duplication and plant macroevolution. Trends Plant Sci 2018, 23(10):933-945.
Segraves KA: The effects of genome duplications in a community context. New Phytol 2017, 215(1):57-69.
Moghe GD, Shiu SH: The causes and molecular consequences of polyploidy in flowering plants. Ann N Y Acad Sci 2014, 1320:16-34.
You FM, Xiao J, Li P, Yao Z, Jia G, He L, Zhu T, Luo M-C, Wang X, Deyholos MK et al: Chromosome-scale pseudomolecules refined by optical, physical and genetic maps in flax. Plant J 2018, 95(2):371-384.
Cannon SB, Mitra A, Baumgarten A, Young ND, May G: The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biol 2004, 4:10-10.
Khan N, Fatima F, Haider MS, Shazadee H, Liu Z, Zheng T, Fang J: Genome-wide identification and expression profiling of the polygalacturonase (PG) and pectin methylesterase (PME) genes in grapevine (Vitis vinifera L.). Int J Mol Sci 2019, 20(13):3180.
Shazadee H, Khan N, Wang J, Wang C, Zeng J, Huang Z, Wang X: Identification and expression profiling of protein phosphatases (PP2C) gene family in Gossypium hirsutum L. IntJMolSci 2019, 20(6):1395.
Die JV, Gil J, Millan T: Genome-wide identification of the auxin response factor gene family in Cicer arietinum. BMC Genomics 2018, 19(1):301.
Juretic N, Hoen DR, Huynh ML, Harrison PM, Bureau TE: The evolutionary fate of mule-mediated duplications of host gene fragments in rice. Genome Res 2005, 15(9):1292-1297.
Lynch M, Conery JS: The evolutionary fate and consequences of duplicate genes. Science 2000, 290(5494):1151-1155.
Li J, Zhang Z, Vang S, Yu J, Wong GK, Wang J: Correlation between ka/ks and ks is related to substitution model and evolutionary lineage. JMolEvol 2009, 68(4):414-423.
Mercereau-Puijalon O, Barale JC, Bischoff E: Three multigene families in plasmodium parasites: facts and questions. IntJParasitol 2002, 32(11):1323-1344.
Verrier PJ, Bird D, Burla B, Dassa E, Forestier C, Geisler M, Klein M, Kolukisaoglu Ü, Lee Y, Martinoia E et al: Plant ABC proteins – a unified nomenclature and updated inventory. Trends Plant Sci 2008, 13(4):151-159.
Finn RD, Clements J, Eddy SR: HMMER web server: interactive sequence similarity searching. Nucleic Acids Res 2011, 39(suppl_2):W29-W37.
El-Gebali S, Mistry J, Bateman A, Eddy SR, Luciani A, Potter SC, Qureshi M, Richardson LJ, Salazar GA, Smart A et al: The Pfam protein families database in 2019. Nucleic Acids Res 2018, 47(D1):D427-D432.
Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, Mitros T, Dirks W, Hellsten U, Putnam N et al: Phytozome: A comparative platform for green plant genomics. Nucleic Acids Res 2012, 40(Database issue):D1178-D1186.
Marchler-Bauer A, Bo Y, Han L, He J, Lanczycki CJ, Lu S, Chitsaz F, Derbyshire MK, Geer RC, Gonzales NR et al: CDD/SPARCLE: functional classification of proteins via subfamily domain architectures. Nucleic Acids Res 2017, 45(D1):D200-D203.
Letunic I, Bork P: 20 years of the smart protein domain annotation resource. Nucleic Acids Res 2017, 46(D1):D493-D496.
Guindon S, Dufayard J-F, Lefort V, Anisimova M, Hordijk W, Gascuel O: New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst Biol 2010, 59(3):307-321.
Wang Y, Tang H, Debarry JD, Tan X, Li J, Wang X, Lee T-h, Jin H, Marler B, Guo H et al: MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res 2012, 40(7):e49-e49.
Koch MA, Haubold B, Mitchell-Olds T: Comparative evolutionary analysis of chalcone synthase and alcohol dehydrogenase loci in Arabidopsis, Arabis, and related genera (Brassicaceae). Mol Bio Evol 2000, 17(10):1483-1498.
Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, Ren J, Li WW, Noble WS: MEME suite: tools for motif discovery and searching. Nucleic Acids Res 2009, 37.
Chen C, Chen H, He Y, Xia R: Tbtools, a toolkit for biologists integrating various biological data handling tools with a user-friendly interface. bioRxiv 2018:289660.
Yu Y, Ouyang Y, Yao W: shinyCircos: an R/Shiny application for interactive creation of circos plot. Bioinformatics 2017, 34(7):1229-1231.
Szklarczyk D, Gable AL, Lyon D, Junge A, Wyder S, Huerta-Cepas J, Simonovic M, Doncheva NT, Morris JH, Bork P et al: STRING v11: protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res 2018, 47(D1):D607-D613.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 2003, 13(11):2498-2504.
Venglat P, Xiang D, Qiu S, Stone SL, Tibiche C, Cram D, Alting-Mees M, Nowak J, Cloutier S, Deyholos M et al: Gene expression analysis of flax seed development. BMC Plant Biol 2011, 11(1):74.
Bolger AM, Lohse M, Usadel B: Trimmomatic: a flexible trimmer for illumina sequence data. Bioinformatics 2014, 30(15):2114-2120.
Bray NL, Pimentel H, Melsted P, Pachter L: Near-optimal probabilistic RNA-Seq quantification. Nat Biotechnol 2016, 34(5):525-527.
Robinson MD, McCarthy DJ, Smyth GK: edgeR: a bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 2010, 26(1):139-140.
Metsalu T, Vilo J: ClustVis: a web tool for visualizing clustering of multivariate data using principal component analysis and heatmap. Nucleic Acids Res 2015, 43(W1):W566-W570.

Download PDF

Journal Publication

published 19 Oct, 2020

Read the published version in BMC Genomics →

Editorial decision: Major revision
17 Jun, 2020
Review #3 received at journal
14 Jun, 2020
Review #2 received at journal
14 Jun, 2020
Review #4 received at journal
07 Jun, 2020
Reviewer #4 agreed at journal
26 May, 2020
Reviewer #3 agreed at journal
25 May, 2020
Reviewer #2 agreed at journal
22 May, 2020
Review #1 received at journal
26 Mar, 2020
Reviewers invited by journal
18 Mar, 2020
Reviewer #1 agreed at journal
18 Mar, 2020
Editor assigned by journal
09 Mar, 2020
Submission checks completed at journal
08 Mar, 2020
Editor invited by journal
08 Mar, 2020
First submitted to journal
05 Mar, 2020

You are reading this older preprint version

Read the latest preprint version →

Genome-wide Identification of ATP Binding Cassette (ABC) Transporter and Heavy Metal Associated (HMA) Gene Families in Flax (Linum usitatissimum L.)

Status:

Journal Publication

Version 1

Abstract

Figures

Background

Results

ABC Transporter and HMA Genes in Flax and Their Physicochemical Properties

Gene Annotation and Phylogenetic Analysis of ABC Transporter and HMA Genes in Plant Species

Chromosomal Localization, Syntenic Relationships, and Gene Duplication of ABC Transporters and HMAs

Synonymous and Non-synonymous Substitution Rates, Gene Structure Analysis and Motif Composition

Gene Ontology (GO) and Expression Profiling

Functional Evolution of Duplicated Genes and Interaction Network

Discussion

Conclusion

Materials And Methods

Abbreviations

Declarations

References

Supplementary Files

Status:

Journal Publication

Version 1