De novo transcriptome analysis and comparative expression profiling of genes associated with the taste-modifying protein neoculin in Curculigo latifolia and Curculigo capitulata fruits

doi:10.21203/rs.3.rs-41288/v2

Download PDF

Research article

De novo transcriptome analysis and comparative expression profiling of genes associated with the taste-modifying protein neoculin in Curculigo latifolia and Curculigo capitulata fruits

https://doi.org/10.21203/rs.3.rs-41288/v2

This work is licensed under a CC BY 4.0 License

Journal Publication

published 13 May, 2021

Read the published version in BMC Genomics →

You are reading this latest preprint version

Background Curculigo latifolia is a perennial plant endogenous to Southeast Asia whose fruits contain the taste-modifying protein neoculin, which binds to sweet receptors and makes sour fruits taste sweet. Although similar to snowdrop (Galanthus nivalis) agglutinin (GNA), which contains mannose-binding sites in its sequence and 3D structure, neoculin lacks such sites and has no lectin activity. Whether the fruits of C. latifolia and other Curculigo plants contain neoculin and/or GNA family members was unclear.

Results Through de novo RNA-seq assembly of the fruits of C. latifolia and the related C. capitulata and detailed analysis of the expression patterns of neoculin and neoculin-like genes in both species, we assembled 85,697 transcripts from C. latifolia and 76,775 from C. capitulata using Trinity and annotated them using public databases. We identified 70,371 unigenes in C. latifolia and 63,704 in C. capitulata. In total, 38.6% of unigenes from C. latifolia and 42.6% from C. capitulata shared high similarity between the two species. We identified ten neoculin-related transcripts in C. latifolia and 15 in C. capitulata, encoding both the basic and acidic subunits of neoculin in both plants. We aligned these 25 transcripts and generated a phylogenetic tree. Many orthologs in the two species shared high similarity, despite the low number of common genes, suggesting that these genes likely existed before the two species diverged. The relative expression levels of these genes differed considerably between the two species: the transcripts per million (TPM) values of neoculin genes were 60 times higher in C. latifolia than in C. capitulata, whereas those of GNA family members were 15,000 times lower in C. latifolia than in C. capitulata.

Conclusions The genetic diversity of neoculin-related genes strongly suggests that neoculin genes underwent duplication during evolution. The marked differences in their expression profiles between C. latifolia and C. capitulata may be due to mutations in regions involved in transcriptional regulation. Comprehensive analysis of the genes expressed in the fruits of these two Curculigo species helped elucidate the origin of neoculin at the molecular level.

Epigenetics & Genomics

NGS

RNA-seq

neoculin

NBS

NAS

Curculigo capitulata

Curculigo latifolia

expression profile

gene duplication

Curculigo latifolia (Hypoxidaceae family, formerly classified in the Liliaceae family) is a perennial plant found in Southeast Asia, especially the Malay peninsula [1, 2]. According to the Royal Botanic Gardens, Kew, there are 27 species of Curculigo [3]. The genetic diversity and morphology of Curculigo have long been of interest [4-7]. C. latifolia and C. capitulata were previously reclassified as members of the Molineria genus, but recent discussions have suggested that they should be returned to the Curculigo genus. Here, we use the traditional name, Curculigo.

C. latifolia and C. capitulata have a similar vegetative appearance (Fig. 1), but differ in their flower and fruit morphology. In addition, C. capitulata is more widely distributed than C. latifolia. Both species are diploids (2n = 18; x = 9) [8]. C. latifolia is self-incompatible [9], but C. capitulata plants from various botanical gardens in Japan have not been successfully crossed. So, it is unknown whether C. capitulata is self-compatible or self-incompatible. The flowers, roots, stems, and leaves of Curculigo plants have traditionally been used as medicines [10-15]. Notably, C. latifolia fruits, but not those of C. capitulata, produce a taste-modifying protein, neoculin, that makes sour-tasting foods or water taste sweet [1, 16, 17, 18].

Neoculin itself has a sweet taste and is 550 times sweeter than sucrose on the percentage sucrose equivalent scale [19, 20]. Furthermore, neoculin has a taste-modifying activity that converts sourness to sweetness: for example, the sour taste of lemons is changed to a sweet orange taste. Moreover, the presence of neoculin induces sweetness in drinking water, and some organic acids taste sweet when consumed after neoculin [21]. Neoculin is perceived by the human sweet taste receptor T1R2-T1R3, a member of the G-protein-coupled receptor family [22]. Neoculin consists of two subunits that form a heterodimer: the neoculin basic subunit (NBS), also called curculin [16], and the neoculin acidic subunit (NAS) [18, 23]. NBS is a 11-kDa peptide consisting of 114 amino acid residues [16, 24], while NAS has a molecular mass of 13 kDa and 113 residues. The two subunits share 77% identity at the protein level [18]. Several essential amino acids that are responsible for the taste-modifying properties of neoculin have been identified: His-11 in NBS is responsible for the pH-dependent taste-modifying activity of neoculin [25], and Arg-48, Tyr-65, Val-72, and Phe-94 function in the binding and activation of human sweet taste receptors [26]. Changes in the tertiary structure of the subunits at these residues are thought to contribute to the taste-modifying properties of neoculin [27, 28].

Lectins are proteins that recognize and bind to specific carbohydrate structures [29, 30]. Plant lectins are classified into 12 families. Neoculin NBS and NAS are similar in protein sequence and 3-dimensional (3D) structure to the GNA (Galanthus nivalis agglutinin) family of lectins, which are present in bulbs such as snowdrop (Galanthus nivalis) and daffodil (Narcissus pseudonarcissus) and are thought to function as defense or storage proteins [31, 32, 33]. However, NBS and NAS lack a mannose-binding site (MBS) and do not have lectin activity [34-36]. Furthermore, whereas GNA family members in plants such as snowdrop contain one disulfide bond, which functions in intra-subunit bonding, neoculin forms both two intra-subunit bonds and two inter-subunit bonds between NBS and NAS [32].

The fruit of C. latifolia contains 1.3 mg neoculin per fruit [37] or 1.3 mg per one gram of fresh pulp [38]. This is thought to be considerably higher than the levels of total proteins in typical edible fruits [39]. Although the taste-modifying activity of neoculin is well-known, its biological role in C. latifolia is unknown. In addition, as neoculin is not a lectin, it was not clear which lectins are expressed in C. latifolia fruits, especially lectins of the GNA family. Finally, whether other Curculigo species also accumulate neoculin or neoculin-like proteins is unknown.

Here, we compared the gene expression profiles in the fruits of C. latifolia and C. capitulata by transcriptome deep sequencing (RNA-seq). The aim of this study was to comprehensively analyze the two species from the viewpoint of amino acid sequences and gene expression levels to shed light on the origins of neoculin.

De novo RNA-seq assembly from C. latifolia and C. capitulata fruits

We sequenced cDNA libraries from C. latifolia and C. capitulata using the Illumina HiSeq 2500 platform. To analyze the data, we filtered out raw reads with average quality values < 20, reads with < 50 nucleotides, and reads with ambiguous ‘N’ bases. After trimming reads for adapter sequences and filtering, we obtained 44,396,896 reads from C. latifolia and 43,863,400 from C. capitulata. We then assembled high-quality reads from C. latifolia and C. capitulata into 85,697 and 76,775 contigs with a mean length of 775 bp and 744 bp, respectively, using Trinity 2.11. The distribution of transcript lengths and transcripts per million (TPM) values are shown in Additional File 1 and Additional File 2. The N50 values for C. latifolia and C. capitulata transcripts were 1,324 and 1,205, respectively (Table 1). Unigene clustering using CD-Hit revealed 70,371 unigenes in C. latifolia and 63,704 in C. capitulata (Table 1).

Table 1. Overview of de novo RNA-seq assembly from C. latifolia and C. capitulata fruits.

	*C. latifolia*	*C. capitulata*
High-quality reads	44,396,896	43,863,400
Total Trinity genes	69,446	63,951
Total Trinity unigenes	70,371	63,704
Total Trinity transcripts	85,697	76,775
GC (%)	44.0	45.6
N10 (nts)	3,214	2,676
N20 (nts)	2,460	2,103
N50 (nts)	1,324	1,205
Total assembled bases	66,426,868	57,098,016

The gene repertoires of the two Curculigo species fitting the monocots

Low annotation rate of the transcripts: To gather functional information about the transcripts identified from de novo assembly, we aligned all transcripts against nucleotide sequences from various protein databases, including the nonredundant protein (NR) database at the National Center for Biotechnology Information (NCBI), RefSeq, UniProt/Swiss-Prot, Clusters of Orthologous Groups of proteins (COG), the rice (Oryza sativa) genome (Os-Nipponbare-Reference-IRGSP-1.0, Assembly: GCF_001433935.1), and the Arabidopsis (Arabidopsis thaliana) genome (Assembly: GCF_000001735.4) and selected the top hits from these queries. We obtained annotations for 38,433 out of 85,697 transcripts (44.8%) in C. latifolia and 40,554 out of 76,775 transcripts (52.8%) in C. capitulata with a threshold of 1e⁻¹⁰ by performing a Basic Local Alignment Search Tool search with our in silico-translated transcripts against protein databases (BLASTx) using the NR, RefSeq, UniProt, and COG databases and the proteomes of rice and Arabidopsis. All annotations are listed in Additional File 3. The number of annotated transcripts for each database is listed in Table 2. The low annotation rate suggests that the two Curculigo species are significantly different from classical model plant systems that drive much of the information stored in public databases.

Table 2. Number of functional annotations of transcripts from C. latifolia and C. capitulata fruits.

Annotated database	*C. latifolia*	*C. capitulata*
COG¹	11,875	12,448
RefSeq	37,922	39,369
Uniprot	36,783	38,901
NR²	37,118	39,340
Rrice³	34,761	36,204
Arabidopsis⁴	33,332	34,684
All six databases	38,433	40,554

¹ COG: Clusters Groups of proteins.

² NR: nonredundant protein databases of the National Center for Biotechnology Information.

³ Assembly: GCF_001433935.1.

⁴ Assembly: GCF_000001735.4.

Conservation across monocots: After BLASTx searches with the C. latifolia and C. capitulata transcripts against the NR database, we determined the extent of gene conservation across plant species by running Blast2GO [40]. We estimated the similarity of the two Curculigo species to various plant species by counting the number of hits from each species obtained by BLAST searches (Fig. 2). The top six species displaying the highest homology with C. latifolia and C. capitulata transcripts were monocots, like Curculigo, supporting the view that the assembled Curculigo genes are highly similar to known genes from other monocots. The top six species sharing the highest similarity with C. latifolia and C. capitulata were identical in terms of both species and rank order.

Expression of functionally similar genes between the two species: Using the COG database, we classified 11,875 transcripts from C. latifolia and 12,448 from C. capitulata into functional categories (Fig. 3). We observed no significant differences between the two species, which supports the notion that these two species have functionally similar genes.

We also analyzed the functions of the assembled transcripts via Gene Ontology (GO) analysis using the rice genome annotation (Additional File 4). Again, no significant differences were observed between the two species. The results also suggested that the repertoires of genes from the two species are similar to those of better-known species.

The genes with high similarity between C. latifolia and C. capitulata fruits are less than half of the genes

Using the unigene sequences, we analyzed the similarity of between C. latifolia and C. capitulata genes. We performed BLAST searches using each transcript from one species as the query sequence against all transcripts from the other species with a threshold E-value of 1e⁻⁵ or less and selected the reciprocal best hits. We defined unigenes with high similarity between the two species as common genes and unigenes with low similarity between the species, or present in only one species, as unique genes. In total, we deemed 38.6% (27,155 out of 70,371) of genes in C. latifolia and 42.6% (27,155 out of 63,704) of genes in C. capitulata to be common genes (Fig. 4). The relatively small number of common genes suggests that a long time has passed since the divergence of these species, which is consistent with results of lineage analysis based on plastid DNA from Hypoxidaceae family members. Indeed, although the Curculigo genus constitutes a single clade, C. latifolia and C. capitulata are not the most closely related species within this clade [5].

Next, we investigated the proportion of annotated genes in these species using the COG, RefSeq, UniProt, and NR databases and the genomes of rice and Arabidopsis (shown in Table 2). Among the common genes, 17,337 and 17,199 genes were annotated (63.8% and 63.3% of common genes) in C. latifolia and C. capitulata, respectively. By contrast, there were 11,718 annotated unique genes (27.1% of unique genes) among genes found only in C. latifolia and 14,848 (40.6% of unique genes) among those found only in C. capitulata. Thus, the annotation rate was higher for common genes than for unique genes, despite the smaller number of common genes. One possible explanation for this observation is that many of the genes common to both species may also be common genes in other model plant species that are highly represented in the databases employed.

We then compared the expression profiles of 27,155 common genes between C. latifolia and C. capitulata. Although the sequences of the corresponding genes in C. latifolia and C. capitulata were similar, their expression profiles were not necessarily equivalent. Nonetheless, only 111 out of the 27,155 common genes had TPM ratios ≥ 50 (Table 3). Of these 111 genes, five were neoculin-related genes, indicating that the expression profiles of at least some neoculin-related genes differ significantly between the two species.

Table 3. Comparison of the expression profiles of C. latifolia and C. capitulata.

*C. latifolia*			*C. capitulata*
TRYNITY_ID	RefSeq	TPM	TRYNITY_ID	RefSeq	TPM	Pident^†	E-value^†
L_19492_c6_g1_i1	trans-resveratrol di-O-methyltransferase	36282	C_19332_c0_g2_i1	trans-resveratrol di-O-methyltransferase	277	99.18	0
L_20774_c6_g2_i5	trans-resveratrol di-O-methyltransferase	31648	C_20405_c1_g1_i2	trans-resveratrol di-O-methyltransferase	573	99.02	0
*L_22219_c0_g1_i1	mannose-specific lectin-like	7634	*C_16562_c0_g1_i1	mannose-specific lectin-like	80	97.75	0
L_22040_c0_g1_i1	chalcone synthase-like	6483	C_22230_c0_g1_i1	chalcone synthase-like	69	100	0
L_39489_c0_g1_i1	cinnamoyl-CoA reductase 1-like	4584	C_43958_c0_g1_i1	cinnamoyl-CoA reductase 1-like	37	100	0
L_17418_c0_g1_i1	benzyl alcohol O-benzoyltransferase	2848	C_20771_c2_g1_i3	benzyl alcohol O-benzoyltransferase	18	96.27	0
L_18625_c0_g1_i1	glutelin type-A 1-like	2641	C_18515_c0_g1_i1	glutelin type-A 1-like	35	100	0
L_20161_c0_g1_i1	probable polyamine oxidase 5	2333	C_20921_c0_g1_i1	probable polyamine oxidase 5	38	99.17	0
L_20171_c0_g1_i1	pyruvate decarboxylase 1 isoform X1	2140	C_19622_c0_g1_i1	pyruvate decarboxylase 1 isoform X1	30	99.74	0
L_19390_c0_g1_i1	benzyl alcohol O-benzoyltransferase-like	1721	C_20336_c0_g1_i1	benzyl alcohol O-benzoyltransferase-like	25	99.01	0
L_17288_c0_g1_i1	5-methyltetrahydropteroyl-triglutamate--homocysteine methyltransferase 1	1527	C_20491_c0_g1_i4	5-methyltetrahydropteroyl-triglutamate--homocysteine methyltransferase 2-like	19	98.22	0
L_22101_c0_g1_i1	cytochrome P450 71A1-like	1130	C_20591_c0_g1_i1	cytochrome P450 71A1-like	14	100	0
L_9054_c0_g2_i1	uncharacterized protein LOC105052971	891	C_20462_c0_g1_i1	uncharacterized protein LOC105052971	16	99.15	0
L_19899_c1_g1_i5	elongation factor 1-alpha-like	720	C_16211_c0_g1_i1	hypothetical protein CARUB_v100096370mg, partial	11	99.75	0
L_39417_c0_g1_i1	palmitoyl-acyl carrier protein thioesterase, chloroplastic-like	659	C_1125_c0_g1_i1	palmitoyl-acyl carrier protein thioesterase, chloroplastic-like	0.89	99.88	0
L_8999_c0_g1_i1	probable protein Pop3	657	C_3239_c0_g1_i1	probable protein Pop3	10	99.79	0
*L_16562_c0_g1_i1	mannose-specific lectin-like	652	*C_16324_c0_g1_i1	mannose-specific lectin-like	8	98.8	0
L_20784_c0_g1_i1	mannan endo-1,4-beta-mannosidase 5-like	477	C_20300_c0_g1_i1	mannan endo-1,4-beta-mannosidase 5-like	8	99.81	0
L_17063_c0_g1_i1	uncharacterized protein LOC103705182	457	C_15604_c0_g1_i1		7	99.51	0
L_9763_c0_g1_i1	4-hydroxyphenyl-pyruvate dioxygenase	441	C_17419_c0_g1_i2	4-hydroxyphenyl-pyruvate dioxygenase	6	97.85	0
L_15645_c0_g1_i1	hypothetical protein PHAVU_005G042200g	378	C_19503_c0_g1_i2	uncharacterized protein LOC103713005	4	98.6	0
L_39500_c0_g1_i1	uncharacterized protein C24B11.05-like isoform X2	323	C_15665_c0_g1_i2	uncharacterized protein C24B11.05-like isoform X2	6	96.74	0
L_16206_c0_g1_i1	cytochrome P450 71A1-like	295	C_18399_c0_g1_i1	cytochrome P450 71A1-like	5	99.88	0
L_9770_c0_g1_i1	Os09g0480700, partial	278	C_11365_c0_g1_i1	Os09g0480700, partial	3	99.52	0
L_20943_c2_g1_i1	LOW QUALITY PROTEIN: ATP-citrate synthase beta chain protein 1-like	276	C_20189_c1_g1_i6	LOW QUALITY PROTEIN: ATP-citrate synthase beta chain protein 1-like	5	99.74	0
L_5031_c0_g1_i1		265	C_26197_c0_g1_i1		3	99.53	2E-108
L_19581_c0_g1_i1	peroxidase 43	244	C_20763_c0_g1_i7	peroxidase 43	3	99.32	0
L_22200_c0_g1_i1		237	C_21279_c0_g3_i1		0	92.42	0
L_16082_c0_g1_i1	uncharacterized protein LOC105035694	230	C_20815_c0_g1_i2	uncharacterized protein LOC105035694	4	97.73	0
L_1821_c0_g1_i1	protein EARLY RESPONSIVE TO DEHYDRATION 15-like	213	C_5863_c0_g3_i1	protein EARLY RESPONSIVE TO DEHYDRATION 15-like	1	94.17	0
L_21840_c4_g7_i1		197	C_46444_c0_g1_i1		2	100	0
L_11489_c0_g1_i1		189	C_51079_c0_g1_i1		3	95.13	4E-114
L_21813_c0_g1_i1	protein kinase APK1B, chloroplastic-like	184	C_20869_c0_g1_i9	protein kinase APK1B, chloroplastic-like	0.97	95.04	0
L_16611_c0_g1_i1		163	C_8161_c0_g1_i1		2	100	0
L_12355_c0_g1_i1	myb-related protein 306-like	160	C_7266_c0_g1_i1	myb-related protein 306-like	3	99.89	0
L_18378_c0_g1_i1	probable L-ascorbate peroxidase 4	158	C_17994_c1_2_i1	probable L-ascorbate peroxidase 4	2	96.39	0
L_21677_c0_g1_i1	S-adenosylmethionine decarboxylase proenzyme-like	149	C_15562_c0_g2_i1	S-adenosylmethionine decarboxylase proenzyme-like	0.92	96.67	0
L_14830_c0_g1_i1	NAC transcription factor 29-like	135	C_20428_c0_g1_i1	NAC transcription factor 29-like	0	97.61	0
L_14165_c0_g2_i1	probable peroxygenase 4	131	C_17339_c0_g1_i2	probable peroxygenase 4	2	95.32	0
L_21840_c4_g4_i2		130	C_11729_c0_g1_i1		2	100	0
L_39737_c0_g1_i1	Glutathione peroxidase 2	127	C_8347_c0_g1_i1	Glutathione peroxidase 2	2	96.26	0
L_4928_c0_g1_i1		124	C_44794_c0_g1_i1		1	100	3E-101
L_20250_c0_g1_i1	protein NRT1/ PTR FAMILY 5.6-like	114	C_29979_c0_g1_i1	protein NRT1/ PTR FAMILY 5.6-like	2	97.44	0
L_15628_c0_g1_i1	formin-A-like	103	C_20575_c0_g1_i5	formin-A-like	0	90.44	0
L_21235_c2_g9_i1		101	C_9877_c0_g1_i1		1	92.77	3E-98
*L_19752_c0_g1_i1	mannose-specific lectin 3-like	33	*C_18595_c0_g1_i1	mannose-specific lectin 3-like	2301	97.6	0
L_16463_c0_g2_i2	LOW QUALITY PROTEIN: S-norcoclaurine synthase-like	16	C_6989_c0_g1_i1	LOW QUALITY PROTEIN: S-norcoclaurine synthase-like	8393	91.39	0
L_32395_c0_g1_i1		14	C_4973_c0_g1_i1		8765	86.17	4E-92
L_19456_c0_g1_i1	polyphenol oxidase, chloroplastic-like	13	C_20237_c3_g1_i1	polyphenol oxidase, chloroplastic-like	1496	83.82	0
L_14333_c0_g1_i1		12	C_13197_c0_g1_i1		42047	94.54	7E-75
L_55067_c0_g1_i1	defensin Ec-AMP-D1 {ECO:0000303\| PubMed:18625284}-like	9	C_39416_c0_g1_i1	defensin Ec-AMP-D1 {ECO:0000303\| PubMed:18625284}-like	2475	95.1	0
L_5253_c0_g1_i1	Disease resistance-responsive (dirigent-like protein) family protein, putative	9	C_16870_c0_g2_i1	Disease resistance-responsive (dirigent-like protein) family protein, putative	547	94.54	0
L_1586_c0_g1_i1	glycine-rich protein-like isoform X1	8	C_39384_c0_g1_i1		2895	94.72	0
L_23556_c0_g1_i1	basic blue protein-like	5	C_14117_c0_g1_i1	basic blue protein-like	606	94.75	0
L_13618_c0_g1_i1	non-specific lipid-transfer protein 1-like	5	C_13976_c0_g1_i1	lipid transfer protein precursor	655	96.6	0
L_465_c0_g2_i1	microsomal glutathione S-transferase 3-like	5	C_4959_c0_g1_i1	microsomal glutathione S-transferase 3-like	246	93.89	0
L_21384_c3_g4_i1		5	C_17484_c0_g1_i1		424	86.76	1E-56
L_9003_c0_g1_i1	dirigent protein 22-like isoform X1	5	C_19511_c0_g1_i1	dirigent protein 22-like	834	96.09	0
L_4015_c0_g1_i1	CASP-like protein 2A1	4	C_4840_c0_g1_i1	CASP-like protein 2A1	241	98.25	0
L_16618_c0_g1_i1	hypothetical protein SORBIDRAFT_05g026700	3	C_4999_c0_g1_i1	Bowman-Birk type trypsin inhibitor-like isoform X2	5459	86.06	1E-135
L_4834_c0_g2_i1	xylem serine proteinase 1-like	3	C_9966_c0_g1_i1	subtilisin-like protease	232	96.91	0
L_6907_c0_g1_i1	serine/threonine-protein kinase CDL1-like	3	C_11871_c1_g1_i1	serine/threonine-protein kinase CDL1-like	183	96.44	0
L_17444_c0_g1_i1	cytochrome P450 CYP82D47-like	3	C_20684_c0_g1_i1	cytochrome P450 CYP82D47-like	182	94.92	0
L_40485_c0_g3_i1	non-specific lipid-transfer protein 1-like	3	C_39065_c0_g1_i1	non-specific lipid-transfer protein 1-like	1455	90.92	0
L_18380_c0_g1_i2	conserved hypothetical protein	3	C_39186_c0_g1_i1	conserved hypothetical protein	654	93.2	4E-127
L_31252_c0_g1_i1		3	C_12650_c0_g1_i1	non-specific lipid-transfer protein-like	283	94.9	4E-65
L_42464_c0_g2_i1	alpha carbonic anhydrase 8-like, partial	3	C_40148_c0_g1_i1	alpha carbonic anhydrase 7-like	235	93.1	0
L_13852_c0_g1_i1	endoglucanase 6	3	C_18579_c0_g1_i1	endoglucanase 19-like	707	97.65	0
L_39898_c0_g1_i1	oxygen-evolving enhancer protein 3-1, chloroplastic-like	2	C_12932_c0_g1_i1	oxygen-evolving enhancer protein 3-1, chloroplastic-like	160	96.93	0
L_6056_c0_g1_i1	Calvin cycle protein CP12-1, chloroplastic-like	2	C_12691_c0_g1_i1	calvin cycle protein CP12-1, chloroplastic	215	92.42	3E-128
L_6093_c0_g1_i1		2	C_41578_c0_g1_i1		170	93.87	2E-135
L_17773_c0_g1_i1	uncharacterized protein LOC105056845	2	C_12165_c0_g1_i1	uncharacterized protein LOC105056845	249	92.36	0
L_24151_c0_g1_i1	ribonuclease 3-like	2	C_39292_c0_g1_i1	ribonuclease 3-like	389	98.07	0
L_8678_c0_g1_i1	uncharacterized protein LOC105056672	2	C_4730_c0_g1_i1	uncharacterized protein LOC105056672	116	98.71	0
L_19431_c2_g4_i1	polyubiquitin 4-like, partial	2	C_20039_c0_g8_i1	hypothetical protein PHAVU_003G1236000g, partial	2116	94.38	6E-106
L_250_c1_g1_i1	probable glutathione S-transferase parA	2	C_16559_c0_g1_i1	probable glutathione S-transferase parA	419	98.26	0
L_10676_c0_g1_i1	probable linoleate 9S-lipoxygenase 5	2	C_17658_c0_g1_i1	probable linoleate 9S-lipoxygenase 5	1644	98.82	0
L_8975_c0_g1_i1	chitinase-like protein 1	2	C_16475_c1_g1_i1	chitinase-like protein 1	183	97.92	0
L_44393_c0_g1_i1	hypothetical protein POPTR_0004s03650g	2	C_18495_c2_g1_i1	conserved hypothetical protein	2751	92.78	3E-66
L_759_c0_g1_i1	CAS1 domain-containing protein 1-like	2	C_21365_c0_g1_i1	CAS1 domain-containing protein 1-like isoform X2	183	99.26	0
L_30327_c0_g1_i1	conserved hypothetical protein	2	C_23037_c0_g1_i1	conserved hypothetical protein	134	96.47	2E-116
L_5572_c0_g1_i1	short-chain type dehydrogenase/reductase-like	2	C_14194_c0_g1_i1	short-chain type dehydrogenase/reductase-like	205	96.22	0
L_45139_c0_g1_i1	putative germin-like protein 2-1	2	C_21890_c0_g1_i1	putative germin-like protein 2-1	111	96.2	1E-146
L_22251_c0_g1_i1	xyloglucan endotransglucosylase/ hydrolase protein 9-like	2	C_40495_c0_g2_i1	LOW QUALITY PROTEIN: xyloglucan endotransglucosylase/ hydrolase protein 9-like	131	98.37	0
L_56341_c0_g1_i1	peroxidase 4-like	1	C_10149_c0_g1_i1	peptide-N4-(N-acetyl-beta-glucosaminyl)asparagine amidase A-like	1301	93.81	2E-93
L_56680_c0_g1_i1	peptide-N4-(N-acetyl-beta-glucosaminyl)asparagine amidase A-like	1	C_19920_c0_g1_i1	peroxidase 4-like	583	98.68	7E-113
*L_307_c0_g2_i1	mannose-specific lectin-like	1	*C_9931_c0_g1_i1	mannose-specific lectin-like	14867	99.35	0
L_48085_c0_g1_i1	probable indole-3-acetic acid-amido synthetase GH3.1	1	C_19080_c1_g1_i1	probable indole-3-acetic acid-amido synthetase GH3.1	107	84.39	2E-70
L_46946_c0_g1_i1	chlorophyll a-b binding protein 7, chloroplastic-like	1	C_41884_c0_g1_i1	chlorophyll a-b binding protein, chloroplastic	141	98.41	0
L_4845_c0_g1_i1	chlorophyll a-b binding protein CP26, chloroplastic-like	1	C_10575_c0_g1_i1	chlorophyll a-b binding protein CP26, chloroplastic-like	353	98.25	0
L_23363_c0_g1_i1	uncharacterized protein LOC105056050	1	C_21609_c0_g1_i1	uncharacterized protein LOC105056050	1612	98.84	0
*L_30823_c0_g1_i1	mannose-specific lectin-like	1	*C_17363_c2_g1_i3	mannose-specific lectin-like	317	98.64	0
L_645_c0_g1_i1	putative lipid-transfer protein DIR1	1	C_12082_c0_g1_i1	putative lipid-transfer protein DIR1	108	97.09	0
L_50661_c0_g1_i1	oxygen-evolving enhancer protein 2, chloroplastic-like	1	C_14711_c0_g1_i1	oxygen-evolving enhancer protein 2, chloroplastic-like	133	97.74	0
L_16663_c0_g1_i2		1	C_20564_c0_g1_i1		642	89.54	2E-112
L_41624_c0_g1_i1	isocitrate lyase	1	C_15046_c0_g1_i1	isocitrate lyase	116	98.32	7E-180
L_33923_c0_g1_i1	galactinol synthase 2-like isoform X1	1	C_13705_c0_g1_i1	galactinol synthase 1-like	127	92.82	0
L_36400_c0_g1_i1	putative cell wall protein	0.98	C_26021_c0_g1_i1	putative cell wall protein	117	98.05	2E-98
L_53880_c0_g1_i1	uncharacterized protein LOC105056050	0.93	C_5177_c0_g1_i1	proactivator polypeptide-like 1	644	98.94	0
L_6399_c0_g1_i1	auxin-induced protein 22D-like	0.93	C_10469_c0_g1_i1	auxin-induced protein 22D-like	171	96.71	0
L_10569_c0_g2_i1		0.91	C_16356_c0_g1_i1		1072	93.62	0
L_21646_c1_g1_i3	protein HOTHEAD-like	0.91	C_14207_c0_g1_i1	protein HOTHEAD-like	330	96.12	0
L_22097_c0_g1_i1		0.82	C_9693_c0_g2_i1		155	92.67	0
L_30250_c0_g1_i1	polygalacturonase inhibitor	0.64	C_17486_c1_g1_i2	Polygalacturonase inhibitor	171	93.72	2E-170
L_50985_c0_g1_i1	putative phytosulfokines 6 isoform X1	0.47	C_22933_c0_g1_i1	putative phytosulfokines 6 isoform X2	136	95.71	0
L_39567_c0_g2_i1	profilin-1	0	C_15886_c0_g1_i1	profilin-1	615	97.97	0
L_5103_c0_g1_i1	trans-resveratrol di-O-methyltransferase-like	0	C_39904_c0_g1_i1	trans-resveratrol di-O-methyltransferase-like	430	79.15	0
L_24431_c0_g6_i1	60S ribosomal protein L24	0	C_1942_c0_g1_i1	60S ribosomal protein L24	278	97.76	0
L_3220_c0_g1_i1		0	C_1273_c0_g1_i1	chlorophyll a-b binding protein 6, chloroplastic	264	92.97	3E-47
L_16735_c0_g2_i2	uncharacterized protein LOC105047938	0	C_39063_c0_g1_i1	uncharacterized protein LOC105047938	172	92.49	0
L_256_c0_g1_i2	Os06g0133500	0	C_16734_c1_g2_i1	Os06g0133500	151	92.07	9E-165

*: neoculin-related transcripts (cf. Fig. 5 and Additional File 6)

^†: Pident and E-value are BLASTN results performed with C. latifolia as query against C. capitulata.

Common genes with TPM value ≥ 50 between the two species, except when the TPM values of both genes is < 100. The genes were sorted based on the TPM value of C. latifolia along with the corresponding genes of C. capitulata. Note that there were no cases of genes that were highly expressed in both species. This pattern strongly suggests changes in the gene expression regulatory system due to divergence of two species.

Lectin genes expressed in C. latifolia and C. capitulata fruits

We previously demonstrated that C. latifolia fruits contain a taste-modifying protein consisting of a NBS-NAS heterodimer that is similar to lectins in the GNA family. We therefore investigated the number of lectin genes expressed in the fruits of C. latifolia and C. capitulata that were categorized into each of the 12 lectin families to better understand the general outline of the GNA gene family in these species. To determine the number of lectin genes, we performed tBLASTN searches against all transcripts in each species using the sequences of 12 representative lectins as query [41] (Table 4). In both species, the largest lectin family was the GNA family, which includes the neoculin (NBS and NAS) genes. Ten of the 45 lectin genes in C. latifolia and 13 of the 49 lectin genes in C. capitulata belonged to the GNA family. Thus, we analyzed the many GNA family genes in these species, including the neoculin genes, in more detail.

Table 4. Number of predicted lectin genes using tBLASTN in C. latifolia and C. capitulata fruits.

Lectin domain	Model lectin	*C. latifolia*	*C. capitulata*
ABA domain	Agaricus bisporus agglutinin	0	0
Amaranthin domain	Amaranthus caudatus agglutinin	0	0
CRA domain	Robinia pseudoacacia chitinase-related agglutinin	3	4
Cyanovirin domain	Nostoc ellipsosporum agglutinin	0	0
EUL domain	Euonymus europaeus agglutinin	1	1
GNA domain	Galanthus nivalis agglutinin	10	13
Hevein domain	Hevea brasiliensis agglutinin	3	2
JRL domain	Artocarus integer agglutinin	9	4
Legume domain	Glycine max agglutinin	8	16
LysM domain	Brassica juncea LysM domain	1	1
Nictaba domain	Nicotiana tabacum agglutinin	10	8
Ricin-B domain	Ricinus communis agglutinin	0	0
Total number of lectin genes		45	49

Analysis of GNA family and neoculin-related transcripts

We constructed a phylogenetic tree using the deduced protein sequences from 17 transcripts of well-known GNA family members and 25 full-length neoculin-related transcripts from Curculigo (10 from C. latifolia and 15 from C. capitulata; Fig. 5); the method used for sequence selection is shown in Additional File 5. The TPM values (calculated by RSEM) are listed after the transcript IDs. An alignment of all sequences is shown in Additional File 6. The C. latifolia transcript L_16562_c0_g1_i1 was a good match for NBS, while L_16562_c0_g1_i2 was a good match for NAS, except for one amino acid substitution (Additional File 7); these transcripts will be referred to as NBS and NAS hereafter. The predicted proteins derived from neoculin-related transcripts formed a distinct group separate from known GNA family members. Neoculin-like sequences formed one group that included NBS and NAS (named the ‘neoculin group’), as well as two other large groups (group 1 and group 2) (Fig. 5). In addition to NBS and NAS, the neoculin group also included proteins whose transcripts were highly expressed (C_9931_c0_g1_i1) and that presented the conserved amino acid residues critical for binding mannose (and thus have the potential for lectin activity). In addition, each transcript had an ortholog in both Curculigo species. Furthermore, transcripts from this group exhibited such a high DNA sequence identity that qRT-PCR analysis could not be performed with high accuracy on individual members.

Many highly expressed transcripts belonged to group 1 (L_22219_c0_g1_i1 [TPM: 7,600]; C_18595_c_g1_i1 [TPM: 2,300]; C_9454_c0_g1_i1 [TPM: 2,000]). Although these highly expressed transcripts encode proteins that are very similar to mannose-binding lectins, they are not mannose-binding lectins, as they lack the conserved and essential amino acid residues that form the mannose-binding sites. At this time, we do not know their physiological functions or the reason for their high expression. Predicted proteins encoded by group 2 transcripts were also relatively close to the lectins Polygonatum multiflorum agglutinin (PMA) and Polygonatum roseum agglutinin (PRA) from the Polygonatum genus. Unlike in group 1, there were no highly expressed transcripts in this group.

In each group, we detected neoculin-related orthologous transcripts with high similarity between C. latifolia and C. capitulata. The existence of many orthologs in each species, combined with the presence of relatively few common genes (comprising only approximately 40% of all transcripts in both species; Fig. 4), is noteworthy. We infer that these orthologs probably existed before the divergence of these two species, whereas their amino acid differences probably arose afterwards. Genetic diversity is beneficial for plants, including Curculigo, due to their lack of mobility to increase population survival against multiple stresses. It would be interesting to determine whether Curculigo plants other than C. latifolia and C. capitulata contain neoculin-related genes, especially genes in the neoculin group.

Within the neoculin group, we identified transcripts encoding proteins with high similarity to NBS and NAS in both C. latifolia and C. capitulata. Notably, although the corresponding NBS and NAS genes were highly expressed in C. latifolia, their C. capitulata orthologs were only weakly expressed (C_16324_c0_g1_i1 and C_16324_c0_g1_i2). The TPM values for NBS and NAS genes in C. latifolia were approximately the same, with 650 and 620 TPMs, respectively. This result is in agreement with the finding that their encoded proteins form a heterodimer [18]. Although C_9931_c0_g1_i1 was highly expressed in C. capitulata, with a TPM value of 15,000 (the fifth highest expression level among all C. capitulata transcripts), its C. latifolia ortholog (L_307_c0_g1_i1, L_307_c0_g2_i1) was expressed at a very low level. Curiously, in all three groups (neoculin group, groups 1 and 2) for which there were orthologs in both species, if a gene was highly expressed in one species, its ortholog was weakly expressed in the other species; we did not identify a single case where orthologs were highly expressed in both species. The data shown in Table 3 also support this pattern. These results strongly suggest changes in the gene expression regulatory system due to divergence of the two species.

Next, we aligned the deduced amino acid sequences for the proteins belonging to the neoculin group (Fig. 6a). We divided the sequences into nine regions, including the regions removed by cleavage of the secretion signal peptide and three mannose binding site (MBS)-like regions: N pro-sequence (N-Pro), N-terminal (N-term), MBS1, inter1, MBS2, inter2, MBS3, C-terminal (C-term), and C pro-sequence (C-Pro). The His-11 residue was present in the N-term region of NBS and in the predicted proteins encoded by transcripts L_16562_c0_g1_i1 in C. latifolia and C_16324_c0_g1_i1 in C. capitulata. This site essential for the pH-dependent taste-modifying activity of neoculin. By contrast, transcripts C_9931_c0_g1_i1 in C. capitulata and L_307_c0_g1_i1 and L_307_c0_g2_i1 in C. latifolia (abbreviated ‘C_9931 series’) did not code for His-11, which was replaced by Tyr-11, as in NAS. In addition, Cys-77 and Cys-109, which form an intermolecular disulfide bond between NBS and NAS, were present within the inter2 and C-term regions in both species, but were absent in the C_9931 series. Thus, it is likely that proteins corresponding to the C_9931 series do not form dimers.

Four residues are responsible for the binding and activation of the human sweet receptor: Arg-48, Tyr-65, Val-72, and Phe-94 [26]. Although Tyr-65 and Val-72 were identified in the C_9931 series, Leu-48 and Val-94 were missing. The lack of His-11 and these four indispensable residues, as well as the lack of dimerization, indicate that the C_9931 series proteins may not possess the sweet taste or taste-modifying properties of classic neoculin. Indeed, a preliminary test indicated that C. capitulata fruits did not have a sweet taste or taste-modifying properties despite the high expression level of C_9931_c0_g1_i1 (data not shown). Three sites similar to the MBS were present in the MBS1, MBS2, and MBS3 regions of this protein. Moreover, whereas NBS and NAS lack the essential residues of the MBS, all of these residues were conserved in C_9931_c0_g1_i1, making C_9931_c0_g1_i1 a likely lectin candidate.

Based on this protein alignment, we investigated all amino acid substitutions in each region in comparison to the two reference sequences, NBS and NAS (Additional File 8). The amino acid substitution rate with reference to NBS is shown in the heatmap in Fig. 6b. Between the NBS series and the NAS series, 18% to 27% of substitutions occurred in the overall regions from the N-term region to C-term region (23%, 26 of 114 residues in NBS). The highest substitution rate was 27% in the MBS2 region, followed by 24% in the inter2 and C-term regions. In the C_9931 series, the highest substitution rate was 53% in the C-term region, followed by the MBS3 region (44%) and inter2 region (43%). These results suggest that the region from inter2 to C-term is the main source of sequence diversity among neoculin group members.

Biochemical analysis

We extracted proteins from C. latifolia and C. capitulata fruits and subjected them to SDS-PAGE, followed by Coomassie brilliant blue (CBB) staining and immunoblotting using a mixture of polyclonal anti-NAS and anti-NBS specific antibodies (Fig. 7). The CBB-stained gel is shown in Fig. 7a and the corresponding immunoblot in Fig. 7b. By CBB staining, we detected an 11-kDa band representing NBS and a 13-kDa band representing NAS in C. latifolia fruit samples (Fig. 7a). In C. capitulata fruits, some bands around 11 kDa may be the protein encoded by C_9931_c0_g1_i1, which had a high TPM value. Immunoblotting confirmed the identity of the bands corresponding to NBS and NAS in C. latifolia fruits. However, we detected no such bands in C. capitulata fruits (Fig. 7b), perhaps because NBS and NAS accumulate at very low levels in this species, as reflected by the low TPM values of their encoding transcripts (as described above). The amino acid sequence of the C-term region, which is recognized by the antibody, was also very different in C_9931_c0_g1_i1 compared to both NBS and NAS, which is consistent with the finding that the proteins detected by CBB staining were not detected by immunoblotting.

The C. latifolia and C. capitulata transcriptomes contain many neoculin-related genes that are similar within and between species. This diversity is thought to result from gene duplication, which is known to contribute to plant evolution [41-47]. Such gene duplication might place some genes under the same transcriptional regulation. The neoculin genes NBS and NAS are likely paralogs that arose due to tandem duplication before the divergence of C. latifolia and C. capitulata. The characteristics of NBS and NAS genes in C. latifolia and C. capitulata are summarized in Table 5. Both C. latifolia and C. capitulata produce NBS and NAS transcripts, and the sequences of the C_9931 series transcripts matched those of active GNA family members. However, their expression levels in the two species were very different.

C. latifolia fruits have been reported to accumulate 1.3 mg neoculin g⁻¹ fresh pulp. Because neoculin is 550 times as sweet as sucrose [19, 20], one gram of C. latifolia fruit pulp is thus estimated to be equivalent to 715 mg of sucrose in sweetness, explaining the sweet taste of these fruits. Given that the TPM values of the neoculin genes in C. capitulata were only 1/60 those detected in C. latifolia, C. capitulata fruits would be expected to contain only approximately 22 ng neoculin g⁻¹ fresh pulp and have the same sweetness as 12 mg of sucrose. Based on these values, it seemed likely that C. capitulata fruits would not taste sweet, which we confirmed in a preliminary test. Thus, neoculin levels, and therefore taste, differ greatly between these fruits, paralleling the difference in the expression of NBS and NBA genes in the two species. The taste of C. latifolia fruits may strongly influence its survival strategies. For example, the sweet taste conferred by neoculin may facilitate seed spread by animals.

Table 5. Summary of neoculin group transcripts in fruits of two Curculigo species.

	Transcript ID	Reference transcript	No. of substitutions (amino acid)	Heterodimerization (no. of Cys)	Lectin activity (no. of MBS*)	Taste modification	Expression (TPM**)
*C. latifolia*	L_16562_c0_g1_i1 (NBS)	NBS	0	Yes (4)	No (0)	Yes	High (650)
	L_16562_c0_g1_i2 (NAS)	NAS	1	Yes (4)	No (0)	Yes	High (620)
	L_307_c0_g2_i1	C_9931_c0_g1_i1	1	Unknown (2)	Unknown (3)	Unknown	Very low (1.4)
	L_307_c0_g1_i1	C_9931_c0_g1_i1	1	Unknown (2)	Unknown (3)	Unknown	Very low (0.35)
*C. capitulata*	C_16324_c0_g1_i1	NBS	6	Probably Yes (4)	Probably No (0)	Probably Yes	Low (8.0)
	C_16324_c0_g1_i2	NAS	0	Probably Yes (4)	Probably No (0)	Probably Yes	low (11)
	C_9931_c0_g1_i1	C_9931_c0_g1_i1	-	Unknown (2)	Unknown (3)	Unknown (Fruits have no activity)	Very high (15,000)

The number of amino acid residues difference from the reference sequences, the potential for heterodimerization, lectin activity, taste modification, and expression levels of the transcripts from C. latifolia or C. capitulata fruits are summarized. As the reference sequences, amino acid sequences of NBS, NAS, and C_9931_c0_g1_i1 of C. capitulata were used.

* MBS, mannose-binding site.

** TPM, transcripts per million.

The structure of the taste-modifying protein miraculin is similar to those of the soybean (Glycine max) Kunitz trypsin inhibitor and thaumatin, a sweet protein with an α-amylase or trypsin-inhibitor-like structure. Similarly, neoculin has a structure similar to that of lectin, a common molecular structure in plants [48-54]. Trypsin inhibitors, amylase inhibitors, and lectins commonly accumulate in fruits and seeds. The diversity of these proteins arose from gene duplications and mutations during evolution. It appears that over the course of evolution, neoculin, miraculin, and thaumatin all acquired sweetness or taste-modifying activity in regard to human senses.

Lectins are thought to play important protective and storage roles in general plants. Thus, the high expression levels of lectin genes in C. capitulata fruits is likely to reflect important roles of lectins in this plant. In contrast, the low expression levels of neoculin genes in C. capitulata suggest that the encoded protein may be less beneficial in this species. Similarly, and in contrast to C. capitulata, active GNA family members were barely expressed in C. latifolia fruits. Neoculin genes were highly expressed in C. latifolia but weakly expressed in C. capitulata despite the similar vegetative appearance of the two plants (Fig. 1). These physiological differences might be due to mutation(s) of the cis-regulatory elements in these genes. Cis-elements, including promoters, enhancers, and silencers, are very important for the regulation of gene expression [41, 55-58]. Likewise, the different expression levels of related genes in C. latifolia vs. C. capitulata might be caused by mutations in their cis-elements. For example, the cis-elements of the NBS and NAS genes may have mutated after the divergence of the two species, or the genes may have acquired mutations or lost cis-elements during the gene duplication events that led to their divergence, leading to different expression patterns. Deciphering the genomic information of these two species further might help verify this notion and distinguish among these possible mechanisms.

RNA-seq analysis and de novo transcriptome assembly of C. latifolia and C. capitulata fruits revealed the presence of numerous neoculin-like genes. Among the various neoculin-related genes that arose from gene duplication, several mutations accumulated, resulting in the genes encoding NBS and NAS. These proteins form the heterodimeric protein neoculin, which exhibits taste-modifying activity in humans. Our comprehensive investigation of the genes expressed in the fruits of these two Curculigo species will help uncover the origin of neoculin at the molecular level.

Plant materials

C. latifolia (voucher ID 26092) was obtained from the Research Center for Medicinal Plant Resources, National Institutes of Biomedical Innovation, Health, and Nutrition, Tsukuba, Japan (originated in Indonesia). C. capitulata (voucher ID 31481) was obtained from The Naito Museum of Pharmaceutical Science and Industry, Kakamigahara, Japan. The plants were cultivated in a greenhouse at the Yamashina Botanical Research Institute. Photographs of the fruits of these plants are shown in Fig. 1.

Fruit setting

C. latifolia flowers were pollinated by hand in the morning on the first day of flowering. C. capitulata flowers were placed in 50 ppm of 1-naphthylacetic acid (NAA) in the morning of both the first and second days of flowering. This is the first report of a method to induce C. capitulata fruit set through plant hormone application. About 60 days after flowering, mature fruits were harvested and immediately soaked in RNA later^TMsolution (Thermo Fisher Scientific, MA. USA). The fruits were stored at −80 ºC until use. The samples were ground into a powder in liquid nitrogen prior to RNA extraction. Total RNA was extracted from the frozen samples using the phenol-SDS method, and poly(A)⁺mRNA was purified using an mRNA Purification Kit (Amersham Biosciences, Buckinghamshire, UK).

Sequencing

mRNA sequencing was performed by Hokkaido System Science Co., Ltd. (Hokkaido, Japan). A cDNA library was generated using TruSeq RNA Sample Prep Kit v2 (Illumina, Inc., CA. USA) and sequenced on an Illumina HiSeq 2500 platform (101 bp read length, paired-end, unstranded). The raw reads were cleaned using cutadapt1.1 [59] and trimmomatic0.32 [60]. We removed adapter sequences, low-quality sequences (reads with ambiguous ‘N’ bases), and reads with Q-value < 20 bases. Sequences smaller than 50 bases were eliminated. The remaining high-quality reads were assembled into contigs using Trinity2.11 [61] with default options. We quantified transcript levels as transcripts per million (TPM) values using Bowtie1.12 [62] and RSEM (RNA-Seq by Expectation-Maximization) [63] in the Trinity package.

Sequence clustering

The assembled sequences were compared against the NCBI NR, prot-plant from RefSeq, UniProt, the rice genome (Os-Nipponbare-Reference-IRGSP-1.0, Assembly: GCF_001433935.1), and the Arabidopsis genome (Arabidopsis thaliana, Assembly: GCF_000001735.4) with an E-value < 1e⁻¹⁰. BLAST analysis was performed using BLAST version 2.2.31. CD-Hit (cd-hit-est) [64, 65] was used for clustering with the option of threshold (-c) 0.9 to obtain unigenes.

Comparison of gene expression in C. latifolia vs. C. capitulata fruits

To compare the transcripts in C. latifolia vs. C. capitulata fruits, a BLASTN search was performed with E-value < 1e⁻⁵ using each transcript from one species as the query against all transcripts from the other species, and then the best hits were selected.

Identification of lectin gene transcripts in C. latifolia and C. capitulata fruits

A tBLASTN search (E-value < 1e⁻⁴; other options set to the default) was performed against all transcripts in C. latifolia and C. capitulata fruits with the following protein sequences as the queries, which represent each plant lectin family [41]: Agaricus bisporus (white mushroom) agglutinin (UniProtKB/Swiss-Prot: Q00022.3—ABA), Amaranthus caudatus (foxtail amaranth) agglutinin (GenBank: AAL05954.1—amaranthin), Robinia pseudoacacia (black locust) chitinase-related agglutinin (GenBank: ABL98074.1—CRA), Nostoc ellipsosporum (cyanobacterium) agglutinin (UniProtKB/Swiss-Prot: P81180.2—cyanovirin), Euonymus europaeus (European spindle) agglutinin (GenBank: ABW73993.1—EUL), Galanthus nivalis (snowdrop) agglutinin (UniProtKB/Swiss-Prot: P30617.1—GNA), Hevea brasiliensis (rubber tree) agglutinin (GenBank: ABW34946.1—hevein), Artocarpus integer (chempedak) agglutinin (GenBank: AAA32680.1—JRL), Glycine max (soybean) agglutinin (UniProtKB/Swiss-Prot: P05046.1—legume lectin), Brassica juncea (brown mustard) LysM domain (GenBank: BAN83772.1—LysM), Nicotiana tabacum (tobacco) agglutinin (GenBank: AAK84134.1—Nictaba), and the lectin chain of Ricinus communis (castor bean) agglutinin (GenBank: PDB: 2AAI_B—ricin B). The top hits were selected.

Phylogenetic analysis of the GNA protein family

The sequences of 17 well-known GNA proteins were selected according to Shimizu-Ibuka et al. [36]. The protein sequences for ASA, Allium sativum (garlic) (1BWU); GNA, Galanthus nivalis (snowdrop) (1MSA); and NPL, and Narcissus pseudonarcissus (wild daffodil) (1NPL) were obtained from the Protein Data Bank. Others sequences were selected from GenBank as follows: PRA, Polygonatum roseum (AY899824); PMA, Polygonatum multiflorum (Solomon’s seal) (U44775); CMA, Clivia miniata (kaffir lily) (L16512); ZCA, Zephyranthes candida (autumn zephyr lily) (AF527385); AAA, Allium ascalonicum (shallot) (L12172); ACA, Allium cepa (onion) (AY376826); AUA, Allium ursinum (wild garlic) (U68531); THC, Tulipa hybrid cultivar (tulip) (U23043); ZOA, Zingiber officinale (ginger) (AY657021); ACO, Ananas comosus (pineapple) (AY098512); AKA, Amorphophallus konjac (konjac) (AY191004); DPA, Dioscorea polystachya (yam tuber) (AB178475); CHC, Cymbidium hybrid cultivar (cymbidium) (U02516); and EHA, Epipactis helleborine (broad-leaved helleborine) (U02515). These 17 sequences and 25 neoculin-related proteins predicted from full-length transcripts (10 transcripts from C. latifolia and 15 from C. capitulata; Fig. 5) were aligned using ClustalX [66], and the neighbor-joining tree was generated and analyzed with 1,000 replicates for bootstrap testing.

Biochemical analysis

SDS-PAGE was carried out using fruit extracts from C. latifolia and C. capitulata. The proteins were visualized by Coomassie brilliant blue (CBB) staining. Immunoblot analysis was carried out using anti-NAS and anti-NBS specific polyclonal antibodies [38, 67], which were raised against the C terminus of NAS or NBS, respectively. Preparation and purification of fruit extracts were performed as described previously [18, 38]. Each 0.1 g pulp sample was treated with 0.5 mL of 0.5 M NaCl to obtain an extract, which was combined with the appropriate volume of buffer containing 2-mercaptoethanol for SDS-PAGE. We measured protein contents with a Pierce^TM BCA protein assay kit (Thermo Fischer Scientific, MA. USA). 20 µg protein of fruit extract from C. latifolia and C. capitulata were subjected to SDS-PAGE.

AAA, Allium ascalonicum agglutinin; ACA, Allium cepa agglutinin; ACO, Ananas comosus lectin; AKA, Amorphophallus konjac agglutinin; ASA, Allium sativum agglutinin; AUA, Allium ursinum agglutinin; CHC, Cymbidium hybrid cultivar agglutinin; CMA, Clivia miniata agglutinin; COG, Cluster of Orthologous Groups; DPA, Dioscorea polystachya agglutinin; EHA, Epipactis helleborine agglutinin; GNA, Galanthus nivalis agglutinin; GO, Gene Ontology; NAA, 1-naphthylacetic acid; NAS, neoculin acidic subunit; NCBI, the National Center for Biotechnology Information; NBS, neoculin basic subunit; NGS, next generation sequencing; NPL, Narcissus pseudonarcissus lectin; PMA, Polygonatum multiflorum agglutinin; PRA, Polygonatum roseum agglutinin; THC, Tulipa hybrid cultivar lectin; ZOA, Zingiber officinale agglutinin; TPM, transcripts per million; ZCA, Zephyranthes candida agglutinin.

Ethics approval and consent to participate

Not applicable

Consent for publication

Not applicable

Availability of data and materials

The raw data and processed data from this study have been uploaded to the NCBI Gene Expression Omnibus (GSE151377) and are available in the NCBI database under accession number PRJNA635640, https://www.ncbi.nlm.nih.gov/bioproject/635640

Competing interests

The authors declare that they have no competing interests.

Funding

This study was supported by the Cross-ministerial Strategic Innovation Promotion Program (Grant No. 14532924; K.A.), a Grant-in-Aid for Scientific Research B (Grant No. 19300248; T.A.) from the Society for the Promotion of Science in Japan and Adaptable and Seamless Technology transfer Program through Target-driven R&D (A-STEP) from Japan Science and Technology Agency to T.A. (Grant No.JPMJTR194F).

Author contributions

TA conceived the study and participated in the design of all experiments. SO1, KT, SO2, TY, TM, KN and KA analyzed and interpreted data. SO1 and TY cultivated plants and performed sample preparation. SO1 and SO2 performed biological experiments. SO1 and KT wrote the manuscript. KA discussed the experiments and manuscript. All authors read and approved the final manuscript.

Acknowledgements

We are grateful to the late Dr. Kosaburo Nishi of Research Center for Medicinal Plant Resources (Tsukuba Division), National Institutes of Biomedical Innovation, Health and Nutrition, for kindly providing Curculigo latifolia plants. We also thank Hiroshi Morita, the Director of The Naito Museum of Pharmaceutical Science and Industry, for kindly providing Curculigo capitulata plants. Computations were partially performed on the NIG supercomputer.

Authors information

Satoshi Okubo and Kaede Terauchi contributed equally to this work.

Burkill IH. A dictionary of the economic products of the Malay Peninsula. London: Crown Agents for the Colonies; 1966. pp. 713-4
Perry LM. Medicinal plants of East and Southeast Asia. Cambridge: MIT Press; 1895. p. 12.
Plants of the World Online. http://www.plantsoftheworldonline.org/. Accessed May 17 2020.
Kocyan A. The discovery of polyandry in Curculigo (Hypoxidaceae): implications for androecium evolution of asparagoid monocotyledons. Ann Bot. 2007;100(2):241-8.
Kocyan A, Snijman DA, Forest F, Devey DS, Freudenstein JV, Wiland-Szymańska J, et al. Molecular phylogenetics of Hypoxidaceae-evidence from plastid DNA data and inferences on morphology and biogeography. Mol Phylogenet Evol. 2011;60(1):122-36.
Liu KW, Xie GC, Chen LJ, Xiao XJ, Zheng YY, Cai J, et al. Sinocurculigo, a new genus of Hypoxidaceae from China based on molecular and morphological evidence. PLoS One. 2012;7(6):e38880.
Ranjbarfard A, Saleh G, Abdullah NAP, Kashiani P. Genetic diversity of lemba (Curculigo latifolia) populations in Peninsular Malaysia using ISSR molecular markers. Australian Journal of Crop Science. 2014;8(1):9-17.
Eksomtramage L, Kwandarm M, Purintavaragul C. Karyotype of some Thai Hypoxidaceae species. Songklanakarin J Sci Technol. 2013;35(4):379-82.
Okubo S, Yamada M, Yamaura T, Akita T. Effects of the pistil size and self-incompatibility on fruit production in Curculigo latifolia (Liliaceae). J Japan Soc Hort Sci. 2010;79(4):354-9.
Asif M. A review on phytochemical and ethnopharmacological activities of Curculigo orchioides. Mahidol University Journal of Pharmaceutical Sciences. 2012;39(3-4):1-10.
Babaei N, Abdullah NAP, Saleh G, Abdullah TL. An efficient in vitro plantlet regeneration from shoot tip cultures of Curculigo latifolia, a medicinal plant. ScientificWorldJournal. 2014;2014:275028.
Ishak NA, Ismail M, Hamid M, Ahmad Z, Abd Ghafar SA. Antidiabetic and hypolipidemic activities of Curculigo latifolia fruit: root extract in high fat fed diet and low dose STZ induced diabetic rats. Evid Based Complement Alternat Med. 2013;2013:601838.
Li S, Yu JH, Fan YY, Liu QF, Li ZC, Xie ZX, et al. Structural elucidation and total synthesis of three 9-torlignans from Curculigo capitulata. J Org Chem. 2019;84(9):5195-202.
Nie Y, Dong X, He Y, Yuan T, Han T, Rahman K, et al. Medicinal plants of genus Curculigo: traditional uses and a phytochemical and ethnopharmacological review. J Ethnopharmacol. 2013;147(3):547-63.
Wang KJ, Zhu CC, Di L, Li N, Zhao YX. New norlignan derivatives from Curculigo capitulata. Fitoterapia. 2010;81(7):869-72.
Yamashita H, Theerasilp S, Aiuchi T, Nakaya K, Nakamura Y, Kurihara Y. Purification and complete amino acid sequence of a new type of sweet protein taste-modifying activity, curculin. J Biol Chem. 1990;265(26):15770-5.
Nakajima K, Asakura T, Oike H, Morita Y, Shimizu-Ibuka A, Misaka T, et al. Neoculin, a taste-modifying protein, is recognized by human sweet taste receptor. Neuroreport. 2006;17(12):1241-4.
Shirasuka Y, Nakajima K, Asakura T, Yamashita H, Yamamoto A, Hata S, et al. Neoculin as a new taste-modifying protein occurring in the fruit of Curculigo latifolia. Biosci Biotechnol Biochem. 2004;68(6):1403-7.
Kant R. Sweet proteins--potential replacement for artificial low calorie sweeteners. Nutr J. 2005;4:5.
Yamashita H, Akabane T, Kurihara Y. Activity and stability of a new sweet protein with taste-modifying action, curculin. Chem Senses. 1995;20(2):239-43.
Nakajima K, Koizumi A, Iizuka K, Ito K, Morita Y, Koizumi T, et al. Non-acidic compounds induce the intense sweet taste of neoculin, a taste-modifying protein. Biosci Biotechnol Biochem. 2011;75(8):1600-2.
Koizumi A, Nakajima K, Asakura T, Morita Y, Ito K, Shmizu-Ibuka A, et al. Taste-modifying sweet protein, neoculin, is received at human T1R3 amino terminal domain. Biochem Biophys Res Commun. 2007;358(2):585-9.
Suzuki M, Kurimoto E, Nirasawa S, Masuda Y, Hori K, Kurihara Y, et al. Recombinant curculin heterodimer exhibits taste-modifying and sweet-tasting activities. FEBS Lett. 2004;573(1-3):135-8.
Abe K, Yamashita H, Arai S, Kurihara Y. Molecular cloning of curculin, a novel taste-modifying protein with a sweet taste. Biochim Biophys Acta. 1992;1130(2):232-4.
Nakajima K, Yokoyama K, Koizumi T, Koizumi A, Asakura T, Terada T, et al. Identification and modulation of the key amino acid residue responsible for the pH sensitivity of neoculin, a taste-modifying protein. PLoS One. 2011;6(4):e19448.
Koizumi T, Terada T, Nakajima K, Kojima M, Koshiba S, Matsumura Y, et al. Identification of key neoculin residues responsible for the binding and activation of the sweet taste receptor. Sci Rep. 2015;5:12947.
Morita Y, Nakajima K, Iizuka K, Terada T, Shimizu-Ibuka A, Ito K, et al. pH-dependent structural change in neoculin with special reference to its taste-modifying activity. Biosci Biotechnol Biochem. 2009;73(11):2552-5.
Ohkubo T, Tamiya M, Abe K, Ishiguro M. Structural basis of pH dependence of neoculin, a sweet taste-modifying protein. PLoS One. 2015;10(5):e0126921.
Van Damme EJM, Peumans WJ, Barre A, Rougé P. Plant lectins: a composite of several distinct families of structurally and evolutionary related proteins with diverse biological roles. Critical Reviews in Plant Sciences. 1998;17:575-692.
Van Damme EJM, Lannoo N, Peumans WJ. Plant lectins. Advances in Botanical Research: Elsevie; 2008. pp. 107-209.
Barre A, Van Damme EJM, Peumans WJ, Rougé P. Structure-function relationship of monocot mannose-binding lectins. Plant Physiol. 1996;112(4):1531-40.
Shimizu-Ibuka A, Morita Y, Terada T, Asakura T, Nakajima K, Iwata S, et al. Crystal structure of neoculin: insights into its sweetness and taste-modifying activity. J Mol Biol. 2006;359(1):148-58.
Kurimoto E, Suzuki M, Amemiya E, Yamaguchi Y, Nirasawa S, Shimba N, et al. Curculin exhibits sweet-tasting and taste-modifying activities through its distinct molecular surfaces. J Biol Chem. 2007;282(46):33252-6.
Barre A, Van Damme EJM, Peumans WJ, Rougé P. Curculin, a sweet-tasting and taste-modifying protein, is a non-functional mannose-binding lectin. Plant Mol Biol. 1997;33(4):691-8.
Harada S, Otani H, Maeda S, Kai Y, Kasai N, Kurihara Y. Crystallization and preliminary X-ray diffraction studies of curculin. A new type of sweet protein having taste-modifying action. J Mol Biol. 1994;238(2):286-7.
Shimizu-Ibuka A, Nakai Y, Nakamori K, Morita Y, Nakajima K, Kadota K, et al. Biochemical and genomic analysis of neoculin compared to monocot mannose-binding lectins. J Agric Food Chem. 2008;56(13):5338-44.
Nakajo S, Akabane T, Nakaya K, Nakamura Y, Kurihara Y. An enzyme immunoassay and immunoblot analysis for curculin, a new type of taste-modifying protein: cross-reactivity of curculin and miraculin to both antibodies. Biochim Biophys Acta. 1992;1118(3):293-7.
Okubo S, Asakura T, Okubo K, Abe K, Misaka T, Akita T. Neoculin, a taste-modifying sweet protein, accumulates in ripening fruits of cultivated Curculigo latifolia. J Plant Physiol. 2008;165(18):1964-9.
Standard tables of food composition in Japan - 2015 - (Seventh revised version) 2015. MEXT. https://www.mext.go.jp/en/policy/science_technology/policy/title01/detail01/1374030.htm. Accessed May 17 2020.
Götz S, García-Gómez JM, Terol J, Williams TD, Nagaraj SH, Nueda MJ, et al. High-throughput functional annotation and data mining with the Blast2GO suite. Nucleic Acids Res. 2008;36(10):3420-35.
De Schutter K, Tsaneva M, Kulkarni SR, Rougé P, Vandepoele K, Van Damme EJM. Evolutionary relationships and expression analysis of EUL domain proteins in rice (Oryza sativa). Rice (N Y). 2017;10(1):26.
Cannon SB, Mitra A, Baumgarten A, Young ND, May G. The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biol. 2004;4:10.
Copley SD. Evolution of new enzymes by gene duplication and divergence. FEBS J. 2020;287(7):1262-83.
Dang L, Van Damme EJM. Genome-wide identification and domain organization of lectin domains in cucumber. Plant Physiol Biochem. 2016;108:165-76.
Fukushima K, Fang X, Alvarez-Ponce D, Cai H, Carretero-Paulet L, Chen C, et al. Genome of the pitcher plant Cephalotus reveals genetic changes associated with carnivory. Nat Ecol Evol. 2017;1(3):59.
Panchy N, Lehti-Shiu M, Shiu SH. Evolution of gene duplication in plants. Plant Physiol. 2016;171(4):2294-316.
Yan J, Li G, Guo X, Li Y, Cao X. Genome-wide classification, evolutionary analysis and gene expression patterns of the kinome in Gossypium. PLoS One. 2018;13(5):e0197392.
de Vos AM, Hatada M, van der Wel H, Krabbendam H, Peerdeman AF, Kim SH. Three-dimensional structure of thaumatin I, an intensely sweet protein. Proc Natl Acad Sci U S A. 1985;82(5):1406-9.
Kurihara Y. Characteristics of antisweet substances, sweet proteins, and sweetness-inducing proteins. Crit Rev Food Sci Nutr. 1992;32(3):231-52.
Liu JJ, Sturrock R, Ekramoddoullah AK. The superfamily of thaumatin-like proteins: its origin, evolution, and expression towards biological function. Plant Cell Rep. 2010;29(5):419-36.
Petre B, Major I, Rouhier N, Duplessis S. Genome-wide analysis of eukaryote thaumatin-like proteins (TLPs) with an emphasis on poplar. BMC Plant Biol. 2011;11:33.
Selvakumar P, Gahloth D, Tomar PP, Sharma N, Sharma AK. Molecular evolution of miraculin-like proteins in soybean Kunitz super-family. J Mol Evol. 2011;73(5-6):369-79.
Theerasilp S, Hitotsuya H, Nakajo S, Nakaya K, Nakamura Y, Kurihara Y. Complete amino acid sequence and structure characterization of the taste-modifying protein, miraculin. J Biol Chem. 1989;264(12):6655-9.
Witty M, Higginboyham JD. Thaumatin. Florida, USA: CRC Press, Inc.; 1994. pp. 20-35.
Jiang SY, Ma Z, Ramachandran S. Evolutionary history and stress regulation of the lectin superfamily in higher plants. BMC Evol Biol. 2010;10:79.
Lambin J, Asci SD, Dubiel M, Tsaneva M, Verbeke I, Wytynck P, et al. OsEUL lectin gene expression in rice: stress regulation, subcellular localization and tissue specificity. Front Plant Sci. 2020;11:185.
Li XQ. Developmental and environmental variation in genomes. Heredity (Edinb). 2009;102(4):323-9.
Wittkopp PJ, Kalay G. Cis-regulatory elements: molecular mechanisms and evolutionary processes underlying divergence. Nat Rev Genet. 2011;13(1):59-69.
Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet.journal. 2011;17(1).
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114-20.
Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. Trinity: reconstructing a full-length transcriptome without a genome from RNA-seq data. Nat Biotechnol. 2013;29(7):644-52.
Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol. 2009;10(3):R25.
Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-seq data with or without a reference genome. BMC Bioinformatics. 2011;12:323.
Fu L, Niu B, Zhu Z, Wu S, Li W. CD-HIT: accelerated for clustering the next-generation sequencing data. Bioinformatics. 2012;28(23):3150-2.
Li W, Godzik A. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006;22(13):1658-9.
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, et al. Clustal W and Clustal X version 2.0. Bioinformatics. 2007;23(21):2947-8.
Nakajima K, Asakura T, Maruyama J, Morita Y, Oike H, Shimizu-Ibuka A, et al. Extracellular production of neoculin, a sweet-tasting heterodimeric protein with taste-modifying activity, by Aspergillus oryzae. Appl Environ Microbiol. 2006;72(5):3716-23.

AdditionalFile1.pdf
Additional File 1. Supplemental Figure 1. Length distribution of the assembled transcripts (>1 TPM) from C. latifolia (purple) and C. capitulata (orange) fruits.
AdditionalFile2.pdf
Additional File 2. Supplemental Figure 2. Distribution of transcripts per million (TPM) values of the assembled transcripts from C. latifolia (purple) and C. capitulata (orange) fruits. Average, median and mode of C. latifolia were 11.7, 1.6, and 2, respectively, and those of C. capitulata, 13.0, 1.9, and 2, respectively.
AdditionalFile3.xlsx
Additional File 3. Supplemental annotation. Transcripts from C. latifolia and C. capitulata fruits were annotated by BLASTX (E-value < 1e−10) against the NCBI NR, RefSeq, UniProt, and COG databases and genomes from rice (Os-Nipponbare-Reference-IRGSP-1.0, Assembly: GCF_001433935.1) and Arabidopsis (Arabidopsis thaliana, Assembly: GCF_000001735.4). Percentage identity (Pident) and E-value are BLASTN results performed using C. latifolia as the query against C. capitulata. Expression values (TPM), unigenes clustered by CD-Hit, and the unique or common genes from C. capitulata or C. latifolia are also included.
AdditionalFile4.pdf
Additional File 4. Supplemental Figure 3. Gene Ontology (GO) annotation of transcripts from C. latifolia (purple) and C. capitulata (orange) fruits. In total, 28,100 (C. latifolia) and 29,614 (C. capitulata) transcripts were classified based on GO terms. No significant differences were observed between the two species.
AdditionalFile5.pdf
Additional File 5. Supplemental Table 1. Selection of sequences for phylogenetic analysis using BLAST search of transcripts from C. latifolia and C. capitulata fruits. The query sequences were the amino acid sequences (AA) of GNA (UniProtKB/Swiss-Prot: P30617.1) and the nucleotide sequences (nucl) and AA number of NBS (GenBank: X64110.1, GenBank: CAA45476.1) and NAS (GenBank: AB167079.1, GenBank: BAD29946.1). Each top hit was selected. The contigs that were selected for each query and used as neoculin-related sequences in phylogenetic analysis (Fig. 5) are indicated by checkmarks (√).
AdditionalFile6.pdf
Additional File 6. Supplemental Figure 4. Amino acid sequence alignment of 10 C. latifolia transcripts, 15 C. capitulata transcripts, and 17 well-known GNA family members used in the phylogenetic analysis shown in Fig. 5.
AdditionalFile7.pdf
Additional File 7. Supplemental Figure 5. Comparison of the protein sequences of neoculin (NBS and NAS) from public databases and the Curculigo latifolia proteins identified in the present study. Amino acid residues that differ between NBS and NAS are shown in blue for NBS residues and red for NAS residues. Asn-2 of L_16562_c0_g1_i2 was the only residue that differed with NAS, which has Ser at this position (GenBank: BAD29946.1). These de novo assemblies were good matches with the known sequences (NBS and NAS). This level of matching supports the accuracy of the assembly.
AdditionalFile8.pdf
Additional File 8. Supplemental Table 2. Numbers of amino acid substitutions in neoculin group proteins. The values show the number of substituted amino-acids in each region. The predicted proteins were divided into nine regions—N-Pro, N-term, MBS1, inter1, MBS2, inter2, MBS3, C-term and C-Pro—based on the regions removed by processing, the N- or C-terminal regions, the mannose-binding sites MBS 1 to 3, and the regions between the MBSs. “A” indicates residues in NAS that are different from those of NBS. “B” indicates residues in NBS that are different from those of NAS. “C” indicates residues different from both NBS and NAS. “D” indicates the residues only present in C_9931_c0_g1_i1.

Download PDF

Journal Publication

published 13 May, 2021

Read the published version in BMC Genomics →

Editorial decision: Major revision
27 Jan, 2021
Review #2 received at journal
14 Jan, 2021
Review #1 received at journal
07 Jan, 2021
Reviewer #2 agreed at journal
27 Dec, 2020
Editor assigned by journal
21 Dec, 2020
Reviewers invited by journal
21 Dec, 2020
Reviewer #1 agreed at journal
21 Dec, 2020
Submission checks completed at journal
21 Dec, 2020
Editor invited by journal
21 Dec, 2020

You are reading this latest preprint version

De novo transcriptome analysis and comparative expression profiling of genes associated with the taste-modifying protein neoculin in Curculigo latifolia and Curculigo capitulata fruits

Status:

Journal Publication

Version 2

Abstract

Figures

Background

Results

Discussion

Conclusions

Methods

List Of Abbreviations

Declarations

References

Supplementary Files

Status:

Journal Publication

Version 2