Expression profiling of Expansin A4, BURP Domain protein RD22-like and E6-like
Overall expression of Expansin A4 gene was remarkably high in rapid elongating fibre during 10 DPA in all interspecific lines and Gossypium species. Maximum transcripts were found in SL-19 (Fig. 1). Expression of BURP Domain protein RD22-like was almost remained constant from 10-20 DPA fibre in all genotypes except in Gossypium anomalum. Transcripts of BURP Domain protein RD22-like gene were maximum in 10 DAP fibre as compared to 5 DPA. In all three interspecific lines, highest expression was detected at 15 and 20 DPA fibre stages in SL-19, SL-79 and SL-369 respectively (Fig. 2). Expression pattern of E6-like showed that high expression was detected at 10 and 15 DPA fibre stages predicting its main role in fibre elongation. In interspecific lines, transcripts of E6-like gene were varied from 0 DPA till 20 DPA. In SL-19, expression of fibre gene starts to increase from 0 DPA and reached at maximum level at 15 DPA and after that slightly decreases at 20 DPA (Fig. 3).
To validate expression results, the target gene transcriptomic profiles (E6-like, Expansin A4 & BURP Domain protein RD22-like) were validated by using existing RNA-seq data on Cotton FGD. The results of available fibre specific genes were generally similar with our expression analysis results. Heat map was created on the basis of RNA-seq data of related expressed in transcript per Million (TPM) during different fibre development stages. E6-like, Expansin A4 and BURP Domain RD22-like showed similarity with gene Gh-D05G160200, Gh_A10G149600 and Gh_D05G052400 respectively. (Fig. 4). An expression trend of gradual increasing from 5 DPA to 10 DPA were identified, while similar tendencies were also observed in our experiment.
values of log2, day post anthesis and fragments per kilobase of transcript per million mapped reads.
Sequence comparison of interspecific lines and species
In E6-like DNA sequencing, all interspecific lines exhibited sequences more similar to G. hirsutum as depicted at nucleotide positions 213, 217 and 221-226. In Expansin A4, interspecific lines were also more closely related to Gossypium hirsutum predicted at 390, 393, 507, 519 & 657bp which also confirm its breeding history. In BBURP Domain RD22-like, it was also predicted that almost all dissimilar nucleotide (241-300, 301-360, 361-420) were observed in G. anomalum as compared to other species of cotton (Fig. 5).
In silico analysis of E6-like, Expansin A4 and BURP Domain RD-22
Physicochemical properties
Expasy’s Protpam analysis of predicted protein showed that Protein E6-like and RD-22 was characterized as unstable as value of instability index was 47.75 and 44.72 respectively (Table 3). Expansin A4 was characterized as a stable protein with value of instability index of 29.01.
Table 3 Physicochemical properties of fibre genes
Physicochemical properties
|
E6-like
|
Expansin A4
|
BURP Domain RD-22
|
Number of amino acids
|
241
|
258
|
335
|
Total negatively amino acid charged residues (Asp + Glu)
|
37
|
13
|
35
|
Total positively amino acid charged residues (Arg + Lys)
|
25
|
16
|
34
|
Molecular weight
|
28223.37
|
27936.46
|
36595.05
|
Theoretical pI
|
5.00
|
8.36
|
6.89
|
Aliphatic index
|
32.37
|
62.83
|
75.64
|
Grand average of hydropathicity (GRAVY)
|
-1.356
|
-0.090
|
-0.266
|
Instability index (II)
|
47.75
|
29.01
|
44.72
|
Subcellular Localization
DeepLoc analysis designated that protein. Proteins E6-like, Expansin A4 and BURP Domain RD22-like were a membrane soluble protein family. Location in different organelles with the approximate values (Table 4) predicted the probability of protein location in different organelles. Highest Extracellular values of Proteins E6-like, Expansin A4 and BURP Domain RD22-like (0.819, 0.729 and 0.843 respectively) showed that these proteins are extracellular.
Table 4 Predicted subcellular localization of E6-like, Expansin A4 and BURP Domain RD-22
Fibre gene
|
Extracellular
|
Lysosome
|
Endoplasmic reticulum
|
Cell membrane
|
Golgi apparatus
|
Cytoplasm
|
E6-like
|
0.8195
|
0.1706
|
0.0083
|
0.0013
|
0.0002
|
0.0002
|
Expansin A4
|
0.7293
|
0.2373
|
0.0329
|
0.0005
|
0
|
0
|
BURP Domain RD-22
|
0.8435
|
0.1316
|
0.0237
|
0.0008
|
0
|
0.0003
|
Signal peptide analysis
In E6-like, Expansin-A4 and BURP Domain RD22 were characterizes as extracellular membrane that’s why signal peptide was present in protein coding sequence. Score values of C, S, 3Y is more than 0.45 (Table 5) that shows that peptide signal is present.
Table 5 Signal peptide Analysis of E6-like, Expansin A4 and BURP Domain RD22-like
Fibre gene
|
Measure
|
Position
|
Value
|
Cut Off
|
Signal Peptide
|
E6-like
|
max.C
|
26
|
0.792
|
|
|
max.Y
|
26
|
0.840
|
|
|
max.S
|
15
|
0.941
|
|
|
Mean S
|
1-25
|
0.891
|
|
|
D
|
1-25
|
0.868
|
0.450
|
Yes
|
Expansin A4
|
max.C
|
30
|
0.427
|
|
|
max.Y
|
30
|
0.586
|
|
|
max.S
|
9
|
0.950
|
|
|
Mean S
|
1-29
|
0.821
|
|
|
D
|
1-29
|
0.713
|
0.45
|
yes
|
BURP Domain RD22-like
|
max.C
|
30
|
0.427
|
|
|
max.Y
|
30
|
0.586
|
|
|
max.S
|
9
|
0.950
|
|
|
Mean S
|
1-29
|
0.821
|
|
|
D
|
1-29
|
0.713
|
0.450
|
Yes
|
Promoter sequence Analysis
Sequence analysis of cotton E6-like, Expansin A4 and BURP Domain protein RD22-likepromoter using PlantCARE predicted many vital motifs in this region related to gene expression (Fig. 4). There are few transcriptions activation related motifs along with core promoter elements like TATA and CAAT boxes. These motifs are light responsive, hormone and stress regulated cis elements. These motifs are involved in the light, stress and hormones responsiveness. There were other vital core promoter elements required for promoter activity including TATA box and CAAT box (Tale-6). Cis-acting essential element for the abscisic acid reaction (Hordeum vulgare), light response elements (Arabidopsis thaliana), gibberellin-enhancer element (Brassica oleracea) and element for variation of the palisade mesophyll cells (Arabidopsis thaliana) were present in E6-like promoter region. Similarly, in Expansin A4 various cis acting premotor elements were identified. Abscisic acid responsiveness elements were identified in Arabidopsis thaliana, light responsiveness in Zea mays, element responsive for transcription start in Brassica oleracea and MeJA-responsiveness in Hordeum vulgare. In BURP Domain RD22-like, elements essential for light responsiveness were present in Petroselinum crispum while promoter and enhancer regions were identified in Arabidopsis thaliana. MYBHv1 binding site, MeJA and anaerobic induction responsive elements were present in Hordeum vulgare and Zea mays respectively.
Table 6 Cis acting promoter elements in promoter region
E6-like
|
Site Name
|
Organism
|
Position
|
Strand
|
Score.
|
Sequence
|
Function
|
ABRE
|
Hordeum vulgare
|
425
|
-
|
9
|
GCAACGTGTC
|
cis-acting element involved in the abscisic acid responsiveness
|
AE-box
|
Arabidopsis thaliana
|
748
|
+
|
8
|
AGAAACAA
|
part of a module for light response
|
CAAT-box
|
Arabidopsis thaliana
|
638
|
+
|
5
|
CCAAT
|
common cis-acting element in promoter and enhancer regions
|
CAAT-box
|
Pisum sativum
|
852
|
-
|
5
|
CAAAT
|
common cis-acting element in promoter and enhancer regions
|
GARE motif
|
Brassica oleracea
|
615
|
-
|
7
|
TCTGTTG
|
gibberellin-responsive element
|
HD-Zip 1
|
Arabidopsis thaliana
|
564
|
-
|
8
|
CAAT(A/T) ATTG
|
element involved in differentiation of the palisade mesophyll cells
|
TATA-box
|
Arabidopsis thaliana
|
575
|
-
|
4
|
TATA
|
core promoter element around -30 of transcription start
|
TC-richrepeats
|
Nicotiana tabacum
|
380
|
+
|
9
|
GTTTTCTTAC
|
cis-acting element involved in defense and stress responsiveness
|
TCT-motif
|
Arabidopsis thaliana
|
384
|
+
|
6
|
TCTTAC
|
part of a light responsive element
|
Expansin A-4
|
G-Box
|
Pisum sativum
|
507
|
-
|
6
|
CACGTT
|
cis-acting regulatory element involved in light responsiveness
|
ABRE
|
Arabidopsis thaliana
|
508
|
+
|
5
|
ACGTG
|
cis-acting element involved in the abscisic acid responsiveness
|
ABRE
|
Arabidopsis thaliana
|
508
|
+
|
5
|
ACGTG
|
cis-acting element involved in the abscisic acid responsiveness
|
ATC-motif
|
Zea mays
|
384
|
-
|
9
|
TGCTATCCG
|
part of a conserved DNA module involved in light responsiveness
|
CAAT-box
|
Pisum sativum
|
361
|
-
|
5
|
CAAAT
|
common cis-acting element in promoter and enhancer regions
|
CAAT-box
|
Arabidopsis thaliana
|
581
|
-
|
8
|
CCCAATTT
|
common cis-acting element in promoter and enhancer regions
|
CAAT-box
|
Petunia hybrida
|
694
|
-
|
7
|
TGCCAAC
|
common cis-acting element in promoter and enhancer regions
|
TATA-box
|
Arabidopsis thaliana
|
527
|
-
|
4
|
TATA
|
core promoter element around -30 of transcription start
|
TGACG-motif
|
Hordeum vulgare
|
532
|
-
|
5
|
TGACG
|
cis-acting regulatory element involved in the MeJA-responsiveness
|
BURP Domain RD22-like
|
ABRE
|
Triticum aestivum
|
181
|
-
|
9
|
GACACGTGGC
|
cis-acting element involved in the abscisic acid responsiveness
|
ARE
|
Zea mays
|
542
|
+
|
6
|
AAACCA
|
cis-acting regulatory element essential for the anaerobic induction
|
Box 4
|
Petroselinum crispum
|
450
|
-
|
6
|
ATTAAT
|
part of a conserved DNA module involved in light responsiveness
|
CAAT-box
|
Arabidopsis thaliana
|
55
|
+
|
5
|
CCAAT
|
common cis-acting element in promoter and enhancer regions
|
CCAAT-box
|
Hordeum vulgare
|
440
|
+
|
6
|
CAACGG
|
MYBHv1 binding site
|
CGTCA-motif
|
Hordeum vulgare
|
515
|
+
|
5
|
CGTCA
|
cis-acting regulatory element involved in the MeJA-responsiveness
|
TATA-box
|
Arabidopsis thaliana
|
291
|
+
|
4
|
TATA
|
core promoter element around -30 of transcription start
|
TGACG-motif
|
Hordeum vulgare
|
512
|
+
|
5
|
TGACG
|
cis-acting regulatory element involved in the MeJA-responsiveness
|
TGACG-motif
|
Hordeum vulgare
|
515
|
-
|
5
|
TGACG
|
cis-acting regulatory element involved in the MeJA-responsiveness
|