| |
Citation: Subramanian R, Muthurajan R, Ayyanar M (2008) Comparative Modeling and Analysis of 3-D Structure of EMV2, a
Late Embryogenesis Abundant Protein of Vigna Radiata (Wilczek). J Proteomics Bioinform 1: 401-407.
|
Copyright: © 2008 Subramanian R, etal. This is an open-access article distributed under the terms of the Creative Commons Attribution
License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and
source are credited.
|
Abstract
LEA proteins are ubiquitous among photosynthetic organisms and have been reported in mono- and dicot
plants as well as in nematodes, yeast, bacteria and cyanobacteria. EMV2 is a Group 1 LEA protein isolated fromVigna radiata, which is speculated to impart desiccation tolerance in plants. The homology model of this protein
was generated by using the LOOPP software based on available structural homologues in protein databases. The
final model obtained by molecular mechanics and dynamics method was assessed by PROCHECK that showed
that the final refined model is reliable. The model could prove useful in further functional characterization of this
protein.
Keywords
LEA proteins; Vigna radiate; desiccation; Homology modeling; validation
Abbreviations
SCOP-Structural Classification Of Proteins, PDB-Protein Data Bank, LOOPP -Learning, Observing and
Outputting Protein Patterns
Introduction
Late embryogenesis abundant (LEA) protein genes are
highly expressed during late stages of seed development at
normal growth condition, but many of the LEA class genes
are also frequently expressed in vegetative tissues when
plants are exposed to environmental stress (Bray et al., 2000).
Several groups of LEA protein genes have been demonstrated
to confer water-deficit and salt-stress tolerance.
On the basis of sequence similarities, LEA proteins have
been classified in six groups (Dure, 1993; Bray, 1993). Group
2 LEA proteins or dehydrins are by far the most frequently
described LEA protein family and have been classified in
distinct groups (Close, 1997) that differ in the arrangement
and number of conserved motifs: the lysine-rich repeat (KIKEKLPG) or K segment, the stretch of serine or S segment
and the V/T DEYGNP motif or Y segment. Some of
these structural motifs are predicted to form amphipathic
alpha helices, which may be important for their function in
protecting plant cells against dehydration. Evidence of functional
links between LEA protein accumulation and improved
stress tolerance of transgenic yeast and plants support this
hypothesis (Imai, 1996; Xu et al., 1996; Sivamani et al.,
2000). It was therefore proposed that most LEA and
dehydrin proteins exist as largely unfolded structures in their
native state, although a few members exist as dimers or
tetramers (Ceccardi et al., 1994; Kazuoka and Oeda, 1994).
Hydrophilicity is a common characteristic of LEA-type and
other osmotic stress-responsive proteins. Our earlier work on the occurrence of LEA proteins in the embryonic axes
of Vigna radiata (L.) Wilczek referred as EMV proteins,
the first ever report in the Fabaceae family (Manickam and
Carlier, 1980). cDNA encoding these proteins were isolated,
characterized (Manickam et al., 1996). In silico analysis of
the 20-mer motif of this EMV2 categorize this protein to
Group1 LEA and hypothesize to function as DNA/RNA
binding proteins in stabilizing membranes/macromolecules
at the time of dehydration process (Rajesh and Manickam,
2006, Gilles et al., 2007).
In the present study, effort was made to generate the
three-dimensional (3D) structure of the EMV2 protein based
on the available template structural homologues from Protein
Data Bank and SCOP databases and the model validated
with standard parameters. This study could prove
useful in further functional characterization of this important
group of proteins.
Material and Methods
Datasets
The peptide sequences of Vigna radiata, EMV2 (NCBI
GenBank accession number U31211; UniProt acc. Nos.
Q41685 ) and other sequences examined in this study were
retrieved from the public databases, http://www.ncbi.nlm.nih.gov and http://www.ebi.ac.uk. Structurally homologous subsets of the experimentally determined
3D structures of the EMV proteins were retrieved from
PDB and SCOP databases. The template used for comparative
modeling of EMV2 is a DNA Binding Protein from
Homo sapiens (1IG6_A.pdb) and a bifunctional inhibitor/
lipid-transfer protein/seed storage 2S albumin (1RZL.pdb)
with the sequence similarity of 56.44 and 56.67 % at the
loop/coiled coil regions. Similarity at these regions was considered
because LEA proteins are generally loosely structured
with predominantly random coiled regions. However
structure was further refined with sequence of 1RZL.pdb
as it showed overall 40 % sequence identity in the sequence
length of 89% to a bifunctional inhibitor/lipid transfer protein/
seed storage 2S albumin of rice.
Comparative Modeling of EMV2 Protein
Tertiary structure of the Vigna radiata LEA protein, EMV2
was modeled by submitting the deduced amino acid sequences
to the Computational Biology Service Unit, Cornell
Theory Center, Cornell University. The Atomic coordinates
for the protein models were generated by aligning to the
structural homologues in the fold recognition program of
LOOPP v3.0 server (Teodorescu et al., 2004).
Validation of EMV2 Protein Model
PROCHECK, a versatile protein structure analysis program (Laskowski et al., 1993) available at the Joint Centre
for Structural Genomics, Bioinformatics core, University of
California, San Diego; was used in validation of protein structure
and models by verifying the parameters like
Ramachandran plot quality, peptide bond planarity, Bad nonbonded
interactions, main chain hydrogen bond energy, Calpha
chirality and over-all G factor and the side chain parameters
like standard deviations of chi1 gauche minus, trans
and plus, pooled standard deviations of chi1 with respect to
refined structures( Morris et al., 1992).
|
Figure 1: Predicted 3-Dimensional Structure of the EMV2 protein. Ribbon view of EMV2 structure for residues 1-99.
Numbers represent the order of helices. N and C termini of the protein are labeled.
|
Results
Comparative Modeling of EMV2 Protein
Tertiary structure of a protein is build by packing of its
secondary structure elements to form discrete domains or autonomous folding units. Comparative modeling to build
3D structure of the EMV2 protein was made based on the
experimentally solved structural homologues. The amino acid
sequences of EMV2 were submitted to LOOPP server,
Cornell Bioinformatics Structural Unit (CBSU) and atomic
coordinates for the proteins were generated based on Hidden
Markov Model. The hypothetical protein models created were stored as PDB output file. The hypothetical proteins
were visualized and computed by Swiss PDB Viewer
and Rastop. The 3D structure of the proteins were represented
by cartoon display and colored based on the secondary
structure (Fig. 1).
Validation of Protein Structures of EMV2
The hypothetical protein models generated were analyzed
online by submitting to Joint Center for Structural Genomics
(JCSG), Bioinformatics core, University of California, San
Diego. Accuracy of the protein model generated was judged
by validity report generated by PROCHECK. Parameter
comparisons of these proteins were made with well-refined
structures that have similar resolution.
The main chain parameters plotted are Ramachandran
plot quality, peptide bond planarity, Bad non-bonded interactions,
main chain hydrogen bond energy, C-alpha chirality
and over-all G factor. In the Ramamchandran plot analysis,
the residues were classified according to its regions in the
quadrangle. The Ramachandran map for EMV2 (Fig. 2)
and the plot statistics (Table 1) is represented. Non-bonded
interactions check revealed the value of 30 bad contacts per 100 residues for EMV2 and the chirality measure of 0.1
for the zeta torsion angle is observed. The over-all G factor
computed for the EMV2 is 0.1, which is higher than the
ideal value (-0.4).
|
Figure 2: Ramachandran map of EMV2 protein. The Plot calculation was done with PROCHECK program.
|
Table 1: Ramachandran plot statistics of EMV2 computed by PROCHECK.
|
The protein models were analyzed for the side chain parameters
like standard deviations of chi1 gauche minus, trans
and plus, pooled standard deviations of chi1 with respect to
refined structure. The standard deviations of Chi1 gauche
minus, trans and plus are in better range and within limits
for EMV2 hypothetical protein model.
Computations Performed in the EMV2 Model
Computation of Electrostatic Potential
The electrostatic potential of the residues in the hypothetical
protein was computed based on Coulomb’s method.
The cloud of charged residues is represented in blue and
red colors and the proteins visualized as density map of
EMV2 (Fig. 3).
|
Figure 3: Electron density map of EMV2.
|
Computation of Force Field Energy
Force field energy was computed for EMV2 protein model.
Positive values were observed (64x108 forEMV2). Model
refinement was done by energy minimization. Energy minimization
was carried out to reduce clashing amino acids,
using GROMOS96 force field algorithm. Decrease in the
force field energy was observed for both the protein models
after successive energy minimization (-1718 for EMV2).
The energy minimized models will however needs further
refinement in order to reduce the non-bonded interactions
for the model to be judged as a good homology model.
Discussion
3D Modeling of EMV2, LEA Protein from Vigna Radiata
Prediction of tertiary structure of a protein molecule signifies
an important step towards understanding the structure–function relationships in the concerned protein family.
Recently, the first solution structure of a LEA protein, LEA14
from Arabidopsis thaliana has been reported (Singh et al.,
2005). In the present study, model of EMV2 LEA protein of
Vigna radiata was generated from the LOOPP server, based
on the structural homologues derived from the SCOP and
protein data banks.
There exists biological sequence-structure deficit with
more than 3 lakhs protein sequences and millions of partial
nucleotide sequences, available in the public non-redundant
databases (Boguski et al., 1994); and by contrast, the number
of unique 3D structures in the protein data bank is still
less than 1500 (Attwood and Parry-Smith, 2005). The difference
of scale in sequence and structural information is
an important factor to be considered when assigning functions
to hypothetical proteins. Structure based functional
implications of such proteins have always been speculative.
Generally, under stress situations the plants may induce
formation of coiled coil / folding of the natively unfolded
proteins into more rigid structures upon binding to the partner
molecules. Since all natively unfolded proteins have
defined partner molecules that can be as small as nucleotide
or cations or a macromolecule, LEA proteins being
natively unfolded is believed to have such binding partners
to attain a rigid structures.
The LEA proteins from Vigna radiata are being classified
as Group 1 LEA protein because of its extreme hydrophilicity
and adoption of helical conformation as revealed by ab
initio secondary structure predictions, in combination with the predominant random-coiled arrangement of the residues
of Vigna LEA protein, is hypothesized to function as water
replacement molecule. Such a property may facilitate hydrogen
bonding of this EMV proteins with essentially any
macromolecular or membrane surface. However, additional
experiments on physico-chemical analyses including examination
of hydration properties of these proteins need to be
done to determine if EMV proteins can adopt certain structures
upon interaction with other macromolecules.
Structure homologues identified for these EMV proteins
show closest structural homology to proteins with helical
bundles of small proteins and DNA/RNA binding proteins.
These observations are contradictory to the earlier findings
from our group that the low molecular weight protein isolated
from Vigna was believed to be located in the cytoplasm
The structural motifs of these proteins are predicted
to form amphipathic a-helices which may be important for
their function in protecting cells against dehydration. However,
not all LEA proteins are folded and structured. Group
1 LEA proteins are reported to be very hydrophilic, loosely
structured with predominantly random-coiled structures.
These proteins are reported to form regular a-helical structure
when subjected to altered physiological conditions.
Observations from ab initio predictions of these Em proteins
of Vigna indicate 32.32% of EMV2 proteins attains
helical conformation as represented by helical blocks in the
3D hypothetical models (Manickam and Carlier, 1980).
Temperature-induced extended helix/random coil transition
was reported for a Group 1 LEA protein from soybean.
These proteins are by native, largely unstructured but attained
6- 14% helical conformation under temperature stress
or at high salt concentrations (Soulages et al., 2002). Similar
reports from (Goyal et al. 2003) for AavLEA1, a Group
3 LEA protein from the nematode, Aphelenchus avenae indicate
oligomerization of these proteins in immunoblotting
and cross-linking experiments, however majority of these
proteins was found to be monomeric in analytical ultracentrifugation
and gel filtration studies. Also, formation of
a-helical structures on drying was reported in partially characterized
protein from Typha latifolia, probably a Group 3
LEA protein based on Fourier transform-Infra red (FT-IR)
spectroscopy studies (Wolkers et al., 2001).
The LEA proteins from Vigna radiata are being classified
as Group 1 LEA protein because of its extreme hydrophilicity
and adoption of helical conformation as revealed by ab
initio secondary structure predictions, in combination with
the predominant random-coiled arrangement of the residues
of Vigna LEA protein, is hypothesized to function as water replacement molecule. Such a property may facilitate hydrogen
bonding of this EMV proteins with essentially any
macromolecular or membrane surface. However, additional
experiments on physico-chemical analyses including examination
of hydration properties of these proteins need to be
done to determine if EMV proteins can adopt certain structures
upon interaction with other macromolecules.
Validation of the Model
The hypothetical protein model generated was subjected
to structure validation, for testing the accuracy of the model.
The quality of the final ensemble of conformers was assessed
using PROCHECK, a protein structure validation
program. The visual displays of the models were performed
with either the Swiss PDB viewer (Guex and Peitsch, 1997)
or RasTop (Sayle and Milner-white, 1995).
Stereochemical parameters of the proteins like main-and
side chains data of EMV2 was considered for determining
the quality of the model. The main chain parameters like
Ramachandran plot quality; peptide bond planarity, C-alpha
chirality and over-all G factor are found to be within the
limits for the model. However, the bad contacts per 100
residues are high. The side chain parameters are in better
range and within the limits for EMV2. These parameters
are compared to essentially satisfy the generated models
with well-refined structures at similar resolution as described
by (Morris et al. 1992).
The validation reports for the protein models are analyzed,
and energy minimization of the models was made after
checking the force field energy of the models. For a model
to be validated based on quality, a good quality protein model
should have 90% or more residues in the most favored regions
of quadrangle in the Ramachandran plot. In the generated
model of EMV2, the distribution of residues in the
most favored regions is 87.6 and 93 %, respectively. This
infers that EMV2 as good hypothetical protein model. Also,
a tertiary structure of a protein can be worth considering
only from its solution structures, obtained from the experimentations
using either NMR or crystallographic studies.
The homology model of mungbean LEA proteins, thus generated
in this study, could some extent stimulate investigations
at determining the mechanistic function of this stress
associated proteins.
Future Perspectives
Functional implications based on the structural homologues
hypothesize EMV2 to act as a water replacement molecule.
LEA proteins generally are reported to posses multiple functions
like salt, drought, heat and cold tolerance. Hence, the present study will be useful in further in vitro studies by
over expression in model systems like E.coli or yeast cells
and the recombinant protein can be subjected to Salinity,
Cold shock, thermal stability analysis and the stress induced
structural changes can be monitored to ascertain the possible
functions of this important class of proteins.
Acknowledgements
S.Rajesh is grateful to the Council of Scientific and Industrial
Research, New Delhi for grant of Research Fellowship.
We are grateful to the crews of NCBI, EBI, MRC
Lab-UK and SIB for making computational biology data/
tools publicly available.
Reference
-
Attwood TK, Parry SDJ (2005) In. Introduction to
Bioinformatics. Ed Wood (ed.). Pearson Education Asia
ltd India p3.
- Boguski MS, Tolstoshev CM, Bassett DE (1994) Gene
discovery in DBEST. Science 265: 194-199. [ FIND THIS ARTICLE ONLINE ]
- Bray EA (1993) Molecular responses to water deficit.
Plant Physiol 103: 1035-1040. [ FIND THIS ARTICLE ONLINE ]
- Bray EAJ, Bailey SE, Weretilnyk E (2000) Responses
to abiotic stresses, In: Biochemistry and Molecular Biology
of Plants. Buchanan W Gruissem R Jones (Eds.)
American Society of Plant Physiologists pp1158-1176.
- Ceccardi TL, Meyer NC, Close TJ (1994) Purification
of a maize dehydrin. Protein Express Purif 5: 266-269. [ FIND THIS ARTICLE ONLINE ]
- Close TJ (1997) Dehydrins: A commonality in the response
of plants to dehydration and low temperature.
Physiol Plant 100: 291-296. [ FIND THIS ARTICLE ONLINE ]
- Dure L (1993) A repeating 11-mer amino acid motif and
plant desiccation. Plant J 3: 363-369. [ FIND THIS ARTICLE ONLINE ]
- Gilles GJ, Hines KM, Manfre AJ, Marcotte WR (2007)
A predicted N-terminal helical domain of a Group 1 LEA
protein is required for protection of enzyme activity from
drying. Plant Physiol Biochem 45: 389-399. [ FIND THIS ARTICLE ONLINE ]
- Goyal K, Tisi L, Basran A, Browne J, Burnell A, etal.
(2003) Transition from natively unfolded to folded state
induced by desiccation in an anhydrobiotic nematode
protein. J Biol Chem 278: 129770-12984. [ FIND THIS ARTICLE ONLINE ]
- Guex N, Peitsch MC (1997) SWISS-MODEL and the
Swiss-PdbViewer: An environment for comparative protein modeling. Electrophoresis 18: 2714-2723. [ FIND THIS ARTICLE ONLINE ]
- Imai R, Chang L, Ohta A, Bray EA, Takagi M (1996) A
lea-class gene of tomato confers salt and freezing tolerance
when expressed in Saccharomyces cerevisiae.
Gene 170: 243-248. [ FIND THIS ARTICLE ONLINE ]
- Kazuoka T, Oeda K (1994) Purification and characterization
of COR85-oligomeric complex from cold-acclimated
spinach. Plant Cell Physiol 35: 601-611. [ FIND THIS ARTICLE ONLINE ]
- Laskowski RA, MacArthur MW, Moss DS, Thornton
JM (1993) PROCHECK: a program to check the stereochemical
quality of protein structures. J Appl Cryst 26:
283-291. [ FIND THIS ARTICLE ONLINE ]
- Manickam A, Carlier AR (1980) Isolation and Function
of a Low Molecular Weight Protein of Mung Bean Embryonic
Axes. Planta 149: 234-240. [ FIND THIS ARTICLE ONLINE ]
- Manickam A, Van Damme EJM, Kalaiselvi K, Verhaert
PW, Peumans WJ (1996) Isolation and cDNA cloning
of an Em-like protein from mungbean (Vigna radiata)
axes. Physiol Plant 97: 524-530. [ FIND THIS ARTICLE ONLINE ]
- Morris AL, MacArthur MW, Hutchinson EG, Thornton
JM (1992) Stereochemical quality of protein structure
coordinates. Proteins 12: 345-364. [ FIND THIS ARTICLE ONLINE ]
- Rajesh S, Manickam A (2006) Prediction of functions
for two LEA proteins from mungbean. Bioinformation
1: 133-138. [ FIND THIS ARTICLE ONLINE ]
- Sayle RA, Milner WEJ (1995) RASMOL: Biomolecular graphics for all. Trends Biochem Sci 20: 374-376. [ FIND THIS ARTICLE ONLINE ]
- Singh S, Cornilescu CC, Tyler RC, Cornilescu G, Tonelli
M, etal. (2005) Solution structure of a late embryogenesis
abundant protein (LEA14) from Arabidopsis thaliana,
a cellular stress-related protein. Protein Science 14: 2601-
2609. [ FIND THIS ARTICLE ONLINE ]
- Sivamani E, Bahieldin A, Wraith JM, Al-Niemi T, Dyer
WE, etal. (2000) Improved biomass productivity and water
use efficiency under water deficit conditions in
transgenic wheat constitutively expressing the barley
HVA1 gene. Plant Sci 155: 1-9. [ FIND THIS ARTICLE ONLINE ]
- Soulages JL, Kim K, Walters C, Cushman JC (2002)
Temperature induced extended helix/random coil transitions
in a group 1 late embryogenesis-abundant protein
from soybean. Plant Physiol 128: 822-832. [ FIND THIS ARTICLE ONLINE ]
- Teodorescu O, Galor T, Pillardy J, Elber R (2004) Enriching
the sequence substitution matrix by structural information.
Proteins Str Fun Genet 54: 41-48. [ FIND THIS ARTICLE ONLINE ]
- Wolkers WF, McCready S, Brandt S, Lindsey GG,
Hoekstra FA (2001) Isolation and characterization of a
D-7 LEA protein from pollen that stabilizes glasses in
vitro. Biochim Biophys Acta 1544: 196-206. [ FIND THIS ARTICLE ONLINE ]
- Xu D, Duan X, Wang B, Hong B, Ho TD, etal. (1996)
Expression of a late embryogenesis abundant protein
gene, HVA1, from barley confers tolerance to water
deficit and salt stress in transgenic rice. Plant Physiol
110: 249-257. [ FIND THIS ARTICLE ONLINE ]
Top
|
|