<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD 2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
	<front>
		<journal-meta>
			<journal-id journal-id-type="nlm-ta">J Proteomics Bioinform</journal-id>
			<journal-id journal-id-type="publisher-id">opg</journal-id>						
			<journal-title>Journal of Proteomics &amp; Bioinformatics</journal-title>			 
			<issn pub-type="epub">0974-276X</issn>
			<publisher>
				<publisher-name>OMICS Publishing Group</publisher-name>
				<publisher-loc>India, USA</publisher-loc>
			</publisher>
		</journal-meta>
		<article-meta>			
			<article-id pub-id-type="publisher-id">000063</article-id>
			<article-categories>
				<subj-group subj-group-type="heading">
					<subject>Research Article</subject>
				</subj-group>
				<subj-group subj-group-type="Discipline">
					<subject>Biochemistry</subject>
				</subj-group>
				<subj-group subj-group-type="System Taxonomy">
					<subject>Proteomics</subject>
					<subject>Bioinformatics</subject>
					<subject>Genomics</subject>
					<subject>Transcriptomics</subject>
					<subject>Biomarkers</subject>
				</subj-group>
			</article-categories>
			<title-group>
				<article-title>IMPPDS - Indian Medicinal Plants Protein Dataset</article-title>
			</title-group>
			<contrib-group>
				<contrib contrib-type="author">
					<name>
						<surname>Anandakumar</surname>
						<given-names>S.</given-names>
					</name>
				</contrib>
				<contrib contrib-type="author">
					<name>
						<surname>Saravanan</surname>
						<given-names>V.</given-names>
					</name>
				</contrib>
				<contrib contrib-type="author">
					<name>
						<surname>Shanmughave</surname>
						<given-names>P.</given-names>
					</name>
					<xref ref-type="corresp" rid="cor1">&ast;</xref>
				</contrib>
			</contrib-group>
			<aff id="af1">DBT Bioinformatics Centre, Department of Bioinformatics, Bharathiar University</aff>
			<author-notes>
				<corresp id="cor1">&ast; To whom correspondence should be addressed: P. Shanmughavel, Coimbatore – 641046, TamilNadu, India, E-mail: <email>shanvel_99@yahoo.com</email></corresp>
			</author-notes>
			<pub-date pub-type="collection">
			     <month>07</month>
				 <year>2008</year>
			</pub-date>
			<pub-date pub-type="epub">
				<day>17</day>
				<month>07</month>
				<year>2008</year>
			</pub-date>			
			<volume>1</volume>
			<issue>4</issue>
			<fpage>230</fpage>
			<lpage>232</lpage>
			<history>
			<date date-type="received">
			     <day>02</day>
				 <month>07</month>
				 <year>2008</year>
			</date>
			<date date-type="accepted">
			      <day>16</day>
				  <month>07</month>
				  <year>2008</year>
			</date>
			</history>
			<permissions>			
			<copyright-statement><bold>Copyright:</bold> &copy; 2008 S Anandakumar, etal.</copyright-statement>
			<copyright-year>2008</copyright-year>
			<license license-type="open access">
			 <p>Anandakumar et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.</p>
			</license>
			</permissions>			
			<abstract>
				<p>There is a growing concenter on the importance of medicinal plants in modern health care. The medicinal plant proteins have therapeutic, cosmetic, and other beneficial properties for mankind to treat various diseases. In this regard, 3D models of medicinal plants protein will enhance our knowledge about the identification of new drugs for various diseases and disorders. So, in this work we modeled about 181 various plant proteins of 18 different medicinal plants using molecular modeling techniques (Indian medicinal plants) and made into a dataset IMPPDS. The models are constructed using MODELLER 9v2, with careful manual pair wise alignment. The model templates are obtained based on the sequence similarity (> 40% identity) of known protein structure using BLASTP (Basic Local Alignment Tool for Protein). One or more potential structural templates were selected from PDB (Protein Databank). We used iterative conjugate gradient method for optimizing the model and PROCHECK for validation.</p>
			<p><bold>Availability:</bold><ext-link ext-link-type="uri" xlink:href="www.mmppdb.googlepages.com/index.htm"> http://mmppdb.googlepages.com/index.htm</ext-link></p>
			</abstract>
			<kwd-group>
				<kwd>Medicinal Plant</kwd>
				<kwd>Molecular Modeling</kwd>
				<kwd>Pairwise alignment</kwd>
				<kwd>Dataset</kwd>
				<kwd>optimizing</kwd>
			</kwd-group>
			<custom-meta-wrap>
				<custom-meta>
					<meta-name>citation</meta-name>
					<meta-value>S Anandakumar, V Saravanan, P Shanmughavel (2008)IMPPDS - Indian Medicinal Plants Protein Dataset</meta-value>
				</custom-meta>
			</custom-meta-wrap>
		</article-meta>
	</front>
	<body>
		<sec>
			<title>Introduction</title>
				<p>Many of the medicinal plant proteins have therapeutic, cosmetic, or other beneficial properties (Balunas and Kinghorn, 2005). Three-dimensional structures for most of the Indian medicinal plants protein are limited <ext-link ext-link-type="uri" xlink:href="www.modbase.compbio.ucsf.edu/modbase-cgi/index.cgi">http://modbase.compbio.ucsf.edu/modbase-cgi/index.cgi</ext-link>, <ext-link ext-link-type="uri" xlink:href="www.rcsb.org/pdb">http://www.rcsb.org/pdb</ext-link>. Detailed study on such proteins structure will provide room for innovative applications and help the researchers to have a better insight on the functions and molecular properties of the proteins and to come out with new therapeutic values from corresponding protein <ext-link ext-link-type="uri" xlink:href="www.sristi.org/cms/">http://www.sristi.org/cms/</ext-link>. Also, there exists the obstacle of designing the accurate protein models based on automated computational modeling, chiefly in pair wise alignment (Kolinski and Gront, 2007). So, we performed the modeling procedure in a semi-automated manner with necessary manual pair wise alignment and error interpretation. In this article we described about the dataset of protein model containing 181 different proteins from 18 different medicinal plants.</p>
		</sec>
		<sec sec-type="methods">
			<title>Methodology</title>
				<sec>
					<title>Target Sequence Retrieval</title>
						<p>Names of Medicinal plants protein which have medicinal properties were obtained from the SRISTI <ext-link ext-link-type="uri" xlink:href="www.sristi.org/cms/">http://www.sristi.org/cms/</ext-link> and sequences were taken from dataset of Swissprot/TrEMBL <ext-link ext-link-type="uri" xlink:href="www.expasy.ch/sprot/">http://www.expasy.ch/sprot/</ext-link>. All the taken protein sequences were ascertained that the three-dimensional structure of the corresponding proteins were not available in Protein Data Bank and Modbase <ext-link ext-link-type="uri" xlink:href="www.modbase.compbio.ucsf.edu/modbase-cgi/index.cgi">http://modbase.compbio.ucsf.edu/modbase-cgi/index.cgi</ext-link>, <ext-link ext-link-type="uri" xlink:href="www.rcsb.org/pdb">http://www.rcsb.org/pdb</ext-link>. Totally 181 proteins sequence from 18 different medicinal plants were taken.</p>
				</sec>
				<sec>
					<title>Template Searching</title>
						<p>One or more suitable template proteins for each of the annotated medicinal plants protein were identified based on the sequence similarity (= 40% identity) using BLASTP (Basic Local Alignment Tool for Protein). For each candidate, templates with higher similarity were obtained from Protein Databank <ext-link ext-link-type="uri" xlink:href="www.rcsb.org/pdb">http://www.rcsb.org/pdb</ext-link>.</p>
				<fig id="g1">
					<label>Figure 1</label>
					<caption>
						<title>Indian Medicinal Plants Protein Dataset, the dataset snapshot and visual models are shown</title>
					</caption>
					<graphic xlink:href="JPB-01-230-g001.tif"/>
				</fig>
				</sec>
				<sec>
					<title>Sequence Alignment</title>
						<p>Sequence alignment of target and template proteins were derived using the program align2d (alignment package in Modeler). Further, each alignment is manually curated for reduced gaps, insertions and deletions.</p>
				</sec>
				<sec>
					<title>Rough Model</title>
						<p>A rough 3-D model was constructed for each proteins from the sequence alignment between annotated protein and its corresponding template proteins using MODELLER 9v2 (Eswar et al, 2003) with default parameters of energy minimization value.</p>
				</sec>
				<sec>
					<title>Model Refinement</title>
						<p>Using the steepest descent and conjugate gradient technique (Sali and Blundell, 1993), the rough constructed models were solvated and subjected to constraint energy minimization. To eliminate bad contacts between protein atoms and structural water molecules the harmonic constraint was set to100 kJ/mol/Å2, applied for all protein atoms.</p>
				</sec>
				<sec>
					<title>Evaluation of Refined Model</title>
						<p>The refined structure of the models were subjected to a series of tests (Sali et al, 1995), which includes backbone conformation, evaluated by the inspection of the Psi/Phi Ramachandran plot obtained from PROCHECK <ext-link ext-link-type="uri" xlink:href="www.biochem.ucl.ac.uk/~roman/procheck/procheck.html">http://www.biochem.ucl.ac.uk/~roman/procheck/procheck.html</ext-link> analysis, packing quality of the refined structure, investigated by the calculation of PROCHECK Quality Control value and the models can be viewed using PyMol <ext-link ext-link-type="uri" xlink:href="www.delsci.com/rel/099/">http://www.delsci.com/rel/099/</ext-link> and RasMol <ext-link ext-link-type="uri" xlink:href="www.bernstein-plus-sons.com/software/RasMol_2.7.2.1.1/ README.html">http://www.bernstein-plus-sons.com/software/RasMol_2.7.2.1.1/ README.html</ext-link>.</p>
				</sec>
		</sec>
		<sec>
			<title>Usefullness And Biological Community</title>
				<p>This dataset contains 181 protein models [<xref ref-type="fig" rid="g1">figure 1</xref>] from eighteen different various Indian Medicinal plants such as Bauhinia variegata, Camellia sinensis, Allium cepa, Allium ascalonicum, Hibiscus rosasinensis, Punica granatum, Brassica juncea, Mangifera indica, Euphorbia pulcherrima, Aloe vera, Crotalaria juncea, Coriandrum sativum, Cuscuta reflexa, CyNodon dactylon, Citrus sinensis, Citrus reticulata, Citrus clementina and Citrus limon. Each protein models were curated and verified by manually. Detailed study on such modelled Indian medicinal plants protein structure will provide room for innovative applications and help the researchers to have a better insight on the functions and molecular properties of the proteins and to come out with new therapeutic values.</p>
		</sec>		
	</body>
	<back>
		<ref-list>
           <title>References</title>
		   <ref id="r1">
					<citation citation-type="journal">
				             <person-group>
							 <name>
							    <surname>Balunas</surname>
								<given-names>MJ</given-names>
							</name>
							<name>
								<surname>Kinghorn</surname>
								<given-names>AD</given-names>
							</name>							
							</person-group>
							<year>2005</year>
							<article-title>Drug discovery from medicinal plants</article-title>
							<source>Life Sci</source>
							<volume>8</volume>
							<fpage>431</fpage>
							<lpage>441</lpage>
			 	</citation>
				</ref>
				<ref id="r2">
					<citation citation-type="journal">
				             <person-group>
							 <name>
							    <surname>Eswar</surname>
								<given-names>N</given-names>
							</name><etal/>														
							</person-group>
							<year>2003</year>
							<article-title>Tools for comparative protein structure modeling and analysis</article-title>
							<source>Nucleic Acids Res</source>
							<volume>31</volume>
							<fpage>3375</fpage>
							<lpage>3380</lpage>
			 	</citation>
				</ref>
				<ref id="r3">
					<citation citation-type="journal">
				             <person-group>
							 <name>
							    <surname>Kolinski</surname>
								<given-names>A</given-names>
							</name>
							<name>
								<surname>Gront</surname>
								<given-names>D</given-names>
							</name>							
							</person-group>
							<year>2007</year>
							<article-title>Comparative modeling without implicit sequence alignments Bioinformatics</article-title>
							<source>Bioinformatics</source>
							<volume>23</volume>
							<fpage>2522</fpage>
							<lpage>2527</lpage>
			 	</citation>
				</ref>
				<ref id="r4">
					<citation citation-type="journal">
				             <person-group>
							 <name>
							    <surname>Sali</surname>
								<given-names>A</given-names>
							</name>
							<name>
								<surname>Blundell</surname>
								<given-names>TL</given-names>
							</name>							
							</person-group>
							<year>1993</year>
							<article-title>Comparative protein modelling by satisfaction of spatial restraints</article-title>
							<source>J Mol Biol</source>
							<volume>234</volume>
							<fpage>779</fpage>
							<lpage>815</lpage>
			 	</citation>
				</ref>
				<ref id="r5">
					<citation citation-type="journal">
				             <person-group>
							 <name>
							    <surname>Sali</surname>
								<given-names>A</given-names>
							</name><etal/>														
							</person-group>
							<year>1995</year>
							<article-title>Evaluation of comparative protein  modeling by MODELLER</article-title>
							<source>Proteins</source>
							<volume>23</volume>
							<fpage>318</fpage>
							<lpage>326</lpage>
			 	</citation>
				</ref>
</ref-list>				 
	</back>
</article>
