Research Article (Open access) |
---|
Int. J. Life. Sci. Scienti. Res.,
3(3):
1039-1046,
May 2017
Structural Biology Laboratory,
Department of Bioinformatics, Bharathiar University,
Coimbatore, Tamil Nadu, India
*Address
for Correspondence: Princy Vijayababu,
Research Scholar, Structural Biology Laboratory, Department of Bioinformatics, Bharathiar University, Coimbatore-46, Tamil Nadu, India
Abstract- Online
Bioinformatics tool needs access through the World Wide Web, which requires
Internet service. To overcome this we programmed a new offline tool, ProGene1.0
for advanced protein and nucleotide analysis. ProGene
1.0 developed by programming language Microsoft Visual Basic 6.0 designed to predict chemical properties,
Molecular mass, Isoelectric point, Functional group
variation, Molecular composition, Nucleotide composition and Nucleotide to
protein conversion. ProGene 1.0 is a desktop based
tool that allows researchers to get the sequence level information on a novel
and unknown or hypothetical protein or nucleotide.
Keywords: Molecular mass, Microsoft Visual Basic6.0, Protein
sequence analysis, ProGene1.0
INTRODUCTION-
With the number of interdisciplinary subject in life
science, Bioinformatics is an emerging field interface of computer science
mathematics and biology [1]. It is a foundation of engineering
depends of both experimental and derived data. Nucleotide and protein sequence
data has considerably increased this pool of data [2-3]. Sequence
data and subsequence analysis by a bioinformatics tool is attracts most
attention, because it promises a considerable reduction in both time and cost [4].
Protein sequence comparison has become one of the most powerful tools for
characterizing protein sequences because of the enormous amount of information
that is preserved throughout the evolutionary process [5]. A general
approach for functional characterization of unknown proteins is to infer
protein functions based on sequence similarity. One of the successful
approaches is to define signatures of known families. Signatures usually
identify conserved regions among the family of proteins, revealing the
importance of their structural or physicochemical properties [6].
Relations between protein sequence and structure can be analyzed by either
determining the sequence features of predefined structures and properties [7]. Due to the advancement in Bioinformatics,
many tools were available for analysis of protein and nucleotide sequences [8].
One of the major drawbacks of those tools is their web based application which
needs to be accessed through the World Wide Web [9].
Visual
Basic is a language for developing graphical user interface [10]. It
is an event driven programming language where the program is not based on code
but it is more based on the event given [11]. Because of this, it is
mainly used to develop tools and software for Bioinformatics more preferably as
offline application. Most standalone Bioinformatics software has been developed
with VB only [12]. ProGene1.0
is also based on this language.
Fig
1: Architecture of ProGene 1.0
Fig 2: Home page of ProGene 1.0
Fig 3: Hierarchy of
capability of ProGene 1.0
Fig 4: Functional properties interface of ProGene 1.0
DISCUSSION-
The basic purpose of this tool is to get primary level information of protein.
One of the major application of ProGene 1.0 is
prediction of Chemical properties like Molecular weight [13] and Isoelectric point (PI) used in protein identification.
Proteins have an amazing range of structural and catalytic properties as a
result of their varying amino acid composition [14] thus calculating
amino acid composition is an another important application of ProGene 1.0. Functional group variation module capable to
calculate number of Polar, Non polar, Aromatics, Positive and Negative amino
acids. Molecular Composition application can compute five major chemical
compounds of protein molecule resembling Carbon, Hydrogen, Nitrogen, Oxygen,
and Sulfur [15] and total number of atoms present in proteins. It
also calculates Molecular formula of the protein sequence. Gene converter can
convert nucleotide sequences to their respective transcribed reverse
transcribed and translated sequences. Another important application of
ProGene1.0 is to provide the detailed information of the nucleotide sequences
via Nucleic acids Composition. It predicts total number of nucleotide
sequences, total composition of Adenine (A) and Thiamine (T) combination as
well as Guanine (G) and Cytosine (C) combination [16-17], to know
the stability of DNA structure [18] and the GC content percentage
especially for the optimized primer
design [19]. As a result ProGene 1.0
enables more flexibility and expandability compared with other online tools.
Author`s contribution-
NSB identified the need had input on tool design and features. SG designs the
work and also contributed to manuscript writings VP develop the tool and
manuscript. All the authors participated in the writing and approval of the
manuscripts. All authors read and approved the final manuscript.
REFERENCES
2.
Wang X, Slebos RJ, Wang D, Halvey PJ,
Tabb DL, Liebler DC,
and Zhang B. Protein identification using customized protein sequence
databases derived from RNA-Seq data. J Proteome
Res. 2012; 11(2), 1009–1017.
3.
Smalheiser
NR. Linking investigators. A centralized
linking facility for data sharing and coordination of samples in tissue
banks. EMBO Rep. 2003; 4(2):108-110.
4.
Moore A
and Brailsford T. Unified Hyperstructures
for Bioinformatics: Escaping the Application Prison. Journal of Digital
Information. 2006; 5(1):254.
5.
Dayhoff MO.,
Eck RV., and Park CM. Atlas of protein sequence and structure. Washington. National Biomedical Research Foundation. 1972.
6.
Hulo N, Sigrist CJ, Le Saux V, Langendijk-Genevaux PS, Bordoli
L, Gattiker A, De Castro E, Bucher P and Bairoch A. Recent improvements to the PROSITE database.
Nucleic Acids Res, 2004; 32:134-137.
7.
Bystroff C,
Simons KT, Han KF and Baker D. Local sequence-structure correlations in
proteins. Curr
Opin Biotechnol. 1996;
7(4):417-21.
8.
Xiong J.Essential Bioinformatics. New
York, Cambridge University Press. 2006.
9.
Afzal M1, Shahid AA, Shehzadi A, Nadeem S and Husnain T. RDNAnalyzer: A tool for DNA secondary structure prediction
and sequence analysis. Bioinformation. 2012;
8(14):687-690.
10. Haggard G, Hutchison W and Shibata C.
Introduction: Visual BASIC 6.0. 1st
ed, Bookboon, 2013.
11. Schneider ID. An Introduction to Programming
Using Visual Basic 6.0. 6th ed, USA,
University of Maryland, Prentic Hall publishers.
2006.
12. Jung SK, McDonald K. Visual gene developer: A
fully programmable bioinformatics software for synthetic gene optimization.
BMC Bioinformatics. 2011; 16; 12:340.
13. Gasteiger E, Hoogland C, Gattiker
A, Duvaud S, Wilkins RM, Appel
DR and Bairoch
A . Protein Identification and Analysis Tools on the ExPASy
Server. In The Proteomics Protocols Handbook, Humana Press. 2005;571-607.
14. Wade
LG. Organic chemistry.6th ed, India,
Pearson Education publisher. 2005.
15. Cohn EJ. Some Physical-Chemical
Characteristics of Protein Molecules. Chemistry Reviews.1939;24 (2): 203–232.
16. Lercher MJ, Urrutia AO, Pavlícek
A, Hurst LD. A unification of mosaic structures in the human genome. Hum Mol
Genet. 2003; 12(19):2411-2415.
17. Raymond A,
Lovell S, Lorimer D, Walchli
J, Mixon M,
Wallace E , Thompkins K , Archer K , Burgin A and
Stewart L . Combined protein construct and synthetic gene engineering for heterologous protein expression and crystallization using
Gene Composer. BMC Biotechnology, 2009; 9-37.
18. Lercher JM, Urrutia OA, Pavlíček
A,Laurence D. Hurst DL. A unification
of mosaic structures in the human genome. Hum Mol Genet. 2003; 12 (19):
2411-2415.
19. Kämpke T, Kieninger M, Mecklenburg M. Efficient primer design algorithms. Bioinformatics.
2001; 17 (3):214-225.