WASET
	%0 Journal Article
	%A G. Lavanya Devi and  Allam Appa Rao and  A. Damodaram and  GR Sridhar and  G. Jaya Suma
	%D 2008
	%J International Journal of Biomedical and Biological Engineering
	%B World Academy of Science, Engineering and Technology
	%I Open Science Index 15, 2008
	%T Clustering Protein Sequences with Tailored General Regression Model Technique
	%U https://publications.waset.org/pdf/10604
	%V 15
	%X Cluster analysis divides data into groups that are
meaningful, useful, or both. Analysis of biological data is creating a
new generation of epidemiologic, prognostic, diagnostic and
treatment modalities. Clustering of protein sequences is one of the
current research topics in the field of computer science. Linear
relation is valuable in rule discovery for a given data, such as if value
X goes up 1, value Y will go down 3", etc. The classical linear
regression models the linear relation of two sequences perfectly.
However, if we need to cluster a large repository of protein sequences
into groups where sequences have strong linear relationship with
each other, it is prohibitively expensive to compare sequences one by
one. In this paper, we propose a new technique named General
Regression Model Technique Clustering Algorithm (GRMTCA) to
benignly handle the problem of linear sequences clustering. GRMT
gives a measure, GR*, to tell the degree of linearity of multiple
sequences without having to compare each pair of them.
	%P 67 - 71