{"title":"Multiple Sequence Alignment Using Optimization Algorithms","authors":"M. F. Omar, R. A. Salam, R. Abdullah, N. A. Rashid","volume":5,"journal":"International Journal of Computer and Information Engineering","pagesStart":1512,"pagesEnd":1521,"ISSN":"1307-6892","URL":"https:\/\/publications.waset.org\/pdf\/14036","abstract":"<p>Proteins or genes that have similar sequences are likely to perform the same function. One of the most widely used techniques for sequence comparison is sequence alignment. Sequence alignment allows mismatches and insertion\/deletion, which represents biological mutations. Sequence alignment is usually performed only on two sequences. Multiple sequence alignment, is a natural extension of two-sequence alignment. In multiple sequence alignment, the emphasis is to find optimal alignment for a group of sequences. Several applicable techniques were observed in this research, from traditional method such as dynamic programming to the extend of widely used stochastic optimization method such as Genetic Algorithms (GAs) and Simulated Annealing. A framework with combination of Genetic Algorithm and Simulated Annealing is presented to solve Multiple Sequence Alignment problem. The Genetic Algorithm phase will try to find new region of solution while Simulated Annealing can be considered as an alignment improver for any near optimal solution produced by GAs.<\/p>\r\n","references":"[1] N. M. Luscombe, D. Greebaum, M. Gerstein, (2001). What is\r\nbioinformatics? A proposed definition and overview of the field.\r\nDepartment of Molecular Biophysics and Biochemistry, Yale University,\r\nUSA.\r\n[2] R. Musick, T. Slezak, T. Crithlow, (2001). An Overview of\r\nBioinformatics Research at Lawrence Livermore National Laboratory.\r\nJoint Genome Institute \/ Computing Application Organization .Lawrence\r\nLivermore National Laboratory.\r\n[3] Online Lectures on Bioinformatics (September 2003) (Online),\r\nAvailable: http:\/\/lectures.molgen.mpg.de\/Biol\/index.html.\r\n[4] P. C. Turner, A. G. McLennan, A. D Bates, M. R. H. White, (1997).\r\nNucleic Acid Structure. Instant Notes in Molecular Biology C1:31-35.\r\nBios Scientific Publishers Limited, Liverpool.\r\n[5] M. Tompa .(2000). Lecture Notes on Biological Sequence Analysis.\r\nTechnical Report #2000-06-01.Department of Computer Science and\r\nEngineering, University of Washington.\r\n[6] W. Zhong(2003). Using Travel Salesman Problem Algorithms To\r\ndetermine Multiple Sequence Alignment Orders. Master Thesis,\r\nUniversity of Georgia: Athens, Georgia.\r\n[7] C. Karostensky, G. Gonnet. Near Optimal Multiple Sequence Alignment\r\nusing a Traveling Salesman Problem approach. Swiss Federal Institute\r\nof Technology, Institute of Scientific Computing. ETH Zurich,\r\nSwitzerland (2000).\r\n[8] R. Choudry, (1999). Application of Evolutionary Algorithms for Multiple\r\nSequence Alignment. Stanford University.\r\n[9] D. J. Lipman, S. F. Altschul, J. D. Kececioglun, (1989). A tool for\r\nMultiple Sequence Alignment. Vol. 86.pp 4412-4415, Biochemistry.\r\nProc. Natl. Acad. Sci. USA.\r\n[10] M. Gribskov, J. Devereux, (1991). Similarity and Homology. Sequence\r\nAnalysis Primer.3: 89-157. Stockton Press. NY.\r\n[11] C. Notredame, (2002). Recent Progress in Multiple Sequence Alignment:\r\nA Survey., Pharmacogenomics, 3(1):131-144.\r\n[12] D. Higgins, (1999). Multiple Sequence Alignment. Genetics\r\nDatabases.9:165-183 Academic Press, London, UK .\r\n[13] J. D. Thompson, F. Plewniak, and O. Poch. (1999). A comprehensive\r\ncomparison of multiple sequence alignment programs. Nucleic Acids\r\nResearch, 27(13):2682-2690.\r\n[14] Introduction to hidden markov models (June 2003) (Online), Available: www.scs.leeds.ac.uk\/scsonly\/teachingmaterials\/HiddenMarko\r\nvModels\/html dev\/main.html.\r\n[15] Introduction to HMM (April 2003) (Online) Available:\r\nhttp:\/\/www.cs.brown.edu\/research\/ai\/dynamics\/tutorial\/Documents\/Hidd\r\nenMarkovModels.html.\r\n[16] S. R. Eddy, (1998). Profile Hidden Markov Models. Bioinformatics\r\nReview. Vol. 14. 9: 755-763. Oxford University Press.\r\n[17] P. Baldi, S. Brunak, Y. Chauven, A. Krogh (1997). Hidden Markov\r\nModel For Human Genes: Periodic Pattern in Exon Sequence.\r\nTheoretical and Computational Methods in Genome Research, 2: 15-32.\r\nPlenum Press, New York.\r\n[18] S. R. Eddy, (1996). Multiple Alignment Using Hidden Markov Models.\r\nDept. of Genetics, Washington University School of Medicine St. Louis,\r\nUS.\r\n[19] M. Gen, R. Cheng. (2000). Genetic Algorithms and Engineering\r\nOptimization. John Wiley & Sons: Canada.\r\n[20] D. Beasley, D. R. Bull, R. R. Martin, (1993). An Overview of Genetic\r\nAlgorithms: Part 1, Fundamentals. Inter-University Committee on\r\nComputing. 15(2) 58-69. [21] CCS 501 Neural Network and Genetic\r\nAlgorithm http:\/\/ office1.cs.usm.my.\r\n[21] C. Notredame, and D. G. Higgins, (1996). SAGA: sequence alignment\r\nby genetic algorithm, Nuc. Acids Res., 24(8), 1515-1524.\r\n[22] M. Isokawa, M. Wayama, T. Shimizu, (1996). Multiple Sequence\r\nAlignment using Genetic Algorithm. Department of Information Science,\r\nFaculty of Science, Hirosaki University, Japan.\r\n[23] J. T. Horng, C. N. Lin, B. H. Yang, and C. Y. Kao, (2001). A genetic\r\nalgorithm for multiple sequence alignment. In Poster in German\r\nConference on Bioinformatics.\r\n[24] S. Kirkpatrick, C. D. J. Gellat, and M. P. Vecchi, (1983). Optimization\r\nby Simulated Annealing, Science, 220 pp. 671-680.","publisher":"World Academy of Science, Engineering and Technology","index":"Open Science Index 5, 2007"}