Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 30172
Sequence-based Prediction of Gamma-turn Types using a Physicochemical Property-based Decision Tree Method

Authors: Chyn Liaw, Chun-Wei Tung, Shinn-Jang Ho, Shinn-Ying Ho

Abstract:

The γ-turns play important roles in protein folding and molecular recognition. The prediction and analysis of γ-turn types are important for both protein structure predictions and better understanding the characteristics of different γ-turn types. This study proposed a physicochemical property-based decision tree (PPDT) method to interpretably predict γ-turn types. In addition to the good prediction performance of PPDT, three simple and human interpretable IF-THEN rules are extracted from the decision tree constructed by PPDT. The identified informative physicochemical properties and concise rules provide a simple way for discriminating and understanding γ-turn types.

Keywords: Classification and regression tree (CART), γ-turn, Physicochemical properties, Protein secondary structure.

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1331255

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1217

References:


[1] E. Milner-White, B. M. Ross, R. Ismail, K. Belhadj-Mostefa, and R. Poet, "One type of gamma-turn, rather than the other gives rise to chain-reversal in proteins," J Mol Biol, vol. 204, pp. 777-82, Dec 5 1988.
[2] I. Alkorta, M. Suarez, R. Herranz, R. Gonzalez-Muniz, and M. Garcia-Lopez, "Similarity Study on Peptide?-turn Conformation Mimetics," J Mol Model, vol. 2, pp. 16-25, 1996.
[3] H. Kaur and G. P. Raghava, "A neural-network based method for prediction of gamma-turns in proteins from multiple sequence alignment," Protein Sci, vol. 12, pp. 923-9, May 2003.
[4] K. Guruprasad, S. Shukla, S. Adindla, and L. Guruprasad, "Prediction of gamma-turns from amino acid sequences," J Pept Res, vol. 61, pp. 243-51, May 2003.
[5] X. Hu and Q. Li, "Using support vector machine to predict beta- and gamma-turns in proteins," J Comput Chem, vol. 29, pp. 1867-75, Sep 2008.
[6] S. Jahandideh, A. S. Sarvestani, P. Abdolmaleki, M. Jahandideh, and M. Barfeie, "gamma-Turn types prediction in proteins using the support vector machines," J Theor Biol, vol. 249, pp. 785-90, Dec 21 2007.
[7] S. Jahandideh, S. Hoseini, M. Jahandideh, A. Hoseini, and F. M. Disfani, "Gamma-turn types prediction in proteins using the two-stage hybrid neural discriminant model," J Theor Biol, vol. 259, pp. 517-22, Aug 7 2009.
[8] W. Kabsch and C. Sander, "Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features," Biopolymers, vol. 22, pp. 2577-637, Dec 1983.
[9] L. Breiman, Classification and regression trees: Chapman & Hall/CRC, 1984.
[10] C.-W. Tung and S.-Y. Ho, "POPI: predicting immunogenicity of MHC class I binding peptides by mining informative physicochemical properties," Bioinformatics, vol. 23, pp. 942-9, Apr 15 2007.
[11] C.-W. Tung and S.-Y. Ho, "Computational identification of ubiquitylation sites from protein sequences," BMC Bioinformatics, vol. 9, p. 310, 2008.
[12] W.-L. Huang, C.-W. Tung, H.-L. Huang, S.-F. Hwang, and S.-Y. Ho, "ProLoc: Prediction of protein subnuclear localization using SVM with automatic selection from physicochemical composition features," Biosystems, Jan 4 2007.
[13] K.-T. Hsu, H.-L. Huang, C.-W. Tung, Y.-H. Chen, and S.-Y. Ho, "Analysis of physicochemical properties on prediction of R5, X4, and R5X4 HIV-1 coreceptor usage," Int J Biol Life Sci, vol. 5, pp. 208-15, 2009.
[14] S. Kawashima, P. Pokarowski, M. Pokarowska, A. Kolinski, T. Katayama, and M. Kanehisa, "AAindex: amino acid index database, progress report 2008," Nucleic Acids Res, vol. 36, pp. D202-5, Jan 2008.
[15] L.-T. Huang, M. M. Gromiha, and S.-Y. Ho, "iPTREE-STAB: interpretable decision tree based method for predicting protein stability changes upon mutations," Bioinformatics, vol. 23, pp. 1292-3, May 15 2007.
[16] M. Charton and B. I. Charton, "The structural dependence of amino acid hydrophobicity parameters," J Theor Biol, vol. 99, pp. 629-44, Dec 21 1982.
[17] D. H. Wertz and H. A. Scheraga, "Influence of water on protein structure. An analysis of the preferences of amino acid residues for the inside or outside and for specific conformations in a protein molecule," Macromolecules, vol. 11, pp. 9-15, Jan-Feb 1978.
[18] T. M. Mitchell, Machine Learning. New York: McGraw-Hill, 1997.
[19] M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, and I. Witten, "The WEKA data mining software: An update," ACM SIGKDD Explorations Newsletter, vol. 11, pp. 10-18, 2009.
[20] M. Charton and B. I. Charton, "The dependence of the Chou-Fasman parameters on amino acid side chain structure," J Theor Biol, vol. 102, pp. 121-34, May 7 1983.
[21] M. Geisow and R. Roberts, "Amino acid preferences for secondary structure vary with protein class," Int J Biol Macromol, vol. 2, pp. 387-389, 1980.
[22] M. Prabhakaran, "The distribution of physical, chemical and conformational properties in signal and nascent peptides," Biochem J, vol. 269, pp. 691-6, Aug 1 1990.
[23] K. T. O'Neil and W. F. DeGrado, "A thermodynamic scale for the helix-forming tendencies of the commonly occurring amino acids," Science, vol. 250, pp. 646-51, Nov 2 1990.
[24] D. E. Goldsack and R. C. Chalifoux, "Contribution of the free energy of mixing of hydrophobic side chains to the stability of the tertiary structure of proteins," J Theor Biol, vol. 39, pp. 645-51, Jun 1973.
[25] F. R. Maxfield and H. A. Scheraga, "Status of empirical methods for the prediction of protein backbone topography," Biochemistry, vol. 15, pp. 5138-53, Nov 16 1976.