Principal Component Analysis using Singular Value Decomposition of Microarray Data
Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 33156
Principal Component Analysis using Singular Value Decomposition of Microarray Data

Authors: Dong Hoon Lim

Abstract:

A series of microarray experiments produces observations of differential expression for thousands of genes across multiple conditions. Principal component analysis(PCA) has been widely used in multivariate data analysis to reduce the dimensionality of the data in order to simplify subsequent analysis and allow for summarization of the data in a parsimonious manner. PCA, which can be implemented via a singular value decomposition(SVD), is useful for analysis of microarray data. For application of PCA using SVD we use the DNA microarray data for the small round blue cell tumors(SRBCT) of childhood by Khan et al.(2001). To decide the number of components which account for sufficient amount of information we draw scree plot. Biplot, a graphic display associated with PCA, reveals important features that exhibit relationship between variables and also the relationship of variables with observations.

Keywords: Principal component analysis, singular value decomposition, microarray data, SRBCT

Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1087562

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 3258

References:


[1] Alter, O, Brown, P. O. and Botstein, D. (2000). Singular value decomposition for genome-wide expression data processing and modeling PNAS 97: 10101-10106.
[2] Bradu, D., Gabriel K.R.(1978). The biplot as a diagnostic tool for models of two-way tables. Technometrics, 20, 47-68.
[3] Deshmukh, S. R. and Purohit, S. G. (2007)). Microarray Data, Statistical Analysis Usng R, Alpha Science International Ltd.
[4] Everitt, B. and T Hothorn. (2011). An Introduction to Applied Multivariate Analysis with R (Use R!). Springer, New York, NY.
[5] Geladi, Paul; Kowalski, Bruce (1986). ”Partial Least Squares Regression: A Tutorial”. Analytica Chimica Acta 185: 1?17.
[6] Khan, J. and Wei, J. S. and Ringner, M. and Saal, L. H. and Ladanyi, M. and Westermann, F. and Berthold, F. and Schwab, M. and Antonescu, C. R. and Peterson, C. and Meltzer, P. S. (2001). Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks, Nature Medecine, 7, 673-679.
[7] Wall M.E., Dyck P.A., Brettin T.S.(2001). SVDMAN – singular value decomposition analysis of microarray data. Bioinformatics 17:566-68.
[8] Will, T. (1999). Introduction to the singular value decomposition.