Principal Component Analysis using Singular Value Decomposition of Microarray Data
Authors: Dong Hoon Lim
Abstract:
A series of microarray experiments produces observations of differential expression for thousands of genes across multiple conditions. Principal component analysis(PCA) has been widely used in multivariate data analysis to reduce the dimensionality of the data in order to simplify subsequent analysis and allow for summarization of the data in a parsimonious manner. PCA, which can be implemented via a singular value decomposition(SVD), is useful for analysis of microarray data. For application of PCA using SVD we use the DNA microarray data for the small round blue cell tumors(SRBCT) of childhood by Khan et al.(2001). To decide the number of components which account for sufficient amount of information we draw scree plot. Biplot, a graphic display associated with PCA, reveals important features that exhibit relationship between variables and also the relationship of variables with observations.
Keywords: Principal component analysis, singular value decomposition, microarray data, SRBCT
Digital Object Identifier (DOI): doi.org/10.5281/zenodo.1087562
Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 3253References:
[1] Alter, O, Brown, P. O. and Botstein, D. (2000). Singular value decomposition
for genome-wide expression data processing and modeling PNAS
97: 10101-10106.
[2] Bradu, D., Gabriel K.R.(1978). The biplot as a diagnostic tool for models
of two-way tables. Technometrics, 20, 47-68.
[3] Deshmukh, S. R. and Purohit, S. G. (2007)). Microarray Data, Statistical
Analysis Usng R, Alpha Science International Ltd.
[4] Everitt, B. and T Hothorn. (2011). An Introduction to Applied Multivariate
Analysis with R (Use R!). Springer, New York, NY.
[5] Geladi, Paul; Kowalski, Bruce (1986). ”Partial Least Squares Regression:
A Tutorial”. Analytica Chimica Acta 185: 1?17.
[6] Khan, J. and Wei, J. S. and Ringner, M. and Saal, L. H. and Ladanyi, M.
and Westermann, F. and Berthold, F. and Schwab, M. and Antonescu, C.
R. and Peterson, C. and Meltzer, P. S. (2001). Classification and diagnostic
prediction of cancers using gene expression profiling and artificial neural
networks, Nature Medecine, 7, 673-679.
[7] Wall M.E., Dyck P.A., Brettin T.S.(2001). SVDMAN – singular value
decomposition analysis of microarray data. Bioinformatics 17:566-68.
[8] Will, T. (1999). Introduction to the singular value decomposition.