TY - GEN
T1 - Learning the local molecular pattern of Alzheimer's disease by non-negative matrix factorization
AU - Kong, Wei
AU - Mou, Xiaoyang
AU - Li, Qiao
AU - Song, Yipeng
PY - 2010
Y1 - 2010
N2 - Gene microarray technology is an effective tool for simultaneously monitoring the activity of multiple cellular pathways across thousands of genes on a single chip. Many clustering methods, such as hierarchical clustering, k-means, and self-organizing maps (SOM), have been developed to identify groups of genes or experimental conditions that exhibit similar expression patterns in gene expression data. These clustering algorithms have two limitations: they group genes (or conditions) based on global similarities in their expression profiles, and they assign each gene to only a single cluster. In this work we present a biclustering method, non-negative matrix factorization (NMF), to avoid these drawbacks and discover the local molecular pattern from gene expression datasets of Alzheimer's disease (AD). NMF can be applied to reduce the dimensionality of the data and describe the data as a positive linear combination of a reduced number of factors. By adding a sparseness enforcement variable to classical NMF, more local structures with meaningful biological information inherent in the data are captured by clustering genes and samples simultaneously, and the classification of samples is improved. Analysis and discussion of the identified local structures demonstrated that they relate many pathways that play a prominent role in AD, and the corresponding activation patterns, to AD phenotypes.
AB - Gene microarray technology is an effective tool for simultaneously monitoring the activity of multiple cellular pathways across thousands of genes on a single chip. Many clustering methods, such as hierarchical clustering, k-means, and self-organizing maps (SOM), have been developed to identify groups of genes or experimental conditions that exhibit similar expression patterns in gene expression data. These clustering algorithms have two limitations: they group genes (or conditions) based on global similarities in their expression profiles, and they assign each gene to only a single cluster. In this work we present a biclustering method, non-negative matrix factorization (NMF), to avoid these drawbacks and discover the local molecular pattern from gene expression datasets of Alzheimer's disease (AD). NMF can be applied to reduce the dimensionality of the data and describe the data as a positive linear combination of a reduced number of factors. By adding a sparseness enforcement variable to classical NMF, more local structures with meaningful biological information inherent in the data are captured by clustering genes and samples simultaneously, and the classification of samples is improved. Analysis and discussion of the identified local structures demonstrated that they relate many pathways that play a prominent role in AD, and the corresponding activation patterns, to AD phenotypes.
UR - http://www.scopus.com/inward/record.url?scp=77956567973&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=77956567973&partnerID=8YFLogxK
U2 - 10.1109/ICGCS.2010.5542987
DO - 10.1109/ICGCS.2010.5542987
M3 - Conference contribution
AN - SCOPUS:77956567973
SN - 9781424468775
T3 - 1st International Conference on Green Circuits and Systems, ICGCS 2010
SP - 621
EP - 625
BT - 1st International Conference on Green Circuits and Systems, ICGCS 2010
T2 - 1st International Conference on Green Circuits and Systems, ICGCS 2010
Y2 - 21 June 2010 through 23 June 2010
ER -