Skip Navigation

Nucleic Acids Research 2006 34(4):1261-1269; doi:10.1093/nar/gkl013
This Article
Right arrow Full Text Freely available
Right arrow Print PDF (501K) Freely available
Right arrow Screen PDF (276K) Freely available
Right arrow Supplementary Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (17)
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Ma, P.
Right arrow Articles by Liu, J. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ma, P.
Right arrow Articles by Liu, J. S.
Related Collections
Right arrow Computational methods
Right arrow Monitoring gene expression
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Published online 1 March 2006

© The Author 2006. Published by Oxford University Press. All rights reserved
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use, please contact journals.permissions{at}oxfordjournals.org


Article

A data-driven clustering method for time course gene expression data

Ping Ma, Cristian I. Castillo-Davis, Wenxuan Zhong and Jun S. Liu*

Department of Statistics, Harvard University Cambridge, MA 02138, USA

*To whom correspondence should be addressed. Tel: +1 617 495 1600; Fax: +1 617 496 8057; Email: jliu{at}stat.harvard.edu

Received December 1, 2005. Revised January 29, 2006. Accepted February 13, 2006.

Gene expression over time is, biologically, a continuous process and can thus be represented by a continuous function, i.e. a curve. Individual genes often share similar expression patterns (functional forms). However, the shape of each function, the number of such functions, and the genes that share similar functional forms are typically unknown. Here we introduce an approach that allows direct discovery of related patterns of gene expression and their underlying functions (curves) from data without a priori specification of either cluster number or functional form. Smoothing spline clustering (SSC) models natural properties of gene expression over time, taking into account natural differences in gene expression within a cluster of similarly expressed genes, the effects of experimental measurement error, and missing data. Furthermore, SSC provides a visual summary of each cluster's gene expression function and goodness-of-fit by way of a ‘mean curve’ construct and its associated confidence bands. We apply this method to gene expression data over the life-cycle of Drosophila melanogaster and Caenorhabditis elegans to discover 17 and 16 unique patterns of gene expression in each species, respectively. New and previously described expression patterns in both species are discovered, the majority of which are biologically meaningful and exhibit statistically significant gene function enrichment. Software and source code implementing the algorithm, SSCLUST, is freely available (http://genemerge.bioteam.net/SSClust.html).


The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BiostatisticsHome page
R. Tang and H.-G. Muller
Time-synchronized clustering of gene expression trajectories
Biostat., January 1, 2009; 10(1): 32 - 45.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
C.-T. Li, Y. Yuan, and R. Wilson
An unsupervised conditional random fields approach for clustering gene expression time series
Bioinformatics, November 1, 2008; 24(21): 2467 - 2473.
[Abstract] [Full Text] [PDF]


Home page
GeneticsHome page
B.-R. Kim, L. Zhang, A. Berg, J. Fan, and R. Wu
A Computational Approach to the Functional Clustering of Periodic Gene-Expression Profiles
Genetics, October 1, 2008; 180(2): 821 - 834.
[Abstract] [Full Text] [PDF]


Home page
Cancer Prevention ResearchHome page
Z. H. Gumus, B. Du, A. Kacker, J. O. Boyle, J. M. Bocker, P. Mukherjee, K. Subbaramaiah, A. J. Dannenberg, and H. Weinstein
Effects of Tobacco Smoke on Gene Expression and Cellular Pathways in a Cellular Model of Oral Leukoplakia
Cancer Prevention Research, July 1, 2008; 1(2): 100 - 111.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Sahoo, D. L. Dill, R. Tibshirani, and S. K. Plevritis
Extracting binary signals from microarray time-course data
Nucleic Acids Res., June 28, 2007; 35(11): 3705 - 3712.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
L. Wang, G. Chen, and H. Li
Group SCAD regression analysis for microarray time course gene expression data
Bioinformatics, June 15, 2007; 23(12): 1486 - 1494.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.