Skip Navigation



Nucleic Acids Research Advance Access published online on August 15, 2007

Nucleic Acids Research, doi:10.1093/nar/gkm537
This Article
Right arrow Full Text Freely available
Right arrow Print PDF (2851K) Freely available
Right arrow Screen PDF (750K) Freely available
Right arrow Supplementary Material
Right arrowOA All Versions of this Article:
35/16/e102    most recent
gkm537v2
gkm537v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Calza, S.
Right arrow Articles by Pawitan, Y.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Calza, S.
Right arrow Articles by Pawitan, Y.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© 2007 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.


Computational Biology

Filtering genes to improve sensitivity in oligonucleotide microarray data analysis

Stefano Calza1,2, Wolfgang Raffelsberger3, Alexander Ploner1, Jose Sahel4, Thierry Leveillard4 and Yudi Pawitan1,*

1Department of Medical Epidemiology and Biostatistics - Karolinska Institute, Stockholm, Sweden, 2Section of Medical Statistics and Biometry, Department of Biomedical Sciences and Biotechnology - University of Brescia, Italy, 3Laboratoire de Bioinformatique et Génomique Intégratives, Institut de Génétique et de Biologie Moléculaire et Cellulaire (IGBMC), Illkirch Strasbourg, France and 4Laboratoire de Physiopathologie Cellulaire et Moléculaire de la Retine - Faculté de Médecine, Université Pierre et Marie Curie, Paris, France

*To whom correspondence should be addressed. Tel: +46-8-5248 3983; Fax: +46-8-314 975; Email: yudi.pawitan{at}ki.se

Received March 27, 2007. Revised June 29, 2007. Accepted July 3, 2007.

Many recent microarrays hold an enormous number of probe sets, thus raising many practical and theoretical problems in controlling the false discovery rate (FDR). Biologically, it is likely that most probe sets are associated with un-expressed genes, so the measured values are simply noise due to non-specific binding; also many probe sets are associated with non-differentially-expressed (non-DE) genes. In an analysis to find DE genes, these probe sets contribute to the false discoveries, so it is desirable to filter out these probe sets prior to analysis. In the methodology proposed here, we first fit a robust linear model for probe-level Affymetrix data that accounts for probe and array effects. We then develop a novel procedure called FLUSH (Filtering Likely Uninformative Sets of Hybridizations), which excludes probe sets that have statistically small array-effects or large residual variance. This filtering procedure was evaluated on a publicly available data set from a controlled spiked-in experiment, as well as on a real experimental data set of a mouse model for retinal degeneration. In both cases, FLUSH filtering improves the sensitivity in the detection of DE genes compared to analyses using unfiltered, presence-filtered, intensity-filtered and variance-filtered data. A freely-available package called FLUSH implements the procedures and graphical displays described in the article.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Brief Funct Genomic ProteomicHome page
F. Cordero, M. Botta, and R. A. Calogero
Microarray data analysis and mining approaches
Brief Funct Genomic Proteomic, January 22, 2008; (2008) elm034v1.
[Abstract] [Full Text] [PDF]



Disclaimer: Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.