Nucleic Acids Research Advance Access published online on March 11, 2008
Nucleic Acids Research, doi:10.1093/nar/gkn110
Methods online
wuHMM: a robust algorithm to detect DNA copy number variation using long oligonucleotide microarray data
1Department of Internal Medicine and Department of Genetics, Division of Oncology, Stem Cell Biology Section, Washington University, St Louis, MO, 2Roche NimbleGen, Inc., Madison, WI and 3Institute for Pharmacogenomics and Individualized Therapy, University of North Carolina, Chapel Hill, NC, USA
*To whom correspondence should be addressed. Tel: 314 747 4437; Fax: 314 362 9333; Email: graubert{at}medicine.wustl.edu
Received September 18, 2007. Revised February 26, 2008. Accepted February 27, 2008.
Copy number variants (CNVs) are currently defined as genomic sequences that are polymorphic in copy number and range in length from 1000 to several million base pairs. Among current array-based CNV detection platforms, long-oligonucleotide arrays promise the highest resolution. However, the performance of currently available analytical tools suffers when applied to these data because of the lower signal-to-noise ratio inherent in oligonucleotide-based hybridization assays. We have developed wuHMM, an algorithm for mapping CNVs from array comparative genomic hybridization (aCGH) platforms comprising 385 000 to more than 3 million probes. wuHMM is unique in that it can utilize sequence divergence information to reduce the false positive rate (FPR). We apply wuHMM to 385K-aCGH, 2.1M-aCGH and 3.1M-aCGH experiments comparing the 129X1/SvJ and C57BL/6J inbred mouse genomes. We assess wuHMM's performance on the 385K platform by comparison to the higher resolution platforms and we independently validate 10 CNVs. The method requires no training data and is robust with respect to changes in algorithm parameters. At an FPR of <10%, the algorithm can detect CNVs with five probes on the 385K platform and three on the 2.1M and 3.1M platforms, resulting in effective resolutions of 24 kb, 2–5 kb and 1 kb, respectively.