Nucleic Acids Research Advance Access originally published online on October 31, 2008
Nucleic Acids Research 2009 37(Database issue):D19-D25; doi:10.1093/nar/gkn765
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Nucleic Acids Research, 2009, Vol. 37, Database issue D19-D25
© 2008 The Author(s)
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
This article appears in the following Nucleic Acids Research issue: Database issue [View the issue table of contents]
Articles |
Petabyte-scale innovations at the European Nucleotide Archive
1EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK and 2Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
*To whom correspondence should be addressed. Tel: +44 (0) 1223 4925634; Fax: +44 (0) 1223 494 468; Email: cochrane{at}ebi.ac.uk
Received September 30, 2008. Revised October 3, 2008. Accepted October 6, 2008.
Dramatic increases in the throughput of nucleotide sequencing machines, and the promise of ever greater performance, have thrust bioinformatics into the era of petabyte-scale data sets. Sequence repositories, which provide the feed for these data sets into the worldwide computational infrastructure, are challenged by the impact of these data volumes. The European Nucleotide Archive (ENA; http://www.ebi.ac.uk/embl), comprising the EMBL Nucleotide Sequence Database and the Ensembl Trace Archive, has identified challenges in the storage, movement, analysis, interpretation and visualization of petabyte-scale data sets. We present here our new repository for next generation sequence data, a brief summary of contents of the ENA and provide details of major developments to submission pipelines, high-throughput rule-based validation infrastructure and data integration approaches.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
F. Valentin, S. Squizzato, M. Goujon, H. McWilliam, J. Paern, and R. Lopez Fast and efficient searching of biological data resources--using EB-eye Brief Bioinform, February 11, 2010; (2010) bbp065v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
The UniProt Consortium The Universal Protein Resource (UniProt) in 2010 Nucleic Acids Res., January 1, 2010; 38(suppl_1): D142 - D148. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Aranda, P. Achuthan, Y. Alam-Faruque, I. Armean, A. Bridge, C. Derow, M. Feuermann, A. T. Ghanbarian, S. Kerrien, J. Khadake, et al. The IntAct molecular interaction database in 2010 Nucleic Acids Res., January 1, 2010; 38(suppl_1): D525 - D531. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Robinson, K. Mistry, H. McWilliam, R. Lopez, and S. G. E. Marsh IPD--the Immuno Polymorphism Database Nucleic Acids Res., January 1, 2010; 38(suppl_1): D863 - D869. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. Kersey, D. Lawson, E. Birney, P. S. Derwent, M. Haimel, J. Herrero, S. Keenan, A. Kerhornou, G. Koscielny, A. Kahari, et al. Ensembl Genomes: Extending Ensembl across the taxonomic space Nucleic Acids Res., January 1, 2010; 38(suppl_1): D563 - D569. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Klucar, M. Stano, and M. Hajduk phiSITE: database of gene regulation in bacteriophages Nucleic Acids Res., January 1, 2010; 38(suppl_1): D366 - D370. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Shumway, G. Cochrane, and H. Sugawara Archiving next generation sequencing data Nucleic Acids Res., January 1, 2010; 38(suppl_1): D870 - D871. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Mcwilliam, F. Valentin, M. Goujon, W. Li, M. Narayanasamy, J. Martin, T. Miyar, and R. Lopez Web services at the European Bioinformatics Institute-2009 Nucleic Acids Res., July 1, 2009; 37(suppl_2): W6 - W10. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Imanishi and H. Nakaoka Hyperlink Management System and ID Converter System: enabling maintenance-free hyperlinks among major biological databases Nucleic Acids Res., July 1, 2009; 37(suppl_2): W17 - W22. [Abstract] [Full Text] [PDF] |
||||

