Nucleic Acids Research Advance Access originally published online on November 27, 2006
Nucleic Acids Research 2007 35(Database issue):D61-D65; doi:10.1093/nar/gkl842
Nucleic Acids Research, 2007, Vol. 35, Database issue D61-D65
Published by Oxford University Press 2006
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Rm 6An.12J, 45 Center Drive, Bethesda, MD 20892-6510, USA
*To whom correspondence should be addressed. Tel: +1 301 435 5898; Fax: +1 301 480 2918; Email: pruitt{at}ncbi.nlm.nih.gov
Received September 20, 2006. Revised October 6, 2006. Accepted October 6, 2006.
NCBI's reference sequence (RefSeq) database (http://www.ncbi.nlm.nih.gov/RefSeq/) is a curated non-redundant collection of sequences representing genomes, transcripts and proteins. The database includes 3774 organisms spanning prokaryotes, eukaryotes and viruses, and has records for 2 879 860 proteins (RefSeq release 19). RefSeq records integrate information from multiple sources when additional data are available from those sources, and therefore represent a current description of the sequence and its features. Annotations include coding regions, conserved domains, tRNAs, sequence tagged sites (STS), variation, references, gene and protein product names, and database cross-references. Sequence is reviewed and features are added using a combined approach of collaboration and other input from the scientific community, prediction, propagation from GenBank and curation by NCBI staff. The format of all RefSeq records is validated, and an increasing number of tests are being applied to evaluate the quality of sequence and annotation, especially in the context of complete genomic sequence.
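As an illustration of the three record classes named above (genomes, transcripts and proteins), the following minimal Python sketch sorts RefSeq-style accessions by their two-letter prefix. The prefix table is an assumption drawn from general NCBI accession conventions rather than from this article, it covers only a subset of prefixes, and the function name `classify_accession` is hypothetical.

```python
import re

# Assumed subset of RefSeq accession prefixes (not listed in the article).
# Prefixes with a different accession layout, e.g. NZ_, are left out on purpose.
PREFIX_TO_CLASS = {
    "NC": "genomic", "NG": "genomic", "NT": "genomic", "NW": "genomic",
    "NM": "transcript", "NR": "transcript", "XM": "transcript", "XR": "transcript",
    "NP": "protein", "XP": "protein", "YP": "protein",
}

# Two uppercase letters, an underscore, digits, and an optional ".version" suffix.
ACCESSION_RE = re.compile(r"^([A-Z]{2})_(\d+)(?:\.(\d+))?$")


def classify_accession(accession):
    """Return 'genomic', 'transcript' or 'protein', or None if unrecognised."""
    match = ACCESSION_RE.match(accession.strip())
    if match is None:
        return None
    return PREFIX_TO_CLASS.get(match.group(1))


if __name__ == "__main__":
    for acc in ("NC_000001.10", "NM_000546.5", "NP_000537", "AB123456"):
        print(acc, classify_accession(acc))
```

Accessions without the underscore-delimited prefix, such as primary GenBank accessions, return None, so the sketch can also serve as a rough filter separating RefSeq records from other entries.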






