Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (321K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (105)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Pruitt, K. D.
Right arrow Articles by Maglott, D. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Pruitt, K. D.
Right arrow Articles by Maglott, D. R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2003, Vol. 31, No. 1 34-37
© 2003 Oxford University Press

NCBI Reference Sequence Project: update and current status

Kim D. Pruitt*, Tatiana Tatusova and Donna R. Maglott

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A Room 6N605, 8600 Rockville Pike, Bethesda, MD 20894, USA

*To whom correspondence should be addressed. Email: pruitt{at}ncbi.nlm.nih.gov

Received September 14, 2002; Accepted September 24, 2002

ABSTRACT

The goal of the NCBI Reference Sequence (RefSeq) project is to provide the single best non-redundant and comprehensive collection of naturally occurring biological molecules, representing the central dogma. Nucleotide and protein sequences are explicitly linked on a residue-by-residue basis in this collection. Ideally all molecule types will be available for each well-studied organism, but the initial database collection pragmatically includes only those molecules and organisms that are most readily identified. Thus different amounts of information are available for different organisms at any given time. Furthermore, for some organisms additional intermediate records are provided when the genome sequence is not yet finished. The collection is supplied by NCBI through three distinct pipelines in addition to collaborations with community groups. The collection is curated on an ongoing basis. Additional information about the NCBI RefSeq project is available at http://www.ncbi.nih.gov/RefSeq/.

BACKGROUND

RefSeq is unique in providing a large multi-species curated sequence database that explicitly links chromosomal, transcript and protein information; this establishes a critical baseline for integrating sequence, genetic, expression and functional information into a single consistent framework. All RefSeq records include attribution to the original sequence data. When a molecule is represented by multiple GenBank sequences, an effort is made to select the ‘best’ sequence to instantiate as a RefSeq. The goal is to avoid mutations, sequencing errors and cloning artifacts. Should a RefSeq be identified with a problem of this type, it is corrected. Sequences are computationally validated to confirm that the genomic sequence corresponding to an annotated mRNA feature does match the mRNA sequence record and that coding region features really can be translated into the corresponding protein sequence.

RefSeq offers a significant advantage for gene characterization, database searching and sequence identification, whether by BLAST, text or accession queries, or inclusion in a local customized database. The RefSeq collection may include alternatively spliced transcripts that share some identical exons, or identical proteins expressed from these alternatively spliced transcripts, or close paralogs or homologs. It has the advantage of a representing an objective and experimentally verifiable definition of ‘non-redundant’ in providing one example of each natural biomolecule per organism.

Thus the RefSeq collection establishes a consistent baseline and clear model of the central dogma. The RefSeq collection supports:

Facile identification of the sequence standard for a genome, transcript, or protein
Genome annotation
Comparative genomics
Reduction of redundancy in clustering approaches
Analysis and comparison
Unambiguous association of functional information
Navigation to additional sources of information

Distinct processing pipelines are used to provide the RefSeq collection:

Computed annotation pipeline
Entrez genomes pipeline
LocusLink supported pipeline
Collaboration

These pipelines are overlapping and the RefSeq dataset for some genomes may be provided by more than one pipeline.

Computed Annotation Pipeline
The Computed Annotation Pipeline relies on automated computation to provide scaffolds, transcripts and proteins. Transcript and protein records may represent an ab initio prediction with varying levels of transcript or protein homology support, or they may be fully supported by GenBank transcript data. These records are not subject to incremental individual updates or to direct curation. They are sometimes more predicted in nature and represent an approach that complements manual curation in providing comprehensive genome annotation.

Entrez Genomes Pipeline
NCBI's Entrez Genomes database represents a collection of complete, or nearly complete, genomes and chromosomes (2). It is divided into six large taxonomic groups: Archaea, Eubacteria, Eukaryotae, Viroids, Viruses and Plasmids. Entrez Genomes RefSeq records include genomic, transcript and protein records and are provided by collaboration, in-house automatic processing and in-house curation. The Entrez Genomes web site includes custom displays, analysis and tools for some genomes.

In general, these RefSeq records undergo an initial automated validation step before being released. The resulting record is a copy of a GenBank entry, but validation may make some corrections and provides more consistent feature annotation. Records provided via collaboration have a status of ‘Reviewed’ and attribution to the collaborating group. Records provided by in-house processing have a ‘Provisional’ or ‘Reviewed’ status. This pipeline has provided RefSeq records for over 1780 distinct organisms and makes significant contribution to the NCBI RefSeq collection (Fig. 1).



View larger version (16K):
[in this window]
[in a new window]
 
Figure 1. Number of RefSeq proteins contributed by the three source pipelines (as of August, 2002).

 
LocusLink pipeline—new features and current statistics
The Entrez Genomes and LocusLink supported pipelines are similar in that they both make use of automatic computation to validate records and are also subject to ongoing in-house curation of individual records. Both of these pipelines welcome opportunities to collaborate. These two pipelines differ in infrastructure, targeted scope and processing details. The curated RefSeq set is used as one of several reagents in the annotation pipeline; the human RefSeq collection is targeted for focused in-house curation.

Molecule types
The collection has been expanded to include non-coding transcripts such as those derived from structural RNA genes and transcribed pseudogenes. See Table 1 for a complete listing of the available molecule types, corresponding accession prefix and pipeline source.


View this table:
[in this window]
[in a new window]
 
Table 1. RefSeq accession prefixes, molecule types, originating pipeline and annotated status categories
 
Status categories
RefSeq records include annotation that provides a general indication of their curation status and reliability. Records generated via the automated annotation pipeline are annotated as ‘Model RefSeq’ whereas those contributed by the curation pipelines may be annotated as provisional (not yet reviewed), predicted (transcript or protein is not fully supported), or reviewed. Reviewed records are the most highly curated and effort has been made to ensure the quality and comprehensive coverage of the sequence itself as well as to apply descriptive information that leads the user to functionally relevant data (publications, names, summaries). RefSeq records annotated as predicted do have some level of support; they do not merely instantiate ab initio predictions (see Table 1).

Organisms
The LocusLink pipeline now provides RefSeq records for Danio rerio (zebrafish) and Drosophila melanogaster in addition to records for human, mouse and rat. Gene-to-sequence associations for human, mouse and rat are curated both in-house and in collaboration with the Human Gene Nomenclature Committee (3), OMIM (4), the Mouse Genome Informatics group (5), and the Rat Genome Database (6). Records for Drosophila are provided in collaboration with FlyBase (7). Collaboration with the zebrafish community ZFIN database (via LocusLink) supports RefSeq annotation (8).

Growth
The LocusLink pipeline provided over 14 000 new RefSeq records between July 2001 and July 2002 (Fig. 2). Much of this growth was realized by the addition of zebrafish and Drosophila records (692 and 7243 respectively). The annual growth trend by organism and by new accessions added is presented in Figure 2A and B; the annual growth trend of the manually curated records (with annotated status of Reviewed) is presented in Figure 2C.






View larger version (75K):
[in this window]
[in a new window]
 
Figure 2. Annual growth trends for the LocusLink supported RefSeq collection. (A) The number of RefSeq records (NM accessions) publicly available on July 1 of 1999 through 2002 for human (Hs), mouse (Mm), rat (Rn), zebrafish (Dr) and fly (Dm). These numbers include genes for which multiple reference sequences are provided to represent splice variants and their products. As of July 2002, 37919 RefSeqs were available; this includes 16142 human, 10004 mouse, 3838 rat, 692 zebrafish and 7243 Drosophila RefSeq records. (B) New growth. The number of new accessions (NM) added to the collection per 12 month window for the sum of human, mouse, rat, zebrafish and fly. (C) Distribution of human RefSeq records in different status categories over the same time window.

 
Manual curation improves the RefSeq collection in several ways including:

Extension of the transcript
Correction of sequence artifacts (vector, linker sequence)
Provision of additional records representing alternative splice products
Functional (descriptive) annotation of the nucleotide and protein record
Representing the correct gene name-to-sequence association

Alternate transcript records are provided for over 1368 genes, of these the vast majority (930) instantiate a single additional transcript variant (see Fig. 3). Two loci have instantiated a larger number of alternate splice forms; 18 variants are provided for the DMD gene, in collaboration with Dr den Dunnen and 19 variants are provided for the collagen gene COL13A1.



View larger version (13K):
[in this window]
[in a new window]
 
Figure 3. Number of splice variant records provided per locus. The manual curation effort instantiates additional splice variants as a new accession number when the CDS is full-length and when there is some level of support either in available sequence data or in the literature for the existence of the new splice form. Because of these conservative criteria, the RefSeq collection under-represents the number of alternate splice forms that may occur. Additional records are provided primarily through the manual curation process, however, a smaller number are made available prior to the full review process following preliminary sequence review.

 

REFERENCES

  1. Pruitt,K.D., Katz,K.S., Sicotte,H. and Maglott,D.R. (2000) Introducing RefSeq and LocusLink: curated human genome resources at the NCBI. Trends Genet., 16, 44–47.[CrossRef][Web of Science][Medline]

  2. Tatusova,T.A., Karsch-Mizrachi,I. and Ostell,J.A. (1999) Complete genomes in WWW Entrez: data representation and analysis. Bioinformatics, 15, 536–543.[Abstract/Free Full Text]

  3. White,J.A., McAlpine,P.J., Antonarakis,S., Cann,H., Eppig,J.T., Frazer,K., Frezal,J., Lancet,D., Nahmias,J., Pearson,P., Peters,J., Scott,A., Scott,H., Spurr,N., Talbot,C. Jr and Povey,S. (1997) Guidelines for human gene nomenclature (1997). HUGO Nomenclature Committee. Genomics, 45, 468–471.[CrossRef][Web of Science][Medline]

  4. Hamosh,A., Scott,A.F., Amberger,J., Valle.D. and McKusick,V.A. (2000) Online Mendelian Inheritance in Man (OMIM). Hum. Mutat., 15, 57–61.[CrossRef][Web of Science][Medline]

  5. Blake,J.A., Eppig,J.T., Richardson,J.E. and Davisson,M.T. (2000) The Mouse Genome Database (MGD): expanding genetic and genomic resources for the laboratory mouse. The Mouse Genome Database Group. Nucleic Acids Res., 28, 108–111.[Abstract/Free Full Text]

  6. Twigger,S., Lu,J., Shimoyama,M., Chen,D., Pasko,D., Long,H., Ginster.J., Chen,C.F., Nigam,R., Kwitek,A., Eppig,J., Maltais,L., Maglott,D., Schuler,G., Jacob,H. and Tonellato,P.J. (2002) Rat Genome Database (RGD): mapping disease onto the genome. Nucleic Acids Res., 30, 125–128.[Abstract/Free Full Text]

  7. FlyBase Consortium (1999) The FlyBase database of the Drosophila Genome Projects and community literature. The FlyBase Consortium. Nucleic Acids Res., 27, 85–88.[Abstract/Free Full Text]

  8. Westerfield,M., Doerry,E., Kirkpatrick,A.E. and Douglas,S.A. (1999) Zebrafish informatics and the ZFIN database. Methods Cell Biol., 60, 339–355.[Web of Science][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Neuro OncologyHome page
H. Colman, L. Zhang, E. P. Sulman, J. M. McDonald, N. L. Shooshtari, A. Rivera, S. Popoff, C. L. Nutt, D. N. Louis, J. G. Cairncross, et al.
A multigene predictor of outcome in glioblastoma
Neuro Oncology, October 20, 2009; (2009) nop007v1.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. M. Becker, R. L. Graham, E. F. Donaldson, B. Rockx, A. C. Sims, T. Sheahan, R. J. Pickles, D. Corti, R. E. Johnston, R. S. Baric, et al.
Synthetic recombinant bat SARS-like coronavirus is infectious in cultured cells and in mice
PNAS, December 16, 2008; 105(50): 19944 - 19949.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Zhang, Y. Ji, and L. Zhang
Extracting three-way gene interactions from microarray data
Bioinformatics, November 1, 2007; 23(21): 2903 - 2909.
[Abstract] [Full Text] [PDF]


Home page
DNA ResHome page
D. Gherbassi, L. Bhatt, S. Thuret, and H. H. Simon
Merging Mouse Transcriptome Analyses with Parkinson's Disease Linkage Studies
DNA Res, May 23, 2007; (2007) dsm007v1.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
B. J. Frey and D. Dueck
Clustering by Passing Messages Between Data Points
Science, February 16, 2007; 315(5814): 972 - 976.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. G. Jegga, J. Chen, S. Gowrisankar, M. A. Deshmukh, R. Gudivada, S. Kong, V. Kaimal, and B. J. Aronow
GenomeTrafac: a whole genome resource for the detection of transcription factor binding site clusters associated with conventional and microRNA encoding genes conserved between mouse and human gene orthologs
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D116 - D121.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
M. Semon and L. Duret
Evolutionary Origin and Maintenance of Coexpressed Gene Clusters in Mammals
Mol. Biol. Evol., September 1, 2006; 23(9): 1715 - 1723.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. S. Siddiqui, A. D. Delaney, A. Schnerch, O. L. Griffith, S. J. M. Jones, and M. A. Marra
Sequence biases in large scale gene expression profiling data
Nucleic Acids Res., July 13, 2006; 34(12): e83 - e83.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
S. Yamazaki, J. Yamazaki, K. Nishijima, R. Otsuka, M. Mise, H. Ishikawa, K. Sasaki, S.-i. Tago, and K. Isono
Proteome Analysis of an Aerobic Hyperthermophilic Crenarchaeon, Aeropyrum pernix K1
Mol. Cell. Proteomics, May 1, 2006; 5(5): 811 - 823.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
L. Florea
Bioinformatics of alternative splicing and its regulation
Brief Bioinform, March 1, 2006; 7(1): 55 - 69.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. Cui, N. Veeraraghavan, A. Richter, K. Wall, R. K. Jansen, J. Leebens-Mack, I. Makalowska, and C. W. dePamphilis
ChloroplastDB: the Chloroplast Genome Database
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D692 - D696.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
A. S. Siddiqui, J. Khattra, A. D. Delaney, Y. Zhao, C. Astell, J. Asano, R. Babakaiff, S. Barber, J. Beland, S. Bohacec, et al.
A mouse atlas of gene expression: Large-scale digital gene-expression profiles from precisely defined developing C57BL/6J mouse tissues and cells
PNAS, December 20, 2005; 102(51): 18485 - 18490.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
M. Maiorino, A. Roveri, L. Benazzi, V. Bosello, P. Mauri, S. Toppo, S. C. E. Tosatto, and F. Ursini
Functional Interaction of Phospholipid Hydroperoxide Glutathione Peroxidase with Sperm Mitochondrion-associated Cysteine-rich Protein Discloses the Adjacent Cysteine Motif as a New Substrate of the Selenoperoxidase
J. Biol. Chem., November 18, 2005; 280(46): 38395 - 38402.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
N. Osada, M. Hirata, R. Tanuma, J. Kusuda, M. Hida, Y. Suzuki, S. Sugano, T. Gojobori, C.-K. J. Shen, C.-I Wu, et al.
Substitution Rate and Structural Divergence of 5'UTR Evolution: Comparative Analysis Between Human and Cynomolgus Monkey cDNAs
Mol. Biol. Evol., October 1, 2005; 22(10): 1976 - 1982.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
A. Siepel, G. Bejerano, J. S. Pedersen, A. S. Hinrichs, M. Hou, K. Rosenbloom, H. Clawson, J. Spieth, L. W. Hillier, S. Richards, et al.
Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes
Genome Res., August 1, 2005; 15(8): 1034 - 1050.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
T. H. Kim, L. O. Barrera, C. Qu, S. Van Calcar, N. D. Trinklein, S. J. Cooper, R. M. Luna, C. K. Glass, M. G. Rosenfeld, R. M. Myers, et al.
Direct isolation and identification of promoters in the human genome
Genome Res., June 1, 2005; 15(6): 830 - 839.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
T.-H. Kim, Y.-J. Jeon, W.-Y. Kim, and H.-S. Kim
HESAS: HERVs Expression and Structure Analysis System
Bioinformatics, April 15, 2005; 21(8): 1699 - 1700.
[Abstract] [Full Text] [PDF]


Home page
J. Gen. Virol.Home page
F. Ferron, C. Rancurel, S. Longhi, C. Cambillau, B. Henrissat, and B. Canard
VaZyMolO: a tool to define and classify modularity in viral proteins
J. Gen. Virol., March 1, 2005; 86(3): 743 - 749.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Harbig, R. Sprinkle, and S. A. Enkemann
A sequence-based identification of the genes detected by probesets on the Affymetrix U133 plus 2.0 array
Nucleic Acids Res., February 18, 2005; 33(3): e31 - e31.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
C. B. Do, M. S.P. Mahabhashyam, M. Brudno, and S. Batzoglou
ProbCons: Probabilistic consistency-based multiple sequence alignment
Genome Res., February 1, 2005; 15(2): 330 - 340.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
M. Semon, D. Mouchiroud, and L. Duret
Relationship between gene expression and GC-content in mammals: statistical significance and biological relevance
Hum. Mol. Genet., February 1, 2005; 14(3): 421 - 427.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. E. Vinogradov
Noncoding DNA, isochores and gene expression: nucleosome formation potential
Nucleic Acids Res., January 26, 2005; 33(2): 559 - 563.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
C. ZWIEB, R. W. VAN NUES, M. A. ROSENBLAD, J. D. BROWN, and T. SAMUELSSON
A nomenclature for all signal recognition particle RNAs
RNA, January 1, 2005; 11(1): 7 - 13.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
T. P. V. Manh, M. Mokrane, E. Georgenthum, J. Flavigny, L. Carrier, M. Semeriva, M. Piovant, and L. Roder
Expression of cardiac myosin-binding protein-C (cMyBP-C) in Drosophila as a model for the study of human cardiomyopathies
Hum. Mol. Genet., January 1, 2005; 14(1): 7 - 17.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. Barta, E. Sebestyen, T. B. Palfy, G. Toth, C. P. Ortutay, and L. Patthy
DoOP: Databases of Orthologous Promoters, collections of clusters of orthologous upstream sequences from chordates and plants
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D86 - D90.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
J. F. Abril, R. Castelo, and R. Guigo
Comparison of splice sites in mammals and chicken
Genome Res., January 1, 2005; 15(1): 111 - 119.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. M. Chalk, R. E. Warfinge, P. Georgii-Hemming, and E. L. L. Sonnhammer
siRNAdb: a database of siRNA sequences
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D131 - D134.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
F. Mignone, G. Grillo, F. Licciulli, M. Iacono, S. Liuni, P. J. Kersey, J. Duarte, C. Saccone, and G. Pesole
UTRdb and UTRsite: a collection of sequences and regulatory motifs of the untranslated regions of eukaryotic mRNAs
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D141 - D146.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Kersey, L. Bower, L. Morris, A. Horne, R. Petryszak, C. Kanz, A. Kanapin, U. Das, K. Michoud, I. Phan, et al.
Integr8 and Genome Reviews: integrated views of complete genomes and proteomes
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D297 - D302.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. Hubbard, D. Andrews, M. Caccamo, G. Cameron, Y. Chen, M. Clamp, L. Clarke, G. Coates, T. Cox, F. Cunningham, et al.
Ensembl 2005
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D447 - D453.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
The ENCODE Project Consortium
The ENCODE (ENCyclopedia Of DNA Elements) Project
Science, October 22, 2004; 306(5696): 636 - 640.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
T. S. Furey, M. Diekhans, Y. Lu, T. A. Graves, L. Oddy, J. Randall-Maher, L. W. Hillier, R. K. Wilson, and D. Haussler
Analysis of Human mRNAs With the Reference Genome Sequence Reveals Potential Errors, Polymorphisms, and RNA Editing
Genome Res., October 1, 2004; 14(10b): 2034 - 2040.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
The MGC Project Team
The Status, Quality, and Expansion of the NIH Full-Length cDNA Project: The Mammalian Gene Collection (MGC)
Genome Res., October 1, 2004; 14(10b): 2121 - 2127.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
P. N. Robinson, U. Bohme, R. Lopez, S. Mundlos, and P. Nurnberg
Gene-Ontology analysis reveals association of tissue-specific 5' CpG-island genes with development and embryogenesis
Hum. Mol. Genet., September 1, 2004; 13(17): 1969 - 1978.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
B. H. Mecham, D. Z. Wetmore, Z. Szallasi, Y. Sadovsky, I. Kohane, and T. J. Mariani
Increased measurement accuracy for sequence-verified microarray probes
Physiol Genomics, August 11, 2004; 18(3): 308 - 315.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
Y. Bao, S. Federhen, D. Leipe, V. Pham, S. Resenchuk, M. Rozanov, R. Tatusov, and T. Tatusova
National Center for Biotechnology Information Viral Genomes Project
J. Virol., July 15, 2004; 78(14): 7291 - 7298.
[Full Text] [PDF]


Home page
Nucleic Acids ResHome page
N. Kim, S. Shin, and S. Lee
ASmodeler: gene modeling of alternative splicing from genomic alignment of mRNA, EST and protein sequences
Nucleic Acids Res., July 1, 2004; 32(suppl_2): W181 - W186.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
Y. Han, K. Lassen, D. Monie, A. R. Sedaghat, S. Shimoji, X. Liu, T. C. Pierson, J. B. Margolick, R. F. Siliciano, and J. D. Siliciano
Resting CD4+ T Cells from Human Immunodeficiency Virus Type 1 (HIV-1)-Infected Individuals Carry Integrated HIV-1 Genomes within Actively Transcribed Host Genes
J. Virol., June 15, 2004; 78(12): 6122 - 6133.
[Abstract] [Full Text] [PDF]


Home page
Genes Dev.Home page
M. Kiriakidou, P. T. Nelson, A. Kouranov, P. Fitziev, C. Bouyioukos, Z. Mourelatos, and A. Hatzigeorgiou
A combined computational-experimental approach predicts human microRNA targets
Genes & Dev., May 15, 2004; 18(10): 1165 - 1178.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. C. Edgar
MUSCLE: multiple sequence alignment with high accuracy and high throughput
Nucleic Acids Res., March 19, 2004; 32(5): 1792 - 1797.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. W. Klee, D. F. Carlson, S. C. Fahrenkrug, S. C. Ekker, and L. B. M. Ellis
Identifying secretomes in people, pufferfish and pigs
Nucleic Acids Res., February 27, 2004; 32(4): 1414 - 1421.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. L. Wheeler, D. M. Church, R. Edgar, S. Federhen, W. Helmberg, T. L. Madden, J. U. Pontius, G. D. Schuler, L. M. Schriml, E. Sequeira, et al.
Database resources of the National Center for Biotechnology Information: update
Nucleic Acids Res., January 1, 2004; 32(90001): D35 - 40.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. D. Schmid, V. Praz, M. Delorenzi, R. Perier, and P. Bucher
The Eukaryotic Promoter Database EPD: the impact of in silico primer extension
Nucleic Acids Res., January 1, 2004; 32(90001): D82 - 85.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Divina and J. Forejt
The Mouse SAGE Site: database of public mouse SAGE libraries
Nucleic Acids Res., January 1, 2004; 32(90001): D482 - 483.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Akagi, T. Suzuki, R. M. Stephens, N. A. Jenkins, and N. G. Copeland
RTCGD: retroviral tagged cancer gene database
Nucleic Acids Res., January 1, 2004; 32(90001): D523 - 527.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
T. Ruggiero, M. Olivero, A. Follenzi, L. Naldini, R. Calogero, and M. F. Di Renzo
Deletion in a (T)8 microsatellite abrogates expression regulation by 3'-UTR
Nucleic Acids Res., November 15, 2003; 31(22): 6561 - 6569.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. K. Chanda, S. White, A. P. Orth, R. Reisdorph, L. Miraglia, R. S. Thomas, P. DeJesus, D. E. Mason, Q. Huang, R. Vega, et al.
From the Cover: Genome-scale functional profiling of the mammalian AP-1 signaling pathway
PNAS, October 14, 2003; 100(21): 12153 - 12158.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
W. J. Kent, R. Baertsch, A. Hinrichs, W. Miller, and D. Haussler
Evolution's cauldron: Duplication, deletion, and rearrangement in the mouse and human genomes
PNAS, September 30, 2003; 100(20): 11484 - 11489.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. S. Halees, D. Leyfer, and Z. Weng
PromoSer: a large-scale mammalian promoter and transcription start site identification service
Nucleic Acids Res., July 1, 2003; 31(13): 3554 - 3559.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. L. Wheeler, D. M. Church, S. Federhen, A. E. Lash, T. L. Madden, J. U. Pontius, G. D. Schuler, L. M. Schriml, E. Sequeira, T. A. Tatusova, et al.
Database resources of the National Center for Biotechnology
Nucleic Acids Res., January 1, 2003; 31(1): 28 - 33.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (321K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (105)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Pruitt, K. D.
Right arrow Articles by Maglott, D. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Pruitt, K. D.
Right arrow Articles by Maglott, D. R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?