Skip Navigation

This Article
Right arrow Abstract Freely available
Right arrow Print PDF (107K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (498)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Pruitt, K. D.
Right arrow Articles by Maglott, D. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Pruitt, K. D.
Right arrow Articles by Maglott, D. R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Nucleic Acids Research, 2001, Vol. 29, No. 1 137-140
© 2001 Oxford University Press

RefSeq and LocusLink: NCBI gene-centered resources

Kim D. Pruitt* and Donna R. Maglott

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A Room 6N605, 8600 Rockville Pike, Bethesda, MD 20894 USA

Received October 2, 2000; Accepted October 4, 2000.


    ABSTRACT
 TOP
 ABSTRACT
 BACKGROUND
 EXPANDED AND NEW FEATURES
 GROWTH AND MAINTENANCE
 REFERENCES
 
Thousands of genes have been painstakingly identified and characterized a few genes at a time. Many thousands more are being predicted by large scale cDNA and genomic sequencing projects, with levels of evidence ranging from supporting mRNA sequence and comparative genomics to computing ab initio models. This, coupled with the burgeoning scientific literature, makes it critical to have a comprehensive directory for genes and reference sequences for key genomes. The NCBI provides two resources, LocusLink and RefSeq, to meet these needs. LocusLink organizes information around genes to generate a central hub for accessing gene-specific information for fruit fly, human, mouse, rat and zebrafish. RefSeq provides reference sequence standards for genomes, transcripts and proteins; human, mouse and rat mRNA RefSeqs, and their corresponding proteins, are discussed here. Together, RefSeq and LocusLink provide a non-redundant view of genes and other loci to support research on genes and gene families, variation, gene expression and genome annotation. Additional information about LocusLink and RefSeq is available at http://www.ncbi.nlm.nih.gov/LocusLink/.


    BACKGROUND
 TOP
 ABSTRACT
 BACKGROUND
 EXPANDED AND NEW FEATURES
 GROWTH AND MAINTENANCE
 REFERENCES
 
LocusLink maintains descriptive information about loci including nomenclature, database identifiers (ID), disease associations, map positions and sequence accessions. These associations are then used to compute connections to other NCBI resources, to both facilitate navigation and enhance opportunities for new discoveries. LocusLink data are continuously augmented and reviewed through the combined efforts of NCBI staff and successful collaborations with the Human Gene Nomenclature Committee (HGNC) (1), Online Mendelian Inheritance in Man (OMIM) (2), Mouse Genome Database (MGD) (3), Rat Genome Database (RGD), FlyBase (4) and Zebrafish (ZFIN) (5) groups. This compilation is freely distributed via the LocusLink web site and by FTP. LocusLink queries can be based on text terms (such as a protein or disease name), gene symbols, sequence accessions and database IDs (e.g., MIM or EC numbers). More complex operations such as wild cards (e.g., ABC*), field restriction (e.g., 123456[mim]) and booleans (e.g., ABC* AND 17[chr]) are also supported.

The NCBI reference sequences (RefSeq) provide standards for complete genomic nucleic acids, assembled contigs, transcripts and proteins (see Table 1). RefSeq records are derived from GenBank and the literature to provide a non-redundant set of sequences that facilitate sequence identification and information retrieval (6). Human, mouse and rat RefSeq mRNAs, and their corresponding proteins, are the focus of this discussion; these sequences can be accessed through the NCBI Entrez retrieval system (7), BLAST (8), FTP and LocusLink.


View this table:
[in this window]
[in a new window]
 
Table 1. RefSeq accessions, sequence type, processing method and categories
 

    EXPANDED AND NEW FEATURES
 TOP
 ABSTRACT
 BACKGROUND
 EXPANDED AND NEW FEATURES
 GROWTH AND MAINTENANCE
 REFERENCES
 
LocusLink
The locus-to-sequence association maintained in LocusLink is used to provide reciprocal connections among LocusLink, RefSeq and several content-rich resources at NCBI including the Conserved Domain Database (CDD), GenBank (9), Genes and Disease, HomoloGene, Map Viewer, OMIM, PubMed, dbSNP (10), dbSTS and UniGene (11) (Table 2). In addition, nucleotide and protein sequences and OMIM records are linked to LocusLink reports via the NCBI LinkOut service. Following ‘LinkOut’ at the upper right side of an Entrez record is a convenient method to access a wealth of information about that gene. Links between LocusLink and Genes and Disease, HomoloGene, RefSeq, dbSNP and UniGene are based on shared identifiers stored by these resources (e.g., MIM numbers or GenBank accessions). Links to dbSTS and CDD are computed based on electronic PCR (ePCR) (12) and BLAST, respectively. And links to the NCBI Map Viewer are based on sequence identity, cytogenetic location or genetic map data. URLs to these resources are provided in Table 3.


View this table:
[in this window]
[in a new window]
 
Table 2. LocusLink: distributed data and links to additional information
 

View this table:
[in this window]
[in a new window]
 
Table 3. URLs for NCBI resources
 
Several organisms and features have been added to the LocusLink web site over the past year and are listed below. LocusLink provides links to each resource whenever additional information is available.

Conserved domains: CDD domains annotated on RefSeq proteins are summarized in the product section of LocusLink reports. This section also includes the protein name, alternate names and enzyme commission (EC) numbers.

GeneRIF: LocusLink now supports community-generated functional annotation. LocusLink reports display submitted GeneRIFs (Gene References Into Function) and include a link to the submission form. Submissions consist of a publication (PubMed ID) and brief comment summarizing the functional information supported by the publication.

HomoloGene: HomoloGene reports UniGene clusters that are either calculated to be homologous, based on sequence identity, or reported to be homologous. HomoloGene also reports the ‘best’ homologous sequence pairs. HomoloGene includes all genomes in UniGene (human, mouse, rat and zebrafish), as well as Drosophila melanogaster mRNAs.

Maps: LocusLink now provides links to the Map Viewer and the Human/Mouse Homology map, as appropriate. The Map Viewer presents graphical views of various maps at the whole genome and single chromosome levels. This resource is currently available for D.melanogaster, Homo sapiens and Mus musculus. Views include presentation of genes, STS markers, SNPs and disease phenotypes along the chromosomes. The Human/Mouse Homology maps display genes, thought to be orthologous, ordered by MGD’s genetic map and the human sequence map.

Organisms: LocusLink now includes fruit fly, human, mouse, rat and zebrafish genes. The query interface supports retrieving results for all species, restricting to a single organism and displaying query results as a ‘Brief’ or expanded ‘Summary’ display.

Reference sequences: The RefSeq section of the LocusLink report has been expanded to include mitochondrial records, genomic contigs and modeled transcripts and proteins produced by the NCBI Human Genome Annotation effort. RefSeq data for a locus may include curated RefSeq records representing known transcripts, a genomic contig and modeled transcripts that overlap, but are distinct from, the known transcripts. Table 1 indicates the different accession number formats and categories they represent.

RefSeq
RefSeq transcript and corresponding protein records are now produced through two independent processes. The first process, which relies heavily on manual curation, provides a non-redundant dataset of transcripts and proteins of known genes for human, mouse and rat; these ‘known’ genes have some supporting evidence for the existence of the gene although the protein function may not be clear. The second process is NCBI’s Human Genome Annotation project, which will be described in detail elsewhere. Briefly, working draft and finished human genome sequence is assembled into contigs (13); gene predictions methods are used to produce the modeled mRNA and protein RefSeq records (see Table 1). The modeled transcripts and proteins may have different degrees of mRNA or EST evidence, or may be predicted ab initio.

RefSeq records include links to LocusLink where additional information is often available. The organisms and features added to RefSeq records over the past year are summarized below.

Category: RefSeq mRNA and protein records are now provided in four categories (previously termed ‘Status’ in LocusLink reports): (i) genome annotation, (ii) predicted, (iii) provisional and (iv) reviewed. The genome annotation category includes contigs, modeled mRNAs and corresponding modeled proteins. Predicted records represent genes of unknown function that are supported by mRNA sequence generated in a full-length insert cDNA sequencing project, homology or ESTs; the location of the protein product may be predicted in these records. Provisional and reviewed RefSeq records largely represent genes with known or inferred function. Provisional RefSeqs are not yet reviewed, whereas reviewed RefSeq records have been individually reviewed by NCBI staff and are richly annotated. Reviewed records include publications, a gene description and annotation that may not be on the provisional RefSeq or original GenBank records. Furthermore the sequence may be modified, relative to the source GenBank sequence, to extend the UTR, provide a transcript variant or remove contaminating vector or linker sequence. By including sequence data derived from more than one GenBank record and from the literature, reviewed RefSeq records may provide a full-length mRNA sequence that is not available in a single GenBank record.

Conserved domains: BLAST analysis is used to compare RefSeq proteins to the NCBI CDD; significant domain matches are annotated on the protein record and are linked to the CDD resource where additional information is available. Reviewed RefSeq proteins may include additional domain annotation derived from the literature.

Links: RefSeq records include links that facilitate access to additional information. Links are provided to: (i) the record(s) from which the mRNA (and hence protein) was derived, (ii) LocusLink, (iii) OMIM, (iv) MGD, (v) RGD and (vi) CDD, as appropriate. In addition, an overview of sequences that share sequence identity is available through the ‘Related sequences’ link at the top right of RefSeq (and GenBank) records.

Organisms: The mRNA/protein RefSeq resource now provides sequence standards for human, mouse and rat. Review of all three organisms to verify the initial ‘gene name-to-sequence’ association and to provide highly annotated reviewed RefSeq records is ongoing.

Variation: Variation features are automatically added to Genome Annotation RefSeq records using the data available in dbSNP. This expanded annotation will also be added to RefSeq records representing known genes.


    GROWTH AND MAINTENANCE
 TOP
 ABSTRACT
 BACKGROUND
 EXPANDED AND NEW FEATURES
 GROWTH AND MAINTENANCE
 REFERENCES
 
The information available in LocusLink and RefSeq is continually being reviewed and augmented. As illustrated in Figure 1, the number of RefSeq records tripled, and LocusLink expanded ~5-fold over a 1-year period. New or updated RefSeq records are made publicly available on a daily basis. The LocusLink web site is updated weekly; a subset of data on the FTP site updated daily and the remainder is updated weekly.




View larger version (50K):
[in this window]
[in a new window]
 
Figure 1. RefSeq and LocusLink growth over a 1-year period. (A) The number of RefSeq records for human, mouse and rat for mid-September of 1999 and 2000. These numbers include genes for which multiple reference sequences are provided to represent splice variants and their products. As of September 2000, 10 301 human reference sequence mRNAs (and their corresponding proteins) are provided for 9954 genes. The number of human RefSeq records in three categories is also indicated (Hs, human); Genome Annotation reference sequence numbers are not included. Rat, mouse and predicted human RefSeq records became available after September 1999. (B) Loci available in LocusLink for September of 1999 and 2000. Mouse, rat, zebrafish and fruit fly genomes were added after September 1999. Of the 17 214 known human loci, 14 103 have some associated sequence data (including ESTs), and 9954 loci have at least one mRNA and protein reference sequence.

 


    FOOTNOTES
 
* To whom correspondence should be addressed. Tel: +1 301 435 5898; Fax: +1 301 480 2290; Email: pruitt{at}ncbi.nlm.nih.gov Back


    REFERENCES
 TOP
 ABSTRACT
 BACKGROUND
 EXPANDED AND NEW FEATURES
 GROWTH AND MAINTENANCE
 REFERENCES
 

    1 White,J.A., McAlpine,P.J., Antonarakis,S., Cann,H., Eppig,J.T., Frazer,K., Frezal,J., Lancet,D., Nahmias,J., Pearson,P., Peters,J., Scott,A., Scott,H., Spurr,N., Talbot,C.,Jr. and Povey,S. (1997) Guidelines for human gene nomenclature (1997). HUGO Nomenclature Committee. Genomics, 45, 468–471.[Web of Science][Medline]

    2 Hamosh,A., Scott,A.F., Amberger,J., Valle,D. and McKusick,V.A. (2000) Online Mendelian Inheritance in Man (OMIM). Hum. Mutat., 15, 57–61.[Web of Science][Medline]

    3 Blake,J.A., Eppig,J.T., Richardson,J.E. and Davisson,M.T. (2000) The Mouse Genome Database (MGD): expanding genetic and genomic resources for the laboratory mouse. The Mouse Genome Database Group. Nucleic Acids Res., 28, 108–111. Updated article in this issue: Nucleic Acids Res. (2001), 29, 91–94.[Abstract/Free Full Text]

    4 FlyBase Consortium (1999) The FlyBase database of the Drosophila Genome Projects and community literature. The FlyBase Consortium. Nucleic Acids Res., 27, 85–88.[Abstract/Free Full Text]

    5 Westerfield,M., Doerry,E., Kirkpatric,A.E. and Douglas,S.A. (1999) Zebrafish informatics and the ZFIN database. Methods Cell Biol., 60, 339–355.[Web of Science][Medline]

    6 Pruitt,K.D., Katz,K.S., Sicotte,H. and Maglott,D.R. (2000) Introducing RefSeq and LocusLink: curated human genome resources at the NCBI. Trends Genet., 16, 44–47.[Web of Science][Medline]

    7 Schuler,G.D., Epstein,J.A., Ohkawa,H. and Kans,J.A. (1996) Entrez: molecular biology database and retrieval system. Methods Enzymol., 266, 141–162.[Web of Science][Medline]

    8 Altschul,S.F., Gish,W., Miller,W., Myers,E.W. and Lipman,D.J. (1990) Basic local alignment search tool. J. Mol. Biol., 215, 403–410.[Web of Science][Medline]

    9 Benson,D.A., Karsch-Mizrachi,I., Lipman,D.J., Ostell,J., Rapp,B.A. and Wheeler,D.L. (2000) GenBank. Nucleic Acids Res., 28, 15–18.[Abstract/Free Full Text]

    10 Sherry,S.T., Ward,M. and Sirotkin,K. (1999) dbSNP-database for single nucleotide polymorphisms and other classes of minor genetic variation. Genome Res., 8, 677–679.

    11 Schuler,G.D. (1997) Pieces of the puzzle: expressed sequence tags and the catalog of human genes. J. Mol. Med., 75, 694–698.[Web of Science][Medline]

    12 Schuler,G.D. (1997) Sequence mapping by electronic PCR. Genome Res., 7, 541–550.[Abstract/Free Full Text]

    13 Jang,W., Chen,H.C., Sicotte,H. and Schuler,G.D. (1999) Making effective use of human genomic sequence data. Trends Genet., 15, 284–286.[Web of Science][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
K. D. Yokoyama, U. Ohler, and G. A. Wray
Measuring spatial preferences at fine-scale resolution identifies known and novel cis-regulatory element candidates and functional motif-pair relationships
Nucleic Acids Res., June 2, 2009; (2009) gkp423v2.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. A. Elias, A. Mukhopadhyay, M. P. Joachimiak, E. C. Drury, A. M. Redding, H.-C. B. Yen, M. W. Fields, T. C. Hazen, A. P. Arkin, J. D. Keasling, et al.
Expression profiling of hypothetical genes in Desulfovibrio vulgaris leads to improved functional annotation
Nucleic Acids Res., May 1, 2009; 37(9): 2926 - 2939.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. Batuwita and V. Palade
microPred: effective classification of pre-miRNAs for human miRNA gene prediction
Bioinformatics, April 15, 2009; 25(8): 989 - 995.
[Abstract] [Full Text] [PDF]


Home page
J R Soc InterfaceHome page
G. A Reeves, D. Talavera, and J. M Thornton
Genome and proteome annotation: organization, interpretation and integration
J R Soc Interface, February 6, 2009; 6(31): 129 - 147.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. Weigt, R. A. White, H. Szurmant, J. A. Hoch, and T. Hwa
Identification of direct residue contacts in protein-protein interaction by message passing
PNAS, January 6, 2009; 106(1): 67 - 72.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. J. Furney, B. Calvo, P. Larranaga, J. A. Lozano, and N. Lopez-Bigas
Prioritization of candidate cancer genes--an aid to oncogenomic studies
Nucleic Acids Res., October 1, 2008; 36(18): e115 - e115.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
E. A. Glazov, P. A. Cottee, W. C. Barris, R. J. Moore, B. P. Dalrymple, and M. L. Tizard
A microRNA catalog of the developing chicken embryo identified by a deep sequencing approach
Genome Res., June 1, 2008; 18(6): 957 - 964.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
W. Miller, K. Rosenbloom, R. C. Hardison, M. Hou, J. Taylor, B. Raney, R. Burhans, D. C. King, R. Baertsch, D. Blankenberg, et al.
28-Way vertebrate alignment and conservation track in the UCSC Genome Browser
Genome Res., December 1, 2007; 17(12): 1797 - 1808.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. Bianchetti, Y. Wu, E. Guerin, F. Plewniak, and O. Poch
SAGETTARIUS: a program to reduce the number of tags mapped to multiple transcripts and to plan SAGE sequencing stages
Nucleic Acids Res., September 25, 2007; 35(18): e122 - e122.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
J. J. McCarthy, J. L. Andrews, E. L. McDearmon, K. S. Campbell, B. K. Barber, B. H. Miller, J. R. Walker, J. B. Hogenesch, J. S. Takahashi, and K. A. Esser
Identification of the circadian transcriptome in adult mouse skeletal muscle
Physiol Genomics, September 11, 2007; 31(1): 86 - 95.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
Y. Takaoka, M. Ohta, A. Ito, K. Takamatsu, A. Sugano, K. Funakoshi, N. Takaoka, N. Sato, H. Yokozaki, N. Arizono, et al.
Electroacupuncture suppresses myostatin gene expression: cell proliferative reaction in mouse skeletal muscle
Physiol Genomics, July 18, 2007; 30(2): 102 - 110.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Uzun, C. M. Leslin, A. Abyzov, and V. Ilyin
Structure SNP (StSNP): a web server for mapping and modeling nsSNPs on protein structures with linkage to metabolic pathways
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W384 - W392.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. L. S. Ng and S. K. Mishra
De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures
Bioinformatics, June 1, 2007; 23(11): 1321 - 1330.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
G. J. Menger, G. C. Allen, N. Neuendorff, S.-S. Nahm, T. L. Thomas, V. M. Cassone, and D. J. Earnest
Circadian profiling of the transcriptome in NIH/3T3 fibroblasts: comparison with rhythmic gene expression in SCN2.2 cells and the rat SCN
Physiol Genomics, May 11, 2007; 29(3): 280 - 289.
[Abstract] [Full Text] [PDF]


Home page
Genes Dev.Home page
R. K. Gupta, N. Gao, R. K. Gorski, P. White, O. T. Hardy, K. Rafiq, J. E. Brestelli, G. Chen, C. J. Stoeckert Jr., and K. H. Kaestner
Expansion of adult beta-cell mass in response to increased metabolic demand is dependent on HNF-4{alpha}
Genes & Dev., April 1, 2007; 21(7): 756 - 769.
[Abstract] [Full Text] [PDF]


Home page
Molecular Cancer TherapeuticsHome page
U. T. Shankavaram, W. C. Reinhold, S. Nishizuka, S. Major, D. Morita, K. K. Chary, M. A. Reimers, U. Scherf, A. Kahn, D. Dolginow, et al.
Transcript and protein expression profiles of the NCI-60 cancer cell panel: an integromic microarray study
Mol. Cancer Ther., March 1, 2007; 6(3): 820 - 832.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
N. C. Duarte, S. A. Becker, N. Jamshidi, I. Thiele, M. L. Mo, T. D. Vo, R. Srivas, and B. O. Palsson
Global reconstruction of the human metabolic network based on genomic and bibliomic data
PNAS, February 6, 2007; 104(6): 1777 - 1782.
[Abstract] [Full Text] [PDF]


Home page
RNAHome page
S. NG Kwang Loong and S. K. Mishra
Unique folding of precursor microRNAs: Quantitative evidence and implications for de novo identification
RNA, February 1, 2007; 13(2): 170 - 187.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
B. Tian, Z. Pan, and J. Y. Lee
Widespread mRNA polyadenylation events in introns indicate dynamic interplay between polyadenylation and splicing
Genome Res., February 1, 2007; 17(2): 156 - 165.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Y. Zhang, J. Li, L. Kong, G. Gao, Q.-R. Liu, and L. Wei
NATsDB: Natural Antisense Transcripts DataBase
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D156 - D161.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Cheng, R. M. Miura, and B. Tian
Prediction of mRNA polyadenylation sites by support vector machine
Bioinformatics, October 1, 2006; 22(19): 2320 - 2325.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Yao, R. Charlab, and P. Li
Systematic identification of pseudogenes through whole genome expression evidence profiling
Nucleic Acids Res., September 11, 2006; 34(16): 4477 - 4485.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. D. Thomas, A. Kejariwal, N. Guo, H. Mi, M. J. Campbell, A. Muruganujan, and B. Lazareva-Ulitsky
Applications for protein sequence-function evolution data: mRNA/protein expression analysis and coding SNP scoring tools.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W645 - W650.
[Abstract] [Full Text] [PDF]


Home page
Journals of Gerontology Series A: Biological Sciences and Medical SciencesHome page
M. Cardelli, F. Marchegiani, L. Cavallone, F. Olivieri, S. Giovagnetti, E. Mugianesi, R. Moresi, R. Lisa, and C. Franceschi
A Polymorphism of the YTHDF2 Gene (1p35) Located in an Alu-Rich Genomic Domain Is Associated With Human Longevity.
J. Gerontol. A Biol. Sci. Med. Sci., June 1, 2006; 61(6): 547 - 556.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
N. Lopez-Bigas, B. J. Blencowe, and C. A. Ouzounis
Highly consistent patterns for inherited human diseases at the molecular level
Bioinformatics, February 1, 2006; 22(3): 269 - 277.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. Saxonov, P. Berg, and D. L. Brutlag
A genome-wide analysis of CpG dinucleotides in the human genome distinguishes two distinct classes of promoters
PNAS, January 31, 2006; 103(5): 1412 - 1417.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
A. S. Siddiqui, J. Khattra, A. D. Delaney, Y. Zhao, C. Astell, J. Asano, R. Babakaiff, S. Barber, J. Beland, S. Bohacec, et al.
A mouse atlas of gene expression: Large-scale digital gene-expression profiles from precisely defined developing C57BL/6J mouse tissues and cells
PNAS, December 20, 2005; 102(51): 18485 - 18490.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
E. Kim and Y. Kliger
Discovering hidden viral piracy
Bioinformatics, December 1, 2005; 21(23): 4216 - 4222.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
F. L. Poole II, B. A. Gerwe, R. C. Hopkins, G. J. Schut, M. V. Weinberg, F. E. Jenney Jr., and M. W. W. Adams
Defining Genes in the Genome of the Hyperthermophilic Archaeon Pyrococcus furiosus: Implications for All Microbial Genomes
J. Bacteriol., November 1, 2005; 187(21): 7325 - 7332.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Physiol. Heart Circ. Physiol.Home page
D. Marti, R. Miquel, K. Ziani, R. Gisbert, M. D. Ivorra, E. Anselmi, L. Moreno, V. Villagrasa, D. Barettino, and P. D'Ocon
Correlation between mRNA levels and functional role of {alpha}1-adrenoceptor subtypes in arteries: evidence of {alpha}1L as a functional isoform of the {alpha}1A-adrenoceptor
Am J Physiol Heart Circ Physiol, November 1, 2005; 289(5): H1923 - H1932.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Z. Kan, P. W. Garrett-Engele, J. M. Johnson, and J. C. Castle
Evolutionarily conserved and diverged alternative splicing events show different expression and functional profiles
Nucleic Acids Res., September 29, 2005; 33(17): 5659 - 5666.
[Abstract] [Full Text] [PDF]


Home page
cellbioedHome page
A. E. Bednarski, S. C.R. Elgin, and H. B. Pakrasi
An Inquiry into Protein Structure and Genetic Disease: Introducing Undergraduates to Bioinformatics in a Large Introductory Course
CBE Life Sci Educ, September 1, 2005; 4(3): 207 - 220.
[Abstract] [Full Text] [PDF]


Home page
Stem CellsHome page
W. Wagner, S. Laufs, J. Blake, C. Schwager, X. Wu, J. W. Zeller, A. D. Ho, and S. Fruehauf
Retroviral Integration Sites Correlate with Expressed Genes in Hematopoietic Stem Cells
Stem Cells, September 1, 2005; 23(8): 1050 - 1058.
[Abstract] [Full Text] [PDF]


Home page
Hum Mol GenetHome page
G. V Kryukov, S. Schmidt, and S. Sunyaev
Small fitness effect of mutations in highly conserved non-coding regions
Hum. Mol. Genet., August 1, 2005; 14(15): 2221 - 2229.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
D. C. King, J. Taylor, L. Elnitski, F. Chiaromonte, W. Miller, and R. C. Hardison
Evaluation of regulatory potential and conservation scores for detecting cis-regulatory modules in aligned mammalian genome sequences
Genome Res., August 1, 2005; 15(8): 1051 - 1060.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. Liu and A. Grigoriev
Fast parsers for Entrez Gene
Bioinformatics, July 15, 2005; 21(14): 3189 - 3190.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
G. M. Cooper, E. A. Stone, G. Asimenos, NISC Comparative Sequencing Program, E. D. Green, S. Batzoglou, and A. Sidow
Distribution and intensity of constraint in mammalian genomic sequence
Genome Res., July 1, 2005; 15(7): 901 - 913.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
B. Mlecnik, M. Scheideler, H. Hackl, J. Hartler, F. Sanchez-Cabo, and Z. Trajanoski
PathwayExplorer: web service for visualizing high-throughput expression data on biological pathways
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W633 - W637.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
B. Zhang, S. Kirov, and J. Snoddy
WebGestalt: an integrated system for exploring gene sets in various biological contexts
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W741 - W748.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. Khatri, S. Sellamuthu, P. Malhotra, K. Amin, A. Done, and S. Draghici
Recent additions and improvements to the Onto-Tools
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W762 - W765.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Qian, N. Esumi, Y. Chen, Q. Wang, I. Chowers, and D. J. Zack
Identification of regulatory targets of tissue-specific transcription factors: application to retina-specific gene regulation
Nucleic Acids Res., June 20, 2005; 33(11): 3479 - 3491.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
E. A. Glazov, M. Pheasant, E. A. McGraw, G. Bejerano, and J. S. Mattick
Ultraconserved elements in insect genomes: A highly conserved intronic sequence implicated in the control of homothorax mRNA splicing
Genome Res., June 1, 2005; 15(6): 800 - 808.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. Korn, S. Rohrig, S. Schulze-Kremer, and U. Brinkmann
Common denominator procedure: a novel approach to gene-expression data mining for identification of phenotype-specific genes
Bioinformatics, June 1, 2005; 21(11): 2766 - 2772.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
G. J. Menger, K. Lu, T. Thomas, V. M. Cassone, and D. J. Earnest
Circadian profiling of the transcriptome in immortalized rat SCN cells
Physiol Genomics, May 11, 2005; 21(3): 370 - 381.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
A. A. Sharov, D. B. Dudekula, and M. S.H. Ko
Genome-wide assembly and analysis of alternative transcripts in mouse
Genome Res., May 1, 2005; 15(5): 748 - 754.
[Abstract] [Full Text] [PDF]


Home page
Toxicol SciHome page
D. C. Volz, D. C. Bencic, D. E. Hinton, J. M. Law, and S. W. Kullman
2,3,7,8-Tetrachlorodibenzo-p-Dioxin (TCDD) Induces Organ- Specific Differential Gene Expression in Male Japanese Medaka (Oryzias latipes)
Toxicol. Sci., May 1, 2005; 85(1): 572 - 584.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. M. Cameron, T. Hurd, and B. H. Robinson
Computational identification of human mitochondrial proteins based on homology to yeast mitochondrially targeted proteins
Bioinformatics, May 1, 2005; 21(9): 1825 - 1830.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
C. C. DiRusso, H. Li, D. Darwis, P. A. Watkins, J. Berger, and P. N. Black
Comparative Biochemical Studies of the Murine Fatty Acid Transport Proteins (FATP) Expressed in Yeast
J. Biol. Chem., April 29, 2005; 280(17): 16829 - 16837.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Nakao, R. A. Barrero, Y. Mukai, C. Motono, M. Suwa, and K. Nakai
Large-scale analysis of human alternative protein isoforms: pattern classification and correlation with subcellular localization signals
Nucleic Acids Res., April 28, 2005; 33(8): 2355 - 2363.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. M. Harrison, D. Zheng, Z. Zhang, N. Carriero, and M. Gerstein
Transcribed processed pseudogenes in the human genome: an intermediate form of expressed retrosequence lacking protein-coding ability
Nucleic Acids Res., April 28, 2005; 33(8): 2374 - 2383.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Wolstencroft, R. McEntire, R. Stevens, L. Tabernero, and A. Brass
Constructing ontology-driven protein family databases
Bioinformatics, April 15, 2005; 21(8): 1685 - 1692.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. Castelo, A. Reymond, C. Wyss, F. Camara, G. Parra, S. E. Antonarakis, R. Guigo, and E. Eyras
Comparative gene finding in chicken indicates that we are closing in on the set of multi-exonic widely expressed human genes
Nucleic Acids Res., April 4, 2005; 33(6): 1935 - 1939.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
X. Zhang, D. T. Odom, S.-H. Koo, M. D. Conkright, G. Canettieri, J. Best, H. Chen, R. Jenner, E. Herbolsheimer, E. Jacobsen, et al.
Genome-wide analysis of cAMP-response element binding protein occupancy, phosphorylation, and target gene activation in human tissues
PNAS, March 22, 2005; 102(12): 4459 - 4464.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
S. W. Cole, W. Yan, Z. Galic, J. Arevalo, and J. A. Zack
Expression-based monitoring of transcription factor activity: the TELiS database
Bioinformatics, March 15, 2005; 21(6): 803 - 810.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
J. Yan and T. G. Marr
Computational analysis of 3'-ends of ESTs shows four classes of alternative polyadenylation in human, mouse, and rat
Genome Res., March 1, 2005; 15(3): 369 - 375.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Kim, J. Seo, Y. S. Lee, and S. Kim
TFExplorer: integrated analysis database for predicted transcription regulatory elements
Bioinformatics, February 15, 2005; 21(4): 548 - 550.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
K. Tan, L. A. McCue, and G. D. Stormo
Making connections between novel transcription factors and their DNA motifs
Genome Res., February 1, 2005; 15(2): 312 - 320.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Kahraman, A. Avramov, L. G. Nashev, D. Popov, R. Ternes, H.-D. Pohlenz, and B. Weiss
PhenomicDB: a multi-species genotype/phenotype database for comparative phenomics
Bioinformatics, February 1, 2005; 21(3): 418 - 420.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
L. A. Blomberg, E. L. Long, T. S. Sonstegard, C. P. Van Tassell, J. R. Dobrinsky, and K. A. Zuelke
Serial analysis of gene expression during elongation of the peri-implantation porcine trophectoderm (conceptus)
Physiol Genomics, January 20, 2005; 20(2): 188 - 194.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
L. Chen, H. Liu, and C. Friedman
Gene name ambiguity of eukaryotic nomenclatures
Bioinformatics, January 15, 2005; 21(2): 248 - 256.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
L. Florea, V. Di Francesco, J. Miller, R. Turner, A. Yao, M. Harris, B. Walenz, C. Mobarry, G. V. Merkulov, R. Charlab, et al.
Gene and alternative splicing annotation with AIR
Genome Res., January 1, 2005; 15(1): 54 - 66.
[Abstract] [Full Text] [PDF]


Home page
Mol Cancer ResHome page
Z. Liu, H. Lu, Z. Jiang, A. Pastuszyn, and C.-a. A. Hu
Apolipoprotein L6, a Novel Proapoptotic Bcl-2 Homology 3-Only Protein, Induces Mitochondria-Mediated Apoptosis in Cancer Cells
Mol. Cancer Res., January 1, 2005; 3(1): 21 - 31.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
S. J. Hubbard, D. V. Grafham, K. J. Beattie, I. M. Overton, S. R. McLaren, M. D.R. Croning, P. E. Boardman, J. K. Bonfield, J. Burnside, R. M. Davies, et al.
Transcriptome analysis for the chicken based on 19,626 finished cDNA sequences and 485,337 expressed sequence tags
Genome Res., January 1, 2005; 15(1): 174 - 183.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Zhang, J. Hu, M. Recce, and B. Tian
PolyA_DB: a database for mammalian mRNA polyadenylation
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D116 - D120.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
A. Bairoch, R. Apweiler, C. H. Wu, W. C. Barker, B. Boeckmann, S. Ferro, E. Gasteiger, H. Huang, R. Lopez, M. Magrane, et al.
The Universal Protein Resource (UniProt)
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D154 - D159.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Y. Chen, Y. Zhang, Y. Yin, G. Gao, S. Li, Y. Jiang, X. Gu, and J. Luo
SPD--a web-based secreted protein database
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D169 - D173.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. Giudicelli, D. Chaume, and M.-P. Lefranc
IMGT/GENE-DB: a comprehensive database for human and mouse immunoglobulin and T cell receptor genes
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D256 - D261.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. Mi, B. Lazareva-Ulitsky, R. Loo, A. Kejariwal, J. Vandergriff, S. Rabkin, N. Guo, A. Muruganujan, O. Doremieux, M. J. Campbell, et al.
The PANTHER database of protein families, subfamilies, functions and pathways
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D284 - D288.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. L. Winsor, R. Lo, S. J. H. Sui, K. S.E. Ung, S. Huang, D. Cheng, W.-K. H. Ching, R. E. W. Hancock, and F. S. L. Brinkman
Pseudomonas aeruginosa Genome Database and PseudoCAP: facilitating community-based, continually updated, genome annotation
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D338 - D343.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
H. J. Kang, K. O. Choi, B.-D. Kim, S. Kim, and Y. J. Kim
FESD: a Functional Element SNPs Database in human
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D518 - D522.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
M. Shklar, L. Strichman-Almashanu, O. Shmueli, M. Shmoish, M. Safran, and D. Lancet
GeneTide--Terra Incognita Discovery Endeavor: a new transcriptome focused member of the GeneCards/GeneNote suite of databases
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D556 - D561.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
R. Homayouni, K. Heinrich, L. Wei, and M. W. Berry
Gene clustering by Latent Semantic Indexing of MEDLINE abstracts
Bioinformatics, January 1, 2005; 21(1): 104 - 115.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
H. Iwama and T. Gojobori
Highly conserved upstream sequences for transcription factor genes and implications for the regulatory network
PNAS, December 7, 2004; 101(49): 17156 - 17161.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
L. Ding, A. Sabo, N. Berkowicz, R. R. Meyer, Y. Shotland, M. R. Johnson, K. H. Pepin, R. K. Wilson, and J. Spieth
EAnnot: A genome annotation tool using experimental evidence
Genome Res., December 1, 2004; 14(12): 2503 - 2509.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
L. Zhang, S. Kasif, C. R. Cantor, and N. E. Broude
GC/AT-content spikes as genomic punctuation marks
PNAS, November 30, 2004; 101(48): 16855 - 16860.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
A. Narezkina, K. D. Taganov, S. Litwin, R. Stoyanova, J. Hayashi, C. Seeger, A. M. Skalka, and R. A. Katz
Genome-Wide Analyses of Avian Sarcoma Virus Integration Sites
J. Virol., November 1, 2004; 78(21): 11656 - 11663.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
P. D. Thomas and A. Kejariwal
Coding single-nucleotide polymorphisms associated with complex vs. Mendelian disease: Evolutionary evidence for differences in molecular effects
PNAS, October 26, 2004; 101(43): 15398 - 15403.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
W. Thompson, M. J. Palumbo, W. W. Wasserman, J. S. Liu, and C. E. Lawrence
Decoding Human Regulatory Circuits
Genome Res., October 1, 2004; 14(10a): 1967 - 1974.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
T. S. Furey, M. Diekhans, Y. Lu, T. A. Graves, L. Oddy, J. Randall-Maher, L. W. Hillier, R. K. Wilson, and D. Haussler
Analysis of Human mRNAs With the Reference Genome Sequence Reveals Potential Errors, Polymorphisms, and RNA Editing
Genome Res., October 1, 2004; 14(10b): 2034 - 2040.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
D. N. Messina, J. Glasscock, W. Gish, and M. Lovett
An ORFeome-based Analysis of Human Transcription Factor Genes and the Construction of a Microarray to Interrogate Their Expression
Genome Res., October 1, 2004; 14(10b): 2041 - 2047.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. Oyama, C. Itagaki, H. Hata, Y. Suzuki, T. Izumi, T. Natsume, T. Isobe, and S. Sugano
Analysis of Small Human Proteins Reveals the Translation of Upstream Open Reading Frames of mRNAs
Genome Res., October 1, 2004; 14(10b): 2048 - 2052.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
The MGC Project Team
The Status, Quality, and Expansion of the NIH Full-Length cDNA Project: The Mammalian Gene Collection (MGC)
Genome Res., October 1, 2004; 14(10b): 2121 - 2127.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
B. Issac and G. P. S. Raghava
EGPred: Prediction of Eukaryotic Genes Using Ab Initio Methods After Combining With Sequence Similarity Approaches
Genome Res., September 1, 2004; 14(9): 1756 - 1766.
[Abstract] [Full Text] [PDF]


Home page
J Exp BotHome page
E. Jamet
Bioinformatics as a critical prerequisite to transcriptome and proteome studies
J. Exp. Bot., September 1, 2004; 55(405): 1977 - 1979.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
B. H. Mecham, D. Z. Wetmore, Z. Szallasi, Y. Sadovsky, I. Kohane, and T. J. Mariani
Increased measurement accuracy for sequence-verified microarray probes
Physiol Genomics, August 11, 2004; 18(3): 308 - 315.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Leipzig, P. Pevzner, and S. Heber
The Alternative Splicing Gallery (ASG): bridging the gap between genome and transcriptome
Nucleic Acids Res., August 3, 2004; 32(13): 3977 - 3983.
[Abstract] [Full Text] [PDF]


Home page
IOVSHome page
R. H. Farkas, J. Qian, J. L. Goldberg, H. A. Quigley, and D. J. Zack
Gene Expression Profiling of Purified Rat Retinal Ganglion Cells
Invest. Ophthalmol. Vis. Sci., August 1, 2004; 45(8): 2503 - 2513.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
G. Rajashekara, J. D. Glasner, D. A. Glover, and G. A. Splitter
Comparative Whole-Genome Hybridization Reveals Genomic Islands in Brucella Species
J. Bacteriol., August 1, 2004; 186(15): 5040 - 5051.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. R. Nelson, G. Marnellos, S. Kammerer, C. R. Hoyal, M. M. Shi, C. R. Cantor, and A. Braun
Large-Scale Validation of Single Nucleotide Polymorphisms in Gene Regions
Genome Res., August 1, 2004; 14(8): 1664 - 1668.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
P. C. FitzGerald, A. Shlyakhtenko, A. A. Mir, and C. Vinson
Clustering of DNA Sequences in Human Promoters
Genome Res., August 1, 2004; 14(8): 1562 - 1574.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
P. M. Haverty, M. C. Frith, and Z. Weng
CARRIE web service: automated transcriptional regulatory network inference and interactive analysis
Nucleic Acids Res., July 1, 2004; 32(suppl_2): W213 - W216.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. B. Vega, D. K. Bangarusamy, L. D. Miller, E. T. Liu, and C.-Y. Lin
BEARR: Batch Extraction and Analysis of cis-Regulatory Regions
Nucleic Acids Res., July 1, 2004; 32(suppl_2): W257 - W260.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Print PDF (107K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in ISI Web of Science
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (498)
Right arrowRequest Permissions
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Pruitt, K. D.
Right arrow Articles by Maglott, D. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Pruitt, K. D.
Right arrow Articles by Maglott, D. R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?