Dennis A. Benson*, Mark S. Boguski, David J. Lipman, James Ostell, B. F. Francis Ouellette+, Barbara A. Rapp and David L. Wheeler
National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Building 38A, 8600 Rockville Pike, Bethesda, MD 20894, USA
Received October 5, 1998;Accepted October 5, 1998
ABSTRACT
The GenBank (Registered Trademark symbol) sequence database incorporates DNA sequences from all available public sources, primarily through the direct submission of sequence data from individual laboratories and from large-scale sequencing projects. Most submitters use the BankIt (Web) or Sequin programs to format and send sequence data. Data exchange with the EMBL Data Library and the DNA Data Bank of Japan helps ensure comprehensive worldwide coverage. GenBank data is accessible through NCBI's integrated retrieval system, Entrez, which integrates data from the major DNA and protein sequence databases along with taxonomy, genome and protein structure information. MEDLINE (Registered Trademark symbol) abstracts from published articles describing the sequences are included as an additional source of biological annotation through the PubMed search system. Sequence similarity searching is offered through the BLAST series of database search programs. In addition to FTP, Email, and server/client versions of Entrez and BLAST, NCBI offers a wide range of World Wide Web retrieval and analysis services based on GenBank data. The GenBank database and related resources are freely accessible via the URL: http://www.ncbi.nlm.nih.gov
GenBank (1) is a public database of all known nucleotide and protein sequences with supporting bibliographic and biological annotation, built and distributed by the National Center for Biotechnology Information (NCBI), a division of the National Library of Medicine (NLM), located on the campus of the US National Institutes of Health (NIH). NCBI was created by special legislation in 1988 to develop information systems in the field of molecular biology to support the biomedical research community. NCBI was also mandated to conduct basic and applied research and, as part of the NIH Intramural Program, NCBI scientists work in areas of gene and genome analysis, computational structural biology and mathematical methods for sequence analysis.
NCBI builds GenBank primarily from the direct submission of sequence data from authors. Another major source of data is bulk submission of EST and other high-throughput data from sequencing centers. The data are supplemented by sequences submitted to other public databases. Through a long-standing international collaboration with the EMBL Data Library (2) in the UK and the DNA Databank of Japan (DDBJ) (3), data are exchanged daily to ensure that all three sites maintain a comprehensive collection of sequence information. NCBI makes the data available at no cost over the Internet, by FTP access and by Web text and sequence similarity search services.
GenBank continues to grow at an exponential rate. Over the past 12 months 770 000 new sequences have been added. As of Release 108 in August 1998, GenBank contained almost 1.8 billion nucleotide bases from 2.5 million different sequences. Complete genomes represent a growing portion of the database, with 10 complete genomes added in 1998, compared with six in 1997, and two in 1996. A recent addition is the Caenorhabditiselegans genome of 100 Mb sequenced jointly by Washington University in St Louis and the Sanger Sequencing Center in Hinxton, UK. There are at least 20 additional microorganism genomes that are being sequenced, many of which are expected to be in the public databases over the coming year. Historically, GenBank had been doubling in size about every 18 months, but that rate has accelerated to doubling every 15 months due primarily to the enormous growth in data from expressed sequence tags (ESTs). Nearly 70% of the sequences in the current GenBank release are ESTs, and most of the growth in sequence records over the past three years has come from the collaborative project between Merck & Co. and Washington University (4,5). Human EST sequencing continues and is being supplemented by a mouse EST project supported by the Howard Hughes Medical Institute at Washington University.
Over 40,000 different species are represented in GenBank and new species are being added at the rate of 900 per month. Human sequences constitute 54% of the total sequences (42% of all sequences are human ESTs). After Homo sapiens, the top species in GenBank in terms of the number of bases include Mus musculus, C.elegans, Arabidopsis thaliana and Drosophila melanogaster. Database sequences are processed and can be queried using a comprehensive sequence-based taxonomy developed by NCBI in collaboration with EMBL and DDBJ and with the valuable assistance of external advisors and curators. Further details, along with a taxonomy browser and information on taxonomic resources, may be found on NCBI's home page.
Each GenBank entry includes a concise description of the sequence, the scientific name and taxonomy of the source organism, and a table of features that identifies coding regions and other sites of biological significance, such as transcription units, intron/exon boundaries, sites of mutations or modifications and other sequence features. Protein translations for coding regions are in the feature table. Bibliographic references are included along with links to the MEDLINE abstracts for all published sequences.
The files in the GenBank distribution have traditionally been divided into `divisions' that roughly correspond to taxonomic divisions, e.g., bacteria, viruses, primates, rodents, etc. There are currently 17 divisions. For convenience in file transfer, the larger divisions, e.g., EST and primate, are divided into multiple files.
To produce the GenBank and Entrez databases, NCBI tracks and indexes records from multiple sources of sequence data: DNA sequences from EMBL, DDBJ and the US Office of Patents and Trademarks (USPTO), plus amino acid sequences from the nucleotide databases (CDS features with `/translations'), Protein Identification Resource (PIR), SWISS-PROT, Protein Research Foundation (PRF), the Protein Data Bank (PDB) and the USPTO. GenBank assigns each record an accession number, which is considered the unique identifier for each GenBank entry and does not change, even when there is a change to the sequence or annotation. In order to identify specific sequences from the different sources, as well as any changes in those sequences that may occur, NCBI additionally assigns a stable identifier, termed a `gi' number, to each sequence. When a change in a sequence occurs, a new `gi' number is assigned to the new sequence version. These identifiers appear in the `NID' field of a GenBank record, immediately following the ACCESSION field.
There is an agreement among the collaborative DNA sequence databases to introduce in February 1999 a third identifier which will encompass the information present in both the `gi' and ACCESSION number. GenBank will show this identifier on the VERSION line, which will appear below the NID line and will be in the form `Accession.version'. For example, an entry appearing in the database for the first time would have a VERSION number equivalent to the ACCESSION number followed by `.1' to reflect that this is the first version of the sequence in this entry, e.g.:
ACCESSION AF000001
NID g987654321
VERSION AF000001.1 GI: 987654321
The VERSION line will also display the `gi' number. If the nucleotide sequence changes, then so will the `gi' number and the version, but the accession will remain the same. Although the NID line will carry redundant information, this line will remain in the file for an extended time to ensure compatibility with existing programs.
A similar system for tracking changes in the corresponding protein translations will also be introduced in February 1999. Protein sequences will have identification numbers (in the format of three letters followed by five digits, e.g., AAA00001) that do not change, followed by a version number which increases with each subsequent version of the sequence. This will appear as a qualifier for a CDS feature in the FEATURES table portion of a GenBank entry, e.g., /protein_id=`AAA00001.1'
Protein translations currently receive their own unique `gi' number, which appears as a qualifier on the CDS feature: /db_xref=`PID:g1234567'. The letter prefix indicates the database of origin for these identifiers (d=DDBJ, e=EMBL, g=GenBank). Eventually, after a transition period of at least a year, this form will be phased out since the new `protein_id' complete with the version number will represent both a stable identifier and a means to identify changes in the sequence.
ESTs continue to be the major source of new sequence records and genes. Last year there were 1 247 603 sequences in the EST division of GenBank. Over the past year the number of ESTs has increased by nearly 50% to the current total of 1 765 860 sequences representing over 130 different organisms. The top five organisms include: 1 072 823 human sequences (61%); 351 852 mouse (20%); 72 569 nematode (4%); 57 227 rat (3%); and 37 848 fruit fly (2%).
As part of its daily processing of EST data, NCBI identifies through BLAST searches all homologies for new EST sequences and incorporates that information into a specialized database (dbEST). ESTs continue to provide the major source of new gene discoveries and NCBI has processed more than a million queries (BLAST searches, Email retrievals, Web accesses and anonymous FTP downloads) for dbEST in the past year.
In order to organize the EST data in a useful fashion, NCBI has created the UniGene collection of unique human genes (6) and mouse genes. UniGene starts with entries in the Primate (or Rodent) division of GenBank, combines these with ESTs of that organism, and creates clusters of sequences that share virtually identical 3[prime] untranslated regions (3[prime] UTRs). In this manner, over one million human ESTs in GenBank have been reduced 20-fold to ~52 000 sequence clusters, each of which may be considered as representing a single human gene. In a similar fashion, the mouse ESTs have been organized as 10 000 clusters. The UniGene collection has been effectively used as a source of mapping candidates for the construction of a human gene map (7). In this case, the 3[prime] UTRs of genes and ESTs are converted to sequence-tagged sites (STSs) which are then placed on physical maps and integrated with preexisting genetic maps of the genome. The UniGene collection has also been used as a source of unique sequences for the fabrication of `chips' for the large-scale study of gene expression (8). Access to the UniGene collection is provided through NCBI's home page on the Web.
The ultimate purpose for creating high resolution physical maps of the human genome is to create a scaffold for organizing large scale sequencing (9). Physical maps based on STS landmarks are used to develop so-called `sequence-ready' clones consisting of overlapping cosmids or BACs. As the HTG sequence data derived from these clones are submitted to GenBank, STSs become crucial reference points for organizing, presenting and searching the data. NCBI uses `electronic PCR' to compare all human sequences with the contents of the STS division of GenBank; this identifies primer-binding sites on the human sequences that may be amplified in a PCR reaction. This tool permits the assignment of an initial location on the map for sequence data and the association of existing GenBank entries to the new reference sequence. The electronic PCR tool is also being made publicly available on the Web to enable any researcher with a new human sequence to relate that sequence to existing maps and HTG sequence data.
The STS division of GenBank currently contains 60 564 sequences and includes anonymous STSs based on genomic sequence as well gene-based STSs derived from the 3[prime] ends of genes and ESTs. These STS records usually include primer sequences, annotations and PCR reaction conditions.
The Genome Survey Sequences (GSS) division of GenBank has been the fastest growing division in the last year, having increased 5-fold to a total of 222 573 records with 103 970 022 nucleotides. GSS records represent `random' genomic sequences, but are predominantly represented by `BAC ends'-single reads from Bacterial Artificial Chromosomes used in a variety of genome sequencing projects, notably that of human (150 098 records), Fugu rubripes (32 075 records) and A.thaliana (26 605 records). It is expected that the human data will be used along with the STS records in tiling the BACs used for the Human Genome Project (10).
The high throughput genomic (HTG) sequences are unfinished large-scale genomic records that are in transition to a finished state, after which they will be placed in the appropriate organism division (11). It is now clear that a great number of human sequences will continue to appear in the unfinished (HTG) division of GenBank, as they will in the corresponding finished (PRI) division. Together these two divisions should add some 2000 Mb of new genomic sequences from US-sponsored laboratories within the next three years.
Single-base variations in the human genetic code called single-nucleotide polymorphisms (SNPs) promise to be helpful in large-scale association genetics studies. In collaboration with the National Human Genome Research Institute (NHGRI), NCBI has established a database (dbSNP) to serve as a central repository for both single base nucleotide substitutions and short deletion and insertion polymorphisms. One of the goals of the US Human Genome Project is to use these data to generate publicly available maps of at least 100 000 SNP markers within the next five years (12). The dbSNP database contains the experimental conditions used to detect each mutation and each mutation's observed variation for populations and individuals. A flexible file-based submission format has been developed to accommodate the typical large set of SNP submissions. Although dbSNP is not strictly a part of GenBank, appropriate links between dbSNP data and other genomic data, including GenBank, are being developed for NCBI's retrieval systems.
The data in GenBank come from two sources: (i) authors who submit data directly to the collaborating databases, and (ii) bulk submissions from sequencing centers in the form of ESTs, STSs, GSSs or large genomic records (usually sequences from cosmids, BACs or YACs). Data are exchanged daily with DDBJ and EMBL, our collaborating databases, so that the daily updates from NCBI servers incorporate the most recently available sequence data from all sources.
Virtually all records enter GenBank as direct submissions, with the majority of authors using the BankIt or Sequin programs. The nucleotide databases no longer routinely manually scan the published literature for articles containing sequences. Many journals have a policy of requiring authors with sequence data to submit data directly to the database as a condition of publication.
GenBank staff can usually assign an author an accession number within two working days of receipt, and do so at a rate of several hundred per day. The accession number serves as confirmation that the sequence has been submitted and allows readers of the article to retrieve the relevant data. All direct submissions receive a systematic quality assurance review including checking for vector contamination, verifying proper translation of coding regions, and correct taxonomy and bibliographic citations. A draft of the GenBank record is passed back to the author for review before entering the database. Authors have the right to request that their sequences be kept confidential until the time of publication. In these cases, authors are reminded to inform the database of the publication date of the article in which the sequence is cited in order to have a timely release of the data. Although only the submitting scientist is permitted to modify sequence data or annotations, all users are encouraged to inform the database of possible lags in releasing data, errors or omissions using the Email address update@ncbi.nlm.nih.gov
Several large-scale sequencing projects are producing megabases of human genomic DNA sequence. NCBI works closely with sequencing centers to ensure timely incorporation of these data for public release. In parallel, NCBI has developed methods to display these data integrated with genetic and physical map data and to search the sequences more effectively (e.g., through options in BLAST to mask Alu and other types of repetitive elements). GenBank offers special batch procedures for large-scale sequencing groups to facilitate data submission using the program `fa2htgs' and other tools (13).
Over 65% of individual submissions are received through a Web-based data submission tool, BankIt. With BankIt, authors enter sequence information directly into a form, edit as necessary, and add biological annotation (e.g., coding regions, mRNA features). Free-form text boxes provide the option of using free text to describe the sequence, without having to learn formatting rules or use restricted vocabularies. BankIt creates a draft record in GenBank flat file format for the user to review and revise. BankIt is the tool of choice for simple submissions, especially when only one or a small number of records is submitted (11).
NCBI has developed a platform-independent submission program called Sequin which runs stand-alone or over the network. Sequin handles simple sequences (e.g. a cDNA), as well as long sequences and segmented entries (with which BankIt and other Web-based forms have difficulty). Sequin has convenient editing and complex annotation features and contains a number of built-in validation functions for enhanced quality assurance. It is also designed to facilitate submission of sequences from phylogenetic, population (Fig. 1) and mutation studies. Sequin's strength is in editing and updating sequence records, as well as its ability to function as a sequence analysis tool. For example, Sequin can now incorporate any analysis tool available on the Web that accepts FASTA or ASN.1 as its input format. In addition, Sequin is able to work on large records (e.g. the Escherichia coli genome at 5.6 Mb) and read in all of its annotations via simple tables. Versions for Macintosh, PC and Unix computers are available via anonymous FTP to `ncbi.nlm.nih.gov' in the `sequin' directory. Once a submission is completed, users can Email it to the address: gb-sub@ncbi.nlm.nih.gov. Additional information about Sequin can be found through the NCBI home page.
Figure1. Aligned sequence view from Sequin for the submission of HIV1 sequences representing a population study. Sequin allows authors to prepare and submit nucleotide sequences for which they may have already generated an alignment, and also allows editing of these alignments. In addition, various views of the data are available, including a graphical view (lower left), standard GenBank text view (lower right), or a view showing features and alignments (upper center). Sequin has a number of functions to deal with population, phylogenetic and mutational studies, as well as standard records (e.g., cDNA with one CDS feature) or very large ones (e.g., E.coli genome with 4000 CDS features).
Entrez is an integrated database retrieval system that accesses DNA and protein sequence data, MEDLINE references (PubMed), genome data, the NCBI taxonomy, and protein structures from the Molecular Modeling Database, MMDB (14). The DNA and protein sequence data are integrated from a variety of sources and therefore include more sequence data than are available within GenBank alone. PubMed was developed at NCBI and allows text searching of the 9 million references in MEDLINE as well as links to the full-text of over 200 journals that are available on the Web.
Entrez provides an entry point into sequence or bibliographic records by simple Boolean queries. From a record, a user can `point-and-click' via hypertext links to reach different information sources. Some of the links are simple cross-references, for example, between a sequence and the abstract of the paper in which the sequence was reported, or between a protein sequence and its corresponding DNA sequence. Other links are based on computed similarities among the sequences or among MEDLINE abstracts. The resulting pre-computed `neighbors' allow very rapid access for browsing groups of related records.
Entrez is available over the Internet both through the Web and in a server/client version. The Web version, including PubMed MEDLINE searching, is used by over 60 000 different users per day. The server/client version of Entrez operates with a client program on a user's machine connected over the Internet to a server located at the NCBI. Client programs for Macintosh, PC and Unix computers can be obtained by downloading from `ncbi.nlm.nih.gov' in the `entrez/network' directory. The Web version has essentially the same functionality as the server/client, with the added capability of linking to full-text versions of journal articles. Both versions allow viewing of genome and related map information as well as 3D structures (14).
The most frequent type of analysis performed using GenBank is the search for sequences similar to a query sequence. NCBI offers the BLAST family of search programs to locate good alignments between a query sequence and database sequences. Each BLAST alignment is accompanied by an alignment score and a measure of statistical significance, called the Expectation value, for judging the quality of the alignment. The Web version of the standard BLAST 2.0 program accepts a query sequence or accession number which is pasted into an input window. The search for similarity is performed using a PAM or BLOSUM scoring matrix and results in a set of gapped alignments containing hyperlinks to database hits. The Web versions of BLAST now provide a graphical overview consisting of an array of bars, representing database hits, which are aligned to a master bar, which represents the query sequence. These bars are color-coded by expectation value and clearly show the extent and quality of the sequence similarities detected by BLAST as well as the disposition of gaps in the alignments.
Specialized versions of Web BLAST facilitate other approaches to similarity searching. Position Specific Iterated PSI-BLAST (15) initially performs a conventional BLAST search to produce alignments from which it constructs a position specific profile. Subsequent BLAST iterations use this profile in place of the query and scoring matrix to find similarities in a database. The newly implemented Pattern Hit Initiated PHI-BLAST (16) takes both a peptide query sequence and a peptide pattern, or motif, found within the query, as input. The motif specifies an obligatory match between query and database sequences about which optimal local alignments are constructed.
The default databases searched by BLAST are the non-redundant (nr) nucleotide and protein subsets of GenBank. However, specialized databases or subsets may also be searched. Standard Web BLAST allows the restriction of searches to the `month' database, comprised of sequences submitted over the last 30 days, dbEST (all of dbEST, human only or mouse only), E.coli or Saccharomyces cerevisiae sequences, as well as several other GenBank subsets. A new specialized BLAST page allows a nucleotide query against any combination of 16 complete and 20 incomplete microbial genomes. Another recent addition, `Blast 2 Sequences', can display the similarity between two DNA or peptide sequences by producing a dot-plot representation of the alignments it reports.
PowerBLAST (17) is a network BLAST client designed for the analysis of large contigs of genomic sequence. PowerBLAST is capable of processing 100 kb of sequence per hour to produce a set of gapped alignments which can be filtered by organism.
The full GenBank release (issued every 2 months) and the daily updates (which also incorporate sequence data from other public databases) are available by anonymous FTP from `ncbi.nlm.nih.gov'. The full release in flat-file format is available as compressed files in the directory, `genbank'. A cumulative update file is contained in the sub-directory, `daily', and a non-cumulative set of updates is contained in `daily-nc'. Software developers creating their own interfaces or analysis tools for GenBank data are offered the NCBI ToolKit to assist in developing specialized applications. The ToolKit software can be found in the directory `toolbox/ncbi_tools'.
Users with access to electronic mail can search GenBankand several other databases by accession number or Booleancombinations of text words. The QUERY server (query@ncbi.nlm.nih.gov) performs text-based searches of the integrated Entrez databases. It allows access not only to sets of sequence or MEDLINE records, but also to the neighbored data. Various output formats, such as FASTA for sequence data, are available. BLAST sequence similarity searches can be performed by Email through the address: blast{at}ncbi.nlm.nih.gov. Documentation can be obtained by sending the word `help' in the body of an Email message to the addresses above. The flat file version of GenBank is no longer available on CD-ROM due to declining demand and the large number of disks needed to contain a single release.
The GenBank Fellowship Program is an NCBI initiative to improve the quality of the database and also to serve as a bioinformatics training program. GenBank Fellows are selected for strong backgrounds in biology and for motivation to apply computational tools to the organization of electronic data in molecular and structural biology, genetics and phylogeny. GenBank Fellows, under the supervision of a mentor from NCBI's Computational Biology Branch, pursue various applied research projects to improve the quality and annotation of GenBank entries, to reduce sequence redundancy, and to establish and maintain links to other databases. Applications are reviewed on a continuing cycle.
11. Kans,J.A. and Ouellette,B.F.F. (1998) In Baxevanis,A. and Ouellette,B.F.F. (eds), Bioinformatics: A Practical Guide to the Analysis of Genes and Proteins. John Wiley and Sons, Inc., New York, NY, pp. 319-353.
12. Collins,F.S., Patrinos,A., Jordan,E., Chakravarti,A., Gesteland,R., Walters,L. and the members of the DOE and NIH planning groups (1998) Science, 282, 682-689. MEDLINE Abstract
P. H. Summanen, P. A. Lawson, and S. M. Finegold Porphyromonas bennonis sp. nov., isolated from human clinical specimens
Int J Syst Evol Microbiol,
July 1, 2009;
59(7):
1727 - 1732.
[Abstract][Full Text][PDF]
S. V. Suetin, V. A. Shcherbakova, N. A. Chuvilskaya, E. M. Rivkina, N. E. Suzina, A. M. Lysenko, and D. A. Gilichinsky Clostridium tagluense sp. nov., a psychrotolerant, anaerobic, spore-forming bacterium from permafrost
Int J Syst Evol Microbiol,
June 1, 2009;
59(6):
1421 - 1426.
[Abstract][Full Text][PDF]
F. Chamkh, C. Sproer, P. C. Lemos, S. Besson, A.-G. El Asli, R. Bennisse, M. Labat, M. Reis, and A.-I. Qatibi Desulfovibrio marrakechensis sp. nov., a 1,4-tyrosol-oxidizing, sulfate-reducing bacterium isolated from olive mill wastewater
Int J Syst Evol Microbiol,
May 1, 2009;
59(5):
936 - 942.
[Abstract][Full Text][PDF]
Z. Ben Ali Gam, R. Oueslati, S. Abdelkafi, L. Casalot, J. L. Tholozan, and M. Labat Desulfovibrio tunisiensis sp. nov., a novel weakly halotolerant, sulfate-reducing bacterium isolated from exhaust water of a Tunisian oil refinery
Int J Syst Evol Microbiol,
May 1, 2009;
59(5):
1059 - 1063.
[Abstract][Full Text][PDF]
O. Haouari, M.-L. Fardeau, J.-L. Cayol, C. Casiot, F. Elbaz-Poulichet, M. Hamdi, M. Joseph, and B. Ollivier Desulfotomaculum hydrothermale sp. nov., a thermophilic sulfate-reducing bacterium isolated from a terrestrial Tunisian hot spring
Int J Syst Evol Microbiol,
November 1, 2008;
58(11):
2529 - 2535.
[Abstract][Full Text][PDF]
T. Barbeyron, S. L'Haridon, G. Michel, and M. Czjzek Mariniflexile fucanivorans sp. nov., a marine member of the Flavobacteriaceae that degrades sulphated fucans from brown algae
Int J Syst Evol Microbiol,
September 1, 2008;
58(9):
2107 - 2113.
[Abstract][Full Text][PDF]
R. A. Artigas, A. Gonzalez, E. Riquelme, C. A. Carvajal, A. Cattani, A. Martinez-Aguayo, A. M. Kalergis, T. Perez-Acle, and C. E. Fardella A Novel Adrenocorticotropin Receptor Mutation Alters Its Structure and Function, Causing Familial Glucocorticoid Deficiency
J. Clin. Endocrinol. Metab.,
August 1, 2008;
93(8):
3097 - 3105.
[Abstract][Full Text][PDF]
T. Barbeyron, F. Carpentier, S. L'Haridon, M. Schuler, G. Michel, and R. Amann Description of Maribacter forsetii sp. nov., a marine Flavobacteriaceae isolated from North Sea water, and emended description of the genus Maribacter
Int J Syst Evol Microbiol,
April 1, 2008;
58(4):
790 - 797.
[Abstract][Full Text][PDF]
I. Muller, C. Beeger, C. Alix-Panabieres, X. Rebillard, K. Pantel, and H. Schwarzenbach Identification of Loss of Heterozygosity on Circulating Free DNA in Peripheral Blood of Prostate Cancer Patients: Potential and Technical Improvements
Clin. Chem.,
April 1, 2008;
54(4):
688 - 696.
[Abstract][Full Text][PDF]
P.-P. Liebgott, M. Joseph, M.-L. Fardeau, J.-L. Cayol, E. Falsen, F. Chamkh, A.-I. Qatibi, and M. Labat Clostridiisalibacter paucivorans gen. nov., sp. nov., a novel moderately halophilic bacterium isolated from olive mill wastewater
Int J Syst Evol Microbiol,
January 1, 2008;
58(1):
61 - 67.
[Abstract][Full Text][PDF]
C. Hall and F. S. Dietrich The Reacquisition of Biotin Prototrophy in Saccharomyces cerevisiae Involved Horizontal Gene Transfer, Gene Duplication and Gene Clustering
Genetics,
December 1, 2007;
177(4):
2293 - 2307.
[Abstract][Full Text][PDF]
Z. Ben Ali Gam, S. Abdelkafi, L. Casalot, J. L. Tholozan, R. Oueslati, and M. Labat Modicisalibacter tunisiensis gen. nov., sp. nov., an aerobic, moderately halophilic bacterium isolated from an oilfield-water injection sample, and emended description of the family Halomonadaceae Franzmann et al. 1989 emend Dobson and Franzmann 1996 emend. Ntougias et al. 2007
Int J Syst Evol Microbiol,
October 1, 2007;
57(10):
2307 - 2313.
[Abstract][Full Text][PDF]
N. Klouche, M.-L. Fardeau, J.-F. Lascourreges, J.-L. Cayol, H. Hacene, P. Thomas, and M. Magot Geosporobacter subterraneus gen. nov., sp. nov., a spore-forming bacterium isolated from a deep subsurface aquifer
Int J Syst Evol Microbiol,
August 1, 2007;
57(8):
1757 - 1761.
[Abstract][Full Text][PDF]
C. Diaz, S. Baena, M.-L. Fardeau, and B. K. C. Patel Aminiphilus circumscriptus gen. nov., sp. nov., an anaerobic amino-acid-degrading bacterium from an upflow anaerobic sludge reactor
Int J Syst Evol Microbiol,
August 1, 2007;
57(8):
1914 - 1918.
[Abstract][Full Text][PDF]
S. Crapart, M.-L. Fardeau, J.-L. Cayol, P. Thomas, C. Sery, B. Ollivier, and Y. Combet-Blanc Exiguobacterium profundum sp. nov., a moderately thermophilic, lactic acid-producing bacterium isolated from a deep-sea hydrothermal vent
Int J Syst Evol Microbiol,
February 1, 2007;
57(2):
287 - 292.
[Abstract][Full Text][PDF]
E. Miranda-Tello, M.-L. Fardeau, C. Joulian, M. Magot, P. Thomas, J.-L. Tholozan, and B. Ollivier Petrotoga halophila sp. nov., a thermophilic, moderately halophilic, fermentative bacterium isolated from an offshore oil well in Congo
Int J Syst Evol Microbiol,
January 1, 2007;
57(1):
40 - 44.
[Abstract][Full Text][PDF]
O. Haouari, M.-L. Fardeau, L. Casalot, J.-L. Tholozan, M. Hamdi, and B. Ollivier Isolation of sulfate-reducing bacteria from Tunisian marine sediments and description of Desulfovibrio bizertensis sp. nov.
Int J Syst Evol Microbiol,
December 1, 2006;
56(12):
2909 - 2913.
[Abstract][Full Text][PDF]
A. Lopez-Cortes, M.-L. Fardeau, G. Fauque, C. Joulian, and B. Ollivier Reclassification of the sulfate- and nitrate-reducing bacterium Desulfovibrio vulgaris subsp. oxamicus as Desulfovibrio oxamicus sp. nov., comb. nov.
Int J Syst Evol Microbiol,
July 1, 2006;
56(Pt 7):
1495 - 1499.
[Abstract][Full Text][PDF]
V. A. Luna, D. S. King, K. K. Peak, F. Reeves, L. Heberlein-Larson, W. Veguilla, L. Heller, K. E. Duncan, A. C. Cannons, P. Amuso, et al. Bacillus anthracis Virulent Plasmid pX02 Genes Found in Large Plasmids of Two Other Bacillus Species.
J. Clin. Microbiol.,
July 1, 2006;
44(7):
2367 - 2377.
[Abstract][Full Text][PDF]
X. Shao, V. Shepelev, and A. Fedorov Bioinformatic analysis of exon repetition, exon scrambling and trans-splicing in humans
Bioinformatics,
March 15, 2006;
22(6):
692 - 698.
[Abstract][Full Text][PDF]
A. Birkland and G. Yona BIOZON: a hub of heterogeneous biological data
Nucleic Acids Res.,
January 1, 2006;
34(suppl_1):
D235 - D242.
[Abstract][Full Text][PDF]
M. E. Garcia-Ojeda, S. Dejbakhsh-Jones, D. Chatterjea-Matthes, A. Mukhopadhyay, A. BitMansour, I. L. Weissman, J. M. Y. Brown, and S. Strober Stepwise Development of Committed Progenitors in the Bone Marrow That Generate Functional T Cells in the Absence of the Thymus
J. Immunol.,
October 1, 2005;
175(7):
4363 - 4373.
[Abstract][Full Text][PDF]
P. H. Summanen, B. Durmaz, M.-L. Vaisanen, C. Liu, D. Molitoris, E. Eerola, I. M. Helander, and S. M. Finegold Porphyromonas somerae sp. nov., a Pathogen Isolated from Humans and Distinct from Porphyromonas levii
J. Clin. Microbiol.,
September 1, 2005;
43(9):
4455 - 4459.
[Abstract][Full Text][PDF]
D. R. Clutterbuck, A. Leroy, M. A. O'Connell, and C. A. M. Semple A bioinformatic screen for novel A-I RNA editing sites reveals recoding editing in BC10
Bioinformatics,
June 1, 2005;
21(11):
2590 - 2595.
[Abstract][Full Text][PDF]
T. Z. Sen, R. L. Jernigan, J. Garnier, and A. Kloczkowski GOR V server for protein secondary structure prediction
Bioinformatics,
June 1, 2005;
21(11):
2787 - 2788.
[Abstract][Full Text][PDF]
K. B. Sheehan, J. M. Henson, and M. J. Ferris Legionella Species Diversity in an Acidic Biofilm Community in Yellowstone National Park
Appl. Envir. Microbiol.,
January 1, 2005;
71(1):
507 - 511.
[Abstract][Full Text][PDF]
M. Bonilla Salinas, M.-L. Fardeau, P. Thomas, J.-L. Cayol, B. K. C. Patel, and B. Ollivier Mahella australiensis gen. nov., sp. nov., a moderately thermophilic anaerobic bacterium isolated from an Australian oil well
Int J Syst Evol Microbiol,
November 1, 2004;
54(6):
2169 - 2173.
[Abstract][Full Text][PDF]
S. Thierry, H. Macarie, T. Iizuka, W. Geissdorfer, E. A. Assih, M. Spanevello, F. Verhe, P. Thomas, R. Fudou, O. Monroy, et al. Pseudoxanthomonas mexicana sp. nov. and Pseudoxanthomonas japonensis sp. nov., isolated from diverse environments, and emended descriptions of the genus Pseudoxanthomonas Finkmann et al. 2000 and of its type species
Int J Syst Evol Microbiol,
November 1, 2004;
54(6):
2245 - 2255.
[Abstract][Full Text][PDF]
M. B. Salinas, M.-L. Fardeau, J.-L. Cayol, L. Casalot, B. K. C. Patel, P. Thomas, J.-L. Garcia, and B. Ollivier Petrobacter succinatimandens gen. nov., sp. nov., a moderately thermophilic, nitrate-reducing bacterium isolated from an Australian oil well
Int J Syst Evol Microbiol,
May 1, 2004;
54(3):
645 - 649.
[Abstract][Full Text][PDF]
M.-L. Fardeau, M. B. Salinas, S. L'Haridon, C. Jeanthon, F. Verhe, J.-L. Cayol, B. K. C. Patel, J.-L. Garcia, and B. Ollivier Isolation from oil reservoirs of novel thermophilic anaerobes phylogenetically related to Thermoanaerobacter subterraneus: reassignment of T. subterraneus, Thermoanaerobacter yonseiensis, Thermoanaerobacter tengcongensis and Carboxydibrachium pacificum to Caldanaerobacter subterraneus gen. nov., sp. nov., comb. nov. as four novel subspecies
Int J Syst Evol Microbiol,
March 1, 2004;
54(2):
467 - 474.
[Abstract][Full Text][PDF]
C. Cravo-Laureau, R. Matheron, J.-L. Cayol, C. Joulian, and A. Hirschler-Rea Desulfatibacillum aliphaticivorans gen. nov., sp. nov., an n-alkane- and n-alkene-degrading, sulfate-reducing bacterium
Int J Syst Evol Microbiol,
January 1, 2004;
54(1):
77 - 83.
[Abstract][Full Text][PDF]
E. Miranda-Tello, M.-L. Fardeau, P. Thomas, F. Ramirez, L. Casalot, J.-L. Cayol, J.-L. Garcia, and B. Ollivier Petrotoga mexicana sp. nov., a novel thermophilic, anaerobic and xylanolytic bacterium isolated from an oil-producing well in the Gulf of Mexico
Int J Syst Evol Microbiol,
January 1, 2004;
54(1):
169 - 174.
[Abstract][Full Text][PDF]
R. I. Mackie, R. I. Aminov, W. Hu, A. V. Klieve, D. Ouwerkerk, M. A. Sundset, and Y. Kamagata Ecology of Uncultivated Oscillospira Species in the Rumen of Cattle, Sheep, and Reindeer as Assessed by Microscopy and Molecular Approaches
Appl. Envir. Microbiol.,
November 1, 2003;
69(11):
6808 - 6815.
[Abstract][Full Text][PDF]
A. Fedorov, S. Roy, L. Fedorova, and W. Gilbert Mystery of Intron Gain
Genome Res.,
October 1, 2003;
13(10):
2236 - 2241.
[Abstract][Full Text][PDF]
P. Nyvall, E. Corre, C. Boisset, T. Barbeyron, S. Rousvoal, D. Scornet, B. Kloareg, and C. Boyen Characterization of Mannuronan C-5-Epimerase Genes from the Brown Alga Laminaria digitata
Plant Physiology,
October 1, 2003;
133(2):
726 - 735.
[Abstract][Full Text][PDF]
A. S. Ouattara, E. A. Assih, S. Thierry, J.-L. Cayol, M. Labat, O. Monroy, and H. Macarie Bosea minatitlanensis sp. nov., a strictly aerobic bacterium isolated from an anaerobic digester
Int J Syst Evol Microbiol,
September 1, 2003;
53(5):
1247 - 1251.
[Abstract][Full Text][PDF]
E. Miranda-Tello, M.-L. Fardeau, J. Sepulveda, L. Fernandez, J.-L. Cayol, P. Thomas, and B. Ollivier Garciella nitratireducens gen. nov., sp. nov., an anaerobic, thermophilic, nitrate- and thiosulfate-reducing bacterium isolated from an oilfield separator in the Gulf of Mexico
Int J Syst Evol Microbiol,
September 1, 2003;
53(5):
1509 - 1514.
[Abstract][Full Text][PDF]
C. Audiffrin, J.-L. Cayol, C. Joulian, L. Casalot, P. Thomas, J.-L. Garcia, and B. Ollivier Desulfonauticus submarinus gen. nov., sp. nov., a novel sulfate-reducing bacterium isolated from a deep-sea hydrothermal vent
Int J Syst Evol Microbiol,
September 1, 2003;
53(5):
1585 - 1590.
[Abstract][Full Text][PDF]
E. M. Panina, A. A. Mironov, and M. S. Gelfand Comparative genomics of bacterial zinc regulons: Enhanced ion transport, pathogenesis, and rearrangement of ribosomal proteins
PNAS,
August 19, 2003;
100(17):
9912 - 9917.
[Abstract][Full Text][PDF]
B. Knudsen, M. M. Miyamoto, P. J. Laipis, and D. N. Silverman Using Evolutionary Rates to Investigate Protein Functional Divergence and Conservation: A Case Study of the Carbonic Anhydrases
Genetics,
August 1, 2003;
164(4):
1261 - 1269.
[Abstract][Full Text][PDF]
N. Brisbarre, M.-L. Fardeau, V. Cueff, J.-L. Cayol, G. Barbier, V. Cilia, G. Ravot, P. Thomas, J.-L. Garcia, and B. Ollivier Clostridium caminithermale sp. nov., a slightly halophilic and moderately thermophilic bacterium isolated from an Atlantic deep-sea hydrothermal chimney
Int J Syst Evol Microbiol,
July 1, 2003;
53(4):
1043 - 1049.
[Abstract][Full Text][PDF]
B. R. Baldwin, C. H. Nakatsu, and L. Nies Detection and Enumeration of Aromatic Oxygenase Genes by Multiplex and Real-Time PCR
Appl. Envir. Microbiol.,
June 1, 2003;
69(6):
3350 - 3358.
[Abstract][Full Text][PDF]
R. L. Winslow and M. S. Boguski Genome Informatics: Current Status and Future Prospects
Circ. Res.,
May 16, 2003;
92(9):
953 - 961.
[Abstract][Full Text][PDF]
D. Karolchik, R. Baertsch, M. Diekhans, T. S. Furey, A. Hinrichs, Y. T. Lu, K. M. Roskin, M. Schwartz, C. W. Sugnet, D. J. Thomas, et al. The UCSC Genome Browser Database
Nucleic Acids Res.,
January 1, 2003;
31(1):
51 - 54.
[Abstract][Full Text][PDF]
D. Alazard, S. Dukan, A. Urios, F. Verhe, N. Bouabida, F. Morel, P. Thomas, J.-L. Garcia, and B. Ollivier Desulfovibrio hydrothermalis sp. nov., a novel sulfate-reducing bacterium isolated from hydrothermal vents
Int J Syst Evol Microbiol,
January 1, 2003;
53(1):
173 - 178.
[Abstract][Full Text][PDF]
W. C. Ray and C. J. Daniels PACRAT: a database and analysis system for archaeal and bacterial intergenic sequence features
Nucleic Acids Res.,
January 1, 2003;
31(1):
109 - 113.
[Abstract][Full Text][PDF]
A. Fedorov, A. F. Merican, and W. Gilbert Large-scale comparison of intron positions among animal, plant, and fungal genes
PNAS,
December 10, 2002;
99(25):
16128 - 16133.
[Abstract][Full Text][PDF]
W. J. Kent, C. W. Sugnet, T. S. Furey, K. M. Roskin, T. H. Pringle, A. M. Zahler, and a. D. Haussler The Human Genome Browser at UCSC
Genome Res.,
June 1, 2002;
12(6):
996 - 1006.
[Abstract][Full Text][PDF]
O. Vilhelmsson and K. J. Miller Synthesis of Pyruvate Dehydrogenase in Staphylococcus aureus Is Stimulated by Osmotic Stress
Appl. Envir. Microbiol.,
May 1, 2002;
68(5):
2353 - 2358.
[Abstract][Full Text][PDF]
R. I. Aminov, J. C. Chee-Sanford, N. Garrigues, B. Teferedegne, I. J. Krapac, B. A. White, and R. I. Mackie Development, Validation, and Application of PCR Primers for Detection of Tetracycline Efflux Genes of Gram-Negative Bacteria
Appl. Envir. Microbiol.,
April 1, 2002;
68(4):
1786 - 1793.
[Abstract][Full Text][PDF]
A. Fedorov, S. Saxonov, and W. Gilbert Regularities of context-dependent codon bias in eukaryotic genes
Nucleic Acids Res.,
March 1, 2002;
30(5):
1192 - 1197.
[Abstract][Full Text][PDF]
H. Hartman and A. Fedorov The origin of the eukaryotic cell: A genomic investigation
PNAS,
January 17, 2002;
(2002)
32658599.
[Abstract][Full Text][PDF]
I. J. McFadyen, T. G. Metzger, M.G. Paterlini, and D. M. Ferguson Exploring the unique pharmacology of a novel opioid receptor, ZFOR1, using molecular modeling and the `message-address' concept
Protein Eng. Des. Sel.,
December 1, 2001;
14(12):
953 - 960.
[Abstract][Full Text][PDF]
K. R. Hristova, C. M. Lutenegger, and K. M. Scow Detection and Quantification of Methyl tert-Butyl Ether-Degrading Strain PM1 by Real-Time TaqMan PCR
Appl. Envir. Microbiol.,
November 1, 2001;
67(11):
5154 - 5160.
[Abstract][Full Text]
T. SEEMUNGAL, R. HARPER-OWEN, A. BHOWMIK, I. MORIC, G. SANDERSON, S. MESSAGE, P. MACCALLUM, T. W. MEADE, D. J. JEFFRIES, S. L. JOHNSTON, et al. Respiratory Viruses, Symptoms, and Inflammatory Markers in Acute Exacerbations and Stable Chronic Obstructive Pulmonary Disease
Am. J. Respir. Crit. Care Med.,
November 1, 2001;
164(9):
1618 - 1623.
[Abstract][Full Text][PDF]
G. Marais, D. Mouchiroud, and L. Duret Does recombination improve selection on codon usage? Lessons from nematode and fly complete genomes
PNAS,
April 18, 2001;
(2001)
91427698.
[Abstract][Full Text]
A. Fedorov, S. Saxonov, L. Fedorova, and I. Daizadeh Comparison of intron-containing and intron-lacking human genes elucidates putative exonic splicing enhancers
Nucleic Acids Res.,
April 1, 2001;
29(7):
1464 - 1469.
[Abstract][Full Text][PDF]
K. Tan, G. Moreno-Hagelsieb, J. Collado-Vides, and G. D. Stormo A Comparative Genomics Approach to Prediction of New Members of Regulons
Genome Res.,
April 1, 2001;
11(4):
566 - 584.
[Abstract][Full Text]
F. Gorin, M. Hogarth, and M. Gertz The Challenges and Rewards of Integrating Diverse Neuroscience Information
Neuroscientist,
February 1, 2001;
7(1):
18 - 27.
[Abstract][PDF]
R. I. Aminov, N. Garrigues-Jeanjean, and R. I. Mackie Molecular Ecology of Tetracycline Resistance: Development and Validation of Primers for Detection of Tetracycline Resistance Genes Encoding Ribosomal Protection Proteins
Appl. Envir. Microbiol.,
January 1, 2001;
67(1):
22 - 32.
[Abstract][Full Text]
M. Burset, I. A. Seledtsov, and V. V. Solovyev SpliceDB: database of canonical and non-canonical mammalian splice sites
Nucleic Acids Res.,
January 1, 2001;
29(1):
255 - 259.
[Abstract][Full Text][PDF]
A. E. Levine and D. L. Steffen OrCGDB: a database of genes involved in oral cancer
Nucleic Acids Res.,
January 1, 2001;
29(1):
300 - 302.
[Abstract][Full Text][PDF]
M. M. Albà, R. Das, C. A. Orengo, and P. Kellam Genomewide Function Conservation and Phylogeny in the Herpesviridae
Genome Res.,
January 1, 2001;
11(1):
43 - 54.
[Abstract][Full Text]
C. J. Jackson, R. C. Barton, S. L. Kelly, and E. G. V. Evans Strain Identification of Trichophyton rubrum by Specific Amplification of Subrepeat Elements in the Ribosomal DNA Nontranscribed Spacer
J. Clin. Microbiol.,
December 1, 2000;
38(12):
4527 - 4534.
[Abstract][Full Text]
J. A. G. Ochoa de Alda and J. Houmard Genomic survey of cAMP and cGMP signalling components in the cyanobacterium Synechocystis PCC 6803
Microbiology,
December 1, 2000;
146(12):
3183 - 3194.
[Abstract][Full Text][PDF]
J. Andrews, G. G. Bouffard, C. Cheadle, J. Lü, K. G. Becker, and B. Oliver Gene Discovery Using Computational and Microarray Analysis of Transcription in the Drosophila melanogaster Testis
Genome Res.,
December 1, 2000;
10(12):
2030 - 2043.
[Abstract][Full Text]
L. Duret, G. Marais, and C. Biémont Transposons but Not Retrotransposons Are Located Preferentially in Regions of High Recombination Rate in Caenorhabditis elegans
Genetics,
December 1, 2000;
156(4):
1661 - 1669.
[Abstract][Full Text]
B. A. Moffatt, L. Wang, M. S. Allen, Y. Y. Stevens, W. Qin, J. Snider, and K. von Schwartzenberg Adenosine Kinase of Arabidopsis. Kinetic Properties and Gene Expression
Plant Physiology,
December 1, 2000;
124(4):
1775 - 1785.
[Abstract][Full Text]
M. Burset, I. A. Seledtsov, and V. V. Solovyev Analysis of canonical and non-canonical splice sites in mammalian genomes
Nucleic Acids Res.,
November 1, 2000;
28(21):
4364 - 4375.
[Abstract][Full Text][PDF]
T. M. LaPara, C. H. Nakatsu, L. Pantea, and J. E. Alleman Phylogenetic Analysis of Bacterial Communities in Mesophilic and Thermophilic Bioreactors Treating Pharmaceutical Wastewater
Appl. Envir. Microbiol.,
September 1, 2000;
66(9):
3951 - 3959.
[Abstract][Full Text]
R. Roepman, N. Bernoud-Hubac, D. E. Schick, A. Maugeri, W. Berger, H.-H. Ropers, F. P.M. Cremers, and P. A. Ferreira The retinitis pigmentosa GTPase regulator (RPGR) interacts with novel transport-like proteins in the outer segments of rod photoreceptors
Hum. Mol. Genet.,
September 1, 2000;
9(14):
2095 - 2105.
[Abstract][Full Text][PDF]
B. L. Humphreys Electronic Health Record Meets Digital Library: A New Environment for Achieving an Old Goal
J. Am. Med. Inform. Assoc.,
September 1, 2000;
7(5):
444 - 452.
[Abstract][Full Text]
I. S. Kohane Bioinformatics and Clinical Informatics: The Imperative to Collaborate
J. Am. Med. Inform. Assoc.,
September 1, 2000;
7(5):
512 - 516.
[Full Text]
R. L. Hallett, J. P. Clewley, F. Bobet, P. J. McKiernan, and C. G. Teo Characterization of a highly divergent TT virus genome
J. Gen. Virol.,
September 1, 2000;
81(9):
2273 - 2279.
[Abstract][Full Text]
H. Kong, L.-F. Lin, N. Porter, S. Stickel, D. Byrd, J. Posfai, and R. J. Roberts Functional analysis of putative restriction-modification system genes in the Helicobacter pylori J99 genome
Nucleic Acids Res.,
September 1, 2000;
28(17):
3216 - 3223.
[Abstract][Full Text][PDF]
W. J. Kent and A. M. Zahler Conservation, Regulation, Synteny, and Introns in a Large-scale C. briggsae-C. elegans Genomic Alignment
Genome Res.,
August 1, 2000;
10(8):
1115 - 1125.
[Abstract][Full Text]
G. Ferbeyre, V. Bourdeau, M. Pageau, P. Miramontes, and R. Cedergren Distribution of Hammerhead and Hammerhead-like RNA Motifs Through the GenBank
Genome Res.,
July 1, 2000;
10(7):
1011 - 1019.
[Abstract][Full Text]
A. J. Cuticchia High performance computing and medical research
Can. Med. Assoc. J.,
April 1, 2000;
162(8):
1148 - 1149.
[Full Text][PDF]
J. Aach, W. Rindone, and G. M. Church Systematic Management and Analysis of Yeast Gene Expression Data
Genome Res.,
April 1, 2000;
10(4):
431 - 445.
[Abstract][Full Text]
S. H. Bhuiyan, K. Gowda, H. Hotokezaka, and C. Zwieb Assembly of archaeal signal recognition particle from recombinant components
Nucleic Acids Res.,
March 15, 2000;
28(6):
1365 - 1373.
[Abstract][Full Text][PDF]
J.E. Bray, A.E. Todd, F.M.G. Pearl, J.M. Thornton, and C.A. Orengo The CATH Dictionary of Homologous Superfamilies (DHS): a consensus approach for identifying distant structural homologues
Protein Eng. Des. Sel.,
March 1, 2000;
13(3):
153 - 165.
[Abstract][Full Text][PDF]
F. Barloy-Hubler, D. Capela, M. J. Barnett, S. Kalman, N. A. Federspiel, S. R. Long, and F. Galibert High-Resolution Physical Map of the Sinorhizobium meliloti 1021 pSyma Megaplasmid
J. Bacteriol.,
February 15, 2000;
182(4):
1185 - 1189.
[Abstract][Full Text]
M. S. Gelfand, E. V. Koonin, and A. A. Mironov Prediction of transcription regulatory sites in Archaea by a comparative genomic approach
Nucleic Acids Res.,
February 1, 2000;
28(3):
695 - 705.
[Abstract][Full Text][PDF]
K. Rantakokko-Jalava, S. Nikkari, J. Jalava, E. Eerola, M. Skurnik, O. Meurman, O. Ruuskanen, A. Alanen, E. Kotilainen, P. Toivanen, et al. Direct Amplification of rRNA Genes in Diagnosis of Bacterial Infections
J. Clin. Microbiol.,
January 1, 2000;
38(1):
32 - 39.
[Abstract][Full Text]
C. Harger, G. Chen, A. Farmer, W. Huang, J. Inman, D. Kiphart, F. Schilkey, M. P. Skupski, and J. Weller The Genome Sequence DataBase
Nucleic Acids Res.,
January 1, 2000;
28(1):
31 - 32.
[Abstract][Full Text][PDF]
G. Perriere, P. Bessieres, and B. Labedan EMGLib: the Enhanced Microbial Genomes Library (update 2000)
Nucleic Acids Res.,
January 1, 2000;
28(1):
68 - 71.
[Abstract][Full Text][PDF]
C. A. Ball, K. Dolinski, S. S. Dwight, M. A. Harris, L. Issel-Tarver, A. Kasarskis, C. R. Scafe, G. Sherlock, G. Binkley, H. Jin, et al. Integrating functional genomic information into the Saccharomyces Genome Database
Nucleic Acids Res.,
January 1, 2000;
28(1):
77 - 80.
[Abstract][Full Text][PDF]
A. Kumar, K.-H. Cheung, P. Ross-Macdonald, P. S. R. Coelho, P. Miller, and M. Snyder TRIPLES: a database of gene function in Saccharomyces cerevisiae
Nucleic Acids Res.,
January 1, 2000;
28(1):
81 - 84.
[Abstract][Full Text][PDF]
W. J. Kent and A. M. Zahler The Intronerator: exploring introns and alternative splicing in Caenorhabditis elegans
Nucleic Acids Res.,
January 1, 2000;
28(1):
91 - 93.
[Abstract][Full Text][PDF]
C. J. Palm, N. A. Federspiel, and R. W. Davis DAtA: Database of Arabidopsis thaliana Annotation
Nucleic Acids Res.,
January 1, 2000;
28(1):
102 - 103.
[Abstract][Full Text][PDF]
J. A. Blake, J. T. Eppig, J. E. Richardson, M. T. Davisson, and the Mouse Genome Database Group The Mouse Genome Database (MGD): expanding genetic and genomic resources for the laboratory mouse
Nucleic Acids Res.,
January 1, 2000;
28(1):
108 - 111.
[Abstract][Full Text][PDF]
M. Ringwald, J. T. Eppig, J. A. Kadin, J. E. Richardson, and the Gene Expression Database Group GXD: a Gene Expression Database for the laboratory mouse: current status and recent enhancements
Nucleic Acids Res.,
January 1, 2000;
28(1):
115 - 119.
[Abstract][Full Text][PDF]
R. Overbeek, N. Larsen, G. D. Pusch, M. D'Souza, E. S. Jr, N. Kyrpides, M. Fonstein, N. Maltsev, and E. Selkov WIT: integrated system for high-throughput genome sequence analysis and metabolic reconstruction
Nucleic Acids Res.,
January 1, 2000;
28(1):
123 - 125.
[Abstract][Full Text][PDF]
D. R. Maglott, K. S. Katz, H. Sicotte, and K. D. Pruitt NCBI's LocusLink and RefSeq
Nucleic Acids Res.,
January 1, 2000;
28(1):
126 - 128.
[Abstract][Full Text][PDF]
T. Kawashima, S. Kawashima, M. Kanehisa, H. Nishida, and K. W. Makabe MAGEST: MAboya Gene Expression patterns and Sequence Tags
Nucleic Acids Res.,
January 1, 2000;
28(1):
133 - 135.
[Abstract][Full Text][PDF]
P. Rodriguez-Tome and P. Lijnzaad RHdb: the Radiation Hybrid database
Nucleic Acids Res.,
January 1, 2000;
28(1):
146 - 147.
[Abstract][Full Text][PDF]
M. Attimonelli, N. Altamura, R. Benne, A. Brennicke, J. M. Cooper, D. D'Elia, A. d. Montalvo, B. d. Pinto, M. De Robertis, P. Golik, et al. MitBASE : a comprehensive and integrated mitochondrial DNA database. The present status
Nucleic Acids Res.,
January 1, 2000;
28(1):
148 - 152.
[Abstract][Full Text][PDF]
C. Lanave, S. Liuni, F. Licciulli, and M. Attimonelli Update of AMmtDB: a database of multi-aligned Metazoa mitochondrial DNA sequences
Nucleic Acids Res.,
January 1, 2000;
28(1):
153 - 154.
[Abstract][Full Text][PDF]
B. L. Maidak, J. R. Cole, T. G. Lilburn, C. T. Parker Jr, P. R. Saxman, J. M. Stredwick, G. M. Garrity, B. Li, G. J. Olsen, S. Pramanik, et al. The RDP (Ribosomal Database Project) continues
Nucleic Acids Res.,
January 1, 2000;
28(1):
173 - 174.
[Abstract][Full Text][PDF]