Nucleic Acids Research, 2005, Vol. 33, Database issue D29-D33
© 2005, the authors
Nucleic Acids Research, Vol. 33, Database issue © Oxford University Press 2005; all rights reserved
The EMBL Nucleotide Sequence Database
EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
* To whom correspondence should be addressed. Tel: +44 1223 494453; Fax: +44 1223 494468; Email: ckanz{at}ebi.ac.uk
Received September 14, 2004; Revised October 6, 2004; Accepted October 14, 2004
| ABSTRACT |
|---|
|
|
|---|
The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl), maintained at the European Bioinformatics Institute (EBI) near Cambridge, UK, is a comprehensive collection of nucleotide sequences and annotation from available public sources. The database is part of an international collaboration with DDBJ (Japan) and GenBank (USA). Data are exchanged daily between the collaborating institutes to achieve swift synchrony. Webin is the preferred tool for individual submissions of nucleotide sequences, including Third Party Annotation (TPA) and alignments. Automated procedures are provided for submissions from large-scale sequencing projects and data from the European Patent Office. New and updated data records are distributed daily and the whole EMBL Nucleotide Sequence Database is released four times a year. Access to the sequence data is provided via ftp and several WWW interfaces. With the web-based Sequence Retrieval System (SRS) it is also possible to link nucleotide data to other specialist molecular biology databases maintained at the EBI. Other tools are available for sequence similarity searching (e.g. FASTA and BLAST). Changes over the past year include the removal of the sequence length limit, the launch of the EMBLCDSs dataset, extension of the Sequence Version Archive functionality and the revision of quality rules for TPA data.
| INTRODUCTION |
|---|
|
|
|---|
The European Bioinformatics Institute (EBI) is an outstation of the European Molecular Biology Laboratory (EMBL) in Heidelberg, Germany. It is located on the Wellcome Trust Genome Campus near Cambridge, UK.
The mission of the Service Programme at the EBI is the building, maintenance and provision of biological databases and other information services to support data deposition and free access by the scientific community (1).
The EMBL Nucleotide Sequence Database (http://www.ebi.ac.uk/embl/) is Europe's primary nucleotide sequence resource. This database is the European part of an international collaboration with DDBJ (Japan) (2) and GenBank (USA) (3) (INSDC, International Nucleotide Sequence Database Collaboration). Data are exchanged on a daily basis between the collaborating institutes. The data in the EMBL Nucleotide Sequence Database originates from a combination of large-scale genome sequencing projects, direct submissions from individual scientists and the European Patent Office. There is a quarterly release of the whole database and new and updated records are distributed daily.
Over the last year, the size of EMBL Nucleotide Sequence Database has increased from 27.2 million entries in Release 76, September 2003 to 42.3 million entries in Release 80, September 2004, of which 4.4 million entries are WGS (Whole Genome Shotgun) data. There are now over 185 000 organisms represented in the database.
In 2004, the limit on sequence length has been dropped, the EMBLCDSs dataset containing all coding sequences annotated in the EMBL Nucleotide Sequence Database was launched, the data collection rules for Third Party Anotation (TPA) data were revised and the functionality of the Sequence Version Archive was extended further.
Other databases provided by the EBI include the protein resource UniProt (4), InterPro, a database of protein families, domains and functional sites (5), the Macromolecular Structure Database E-MSD (6), the automatic genome annotation database Ensembl (7), Genome Reviews, curated versions of complete Genomes from the EMBL Database, the Enzyme database IntEnz (8) and the database for protein interaction data, IntAct (9).
| SUBMISSIONS TO THE EMBL NUCLEOTIDE SEQUENCE DATABASE |
|---|
|
|
|---|
Why is it essential to submit new sequence?
Printing sequence data as part of a publication is neither sensible nor manageable, hence journals prefer to cite only the accession number assigned by the INSD Collaboration. Most journals have a mandatory submission procedure such that papers will only be accepted if they have an accession number. The nucleotide sequence is considered part of the publication and therefore almost all nucleotide sequences are publicly available. Having your sequence in the database means it is readily available to the scientific user community. A repository of primary nucleotide sequence data that is freely accessible is essential for computational analysis and genome research.
How to submit new sequences to the EMBL Nucleotide Sequence Database?
The primary tool for submission of nucleotide sequence data is Webin. For alignment data, it is Webin-Align. Projects with large-scale submissions can open a project account allowing direct updates.
Information for submitters can be found here: http://www.ebi.ac.uk/embl/Documentation/information_for_submitters.html. For submission guidelines please see http://www.ebi.ac.uk/embl/Submission/.
Webin
Webin is the preferred submission tool for nucleotide sequences and biological information. It should also be used for TPA submissions. Webin allows fast submissions of single, multiple and very large numbers of sequences (bulk submissions) and is available at http://www.ebi.ac.uk/embl/Submission/webin.html.
Genome project submissions
Large-scale sequencing projects can open a project account to deposit and update data directly using email or ftp. Groups producing large volumes of sequence data are advised to contact the database at datasubs{at}ebi.ac.uk. More information is available at http://www.ebi.ac.uk/embl/Submission/genomes.html.
Alignment submissions
Webin-Align (10) is the dedicated submission tool for multiple nucleotide and protein alignments. It accepts all common alignment formats and is available at http://www.ebi.ac.uk/embl/Submission/align_top.html.
WGS submissions
WGS data submission is not a continuous processWGS datasets are normally not updated more often than once every few months. Therefore email or ftp accounts are not opened for the submission of WGS data, but submissions are dealt with on a one-by-one basis. Potential submitters are advised to contact the EMBL database at datasubs{at}ebi.ac.uk.
How to update entries in the EMBL Nucleotide Sequence Database?
The editorial rights to an entry in the EMBL Nucleotide Sequence Database remain with the original submitter(s). The EBI team adds value to entries, e.g. via cross-references, but the data itself is archival and is not updated by the EBI. Submitters are advised to update their own entries via the update form (http://www.ebi.ac.uk/embl/webin/update.html).
| DATA IN THE EMBL NUCLEOTIDE SEQUENCE DATABASE |
|---|
|
|
|---|
Data in the EMBL Nucleotide Sequence Database are grouped into divisions, according to either the methodology used in their generation (e.g. EST and HTG divisions) or taxonomic origin of the sequence source (e.g. HUM and PRO divisions). There are also some specialized entry types.
Whole Genome Shotgun (WGS) data
Methods using WGS data are used to gain a large amount of genome coverage for an organism. The sequences of all contigs originating from one experiment are grouped in a set. WGS entries have the standard EMBL format, with accession numbers clearly distinct from those of non-WGS entries. The accession numbers of all entries in each WGS set share the same prefix.
Third Party Annotation (TPA) data
The Third Party Annotation data set was launched in response to requests from the research community to submit entries that include either re-annotation of existing data, or combinations of novel sequence, existing primary sequence, trace archive and WGS data.
To distinguish TPA entries from primary data, the abbreviation TPA appears at the beginning of each description (DE) line and in the keyword list. The link to the primary data information is given in the linetypes AH and AS that have been created for TPA entries. The following flatfile extract is taken from entry BN000024 [GenBank] :
|
Constructed (CON) entries and expanded CONs
CON entries do not contain a sequence but an assembly of contigs, i.e. the sequence is to be constructed from segments of smaller sequences.
The format of a CON entry is similar to that of a standard entry, with the additional CO linetype to accommodate the assembly information. A CON entry does not have any annotation apart from source features.
The following example of an assembly is taken from entry BX470249 [GenBank] :
- CO join(BX640423
[GenBank]
[GenBank]
.1:1..348251,BX640424.1:51. .349146,BX640425.1:51..348257,
- CO BX640426 [GenBank] [GenBank] .1:51..348866,BX640427.1:51..348997,BX640428.1:51..348525,
- CO BX640429 [GenBank] [GenBank] .1:51..344321,BX640430.1:51..348014,BX640431.1:51..347894,
- CO BX640432 [GenBank] [GenBank] .1:51..346301,BX640433.1:51..349305,BX640434.1:51..344805,
- CO BX640435 [GenBank] [GenBank] .1:51..346259,BX640436.1:51..255260)
- CO BX640426 [GenBank] [GenBank] .1:51..348866,BX640427.1:51..348997,BX640428.1:51..348525,
EMBLCDSs dataset
Following requests from database users, a new subset of EMBL data, the EMBLCDSs database, has been created during the last year. Every CDS (coding sequence) feature annotated in EMBL entries is displayed as a single entry.
More details are provided in the New Developments section below.
| ACCESSING THE EMBL NUCLEOTIDE SEQUENCE DATABASE |
|---|
|
|
|---|
The EMBL Nucleotide Sequence Database is available from the EBI via various WWW interfaces, ftp and email (for more information see http://www.ebi.ac.uk/embl/Access).
Sequence Retrieval System (SRS)
The EMBL Nucleotide Sequence Database can be accessed via the EBI SRS server (11,12) at http://srs.ebi.ac.uk/. In SRS, the data are available in the libraries shown in Table 1.
|
WGS data are not represented in a separate library any more, but is part of EMBL (Release) and EMBL (Updates). WGS entries can be identified via the keyword WGS.
SRS also links to other databases, with cross-references to UniProt and publications available online, for example.
FTP Server
Release data, daily updates and cumulative files of all data types can be freely obtained from the ftp server at ftp://ftp.ebi.ac.uk/pub/databases/embl/. Please see the README file for further information.
To create and maintain a local copy of the cumulative file, the syncron tool (ftp://ftp.ebi.ac.uk/pub/software/unix/listtools/) can be used to download automatically newly available incremental data files from the ftp site and to merge them locally.
Dbfetch
Dbfetch (database fetch) is a tool for simple sequence retrieval via http. It can be used to retrieve up to 50 entries from various databases. Dbfetch can be found at http://www.ebi.ac.uk/cgi-bin/dbfetch.
Wsdbfetch provides programmatic access to the Dbfetch functionality. The service is described using Web Services Description Language (WSDL) and uses the Simple Object Access Protocol (SOAP) to communicate with other systems. For further information on Wsdbfetch please see http://www.ebi.ac.uk/Tools/webservices/WSDbfetch.html.
EMBL Sequence Version Archive
The EMBL Sequence Version Archive (SVA) (13) is a repository of all versions of any entry that have been distributed to the public from the EMBL Nucleotide Sequence Database. An interactive web-based interface to the SVA can be accessed at http://www.ebi.ac.uk/cgi-bin/sva/sva.pl.
Entries from the SVA can also be retrieved using dbfetch.
Completed genome sequences
Direct access to completely sequenced genomic components is available via the EBI Genomes server at http://www.ebi.ac.uk/genomes/. At the time of writing (September 2004) there are 162 completed genomes of bacteria, 19 archaea, 36 eukaryota, 540 organelles, 136 phages, 204 plasmids, 903 viruses and 36 viroids available.
Sequence searching
A comprehensive set of sequence analysis and database search algorithms is available at http://www.ebi.ac.uk/Tools/. The most commonly used algorithms available are FASTA (14) and WU-BLAST (15), permitting comparisons between query sequences and the nucleotide, translated nucleotide and protein databases.
Sequence similarity searches are available interactively over the WWW as well as by email. Instructions for email searches can be obtained by sending a message with the word HELP in its body to gpfasta{at}ebi.ac.uk.
Access via email
Data can also be retrieved by email using netserv (netserv{at}ebi.ac.uk). To get started send an email to netserv{at}ebi.ac.uk with HELP in the message body.
| NEW DEVELOPMENTS |
|---|
|
|
|---|
Sequence length limit
In the past, the sequence length of a database record was limited to 350 000 bp. In June 2004, this restriction was lifted and entries of any length are now permitted in the database. Complete genomic units such as entire chromosomes can now be represented in a single entry. To represent unsequenced gaps, the new gap feature is used. Some genomes that were split in the past in order to comply with the 350 000 bp limit have now been updated into single entries, e.g. AE000516 [GenBank] .
Third Party Annotationsnew rules
Following a decision taken at the 2004 Collaborative Meeting, the INSD Collaboration has increased the stringency for acceptance of data into the TPA dataset. The aim is to ensure that the TPA dataset includes the highest quality sequence and biological annotation.
To achieve this aim, the similarity between the TPA sequence and the contributing primary sequences is checked at the time of submission. We aim to achieve a similarity of at least 90%. In addition, there can be no more than 50 bp of the TPA sequence that does not correspond to primary entry(ies). All TPA records are manually curated and checked prior to public release.
To be released into the public TPA dataset, entries must also meet the following requirements:
- The study must have been published in a peer-reviewed journal.
- The study must be supported by biological experimental evidence.
Further details may be found at: http://www.ebi.ac.uk/embl/Documentation/third_party_annotation_dataset.html. and http://www.ebi.ac.uk/webin/webin_help.html.
EMBL Sequence Version Archiveextended functionality
In February 2004, a new batch retrieval functionality has been added to the SVA. Multiple entries can now be retrieved by supplying a list of accession numbers with either entry version number, sequence version number (user-indicated in the interface) or no version details for the most recent entry.
By the end of 2004, expanded CON entries will be included in the SVA.
A warning has been added to report the suppression date for entries that have been suppressed in the database.
EMBLCDSs dataset
Following requests from database users, a new subset of EMBL data, EMBLCDSs database, has been created during the year. Every CDS (coding sequence) feature annotated in EMBL entries is displayed as a single entry.
Entries are presented in an EMBL-like flatfile format, with addition of new line types (Figure 1).
|
The primary identifier of the entry given in the ID line is the protein_id of the CDS feature, the IV (identifier version) line gives protein_id and version. The accession number and sequence version of the parent EMBL entry can be found in the PA line. The DE line is created automatically and comprises the organism and product names. The taxonomic information is taken from the parent entry. The CDS annotation itself contains all qualifiers that belong to the feature, nucleotide locations being given in relation to the parent entry(ies). The nucleotide sequence of the feature is shown last in the entry.
The EMBLCDSs dataset is available via SRS [library: EMBL (Coding Sequences)] and ftp (ftp://ftp.ebi.ac.uk/pub/databases/embl/cds).
Finishing whole genome shotgun sets
Data from the WGS projects where the sequencing and assembling process is finished are moved into the main section of the database. At the time of writing only 5 out of 120 relatively small projects have been finished (example: Nanoarchaeum equitans Kin4-M, WGS project prefix: AACL, newly created entry in the main section: AE017199
[GenBank]
). In all cases, accession numbers of the WGS entries are added as secondary accession numbers to newly created entries in the main section to help track the data.
XML format
The International Nucleotide Sequence Database Collaboration INSDC has adopted a first draft for a common XML format for nucleotide data. The DTD can be found at http://www.ebi.ac.uk/embl/Documentation/DTD/INSDSeq_v1.dtd.txt.
| CITING THE EMBL NUCLEOTIDE SEQUENCE DATABASE |
|---|
|
|
|---|
The preferred form for citation of the EMBL Nucleotide Sequence Database is: Kanz,C. et al. (2005) The EMBL Nucleotide Sequence Database. Nucleic Acids Res., 33, D29D33.
| CONTACTING THE EMBL DATABASE |
|---|
|
|
|---|
Contact by email: data submissions: datasubs{at}ebi.ac.uk; other enquiries: datalib{at}ebi.ac.uk; data updates/publication notifications: update{at}ebi.ac.uk.
Postal address: EMBL Nucleotide Sequence Database, European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK.
Telephone: data submissions, +44 1223 494499; general, +44 1223 494444.
Fax: general, +44 1223 494468.
| Notes |
|---|
The online version of this article has been published under an open access model. Users are entitled to use, reproduce, disseminate, or display the open access version of this article for non-commercial purposes provided that: the original authorship is properly and fully attributed; the Journal and Oxford University Press are attributed as the original place of publication with the correct citation details given; if an article is subsequently reproduced or disseminated not in its entirety but only in part or as a derivative work this must be clearly indicated. For commercial re-use permissions, please contact journals.permissions{at}oupjournals.org.
| REFERENCES |
|---|
|
|
|---|
- Brooksbank,C., Camon,E., Harris,M.A., Magrane,M., Martin,M., Mulder,N., O'Donovan,C., Parkinson,H., Tuli,M., Apweiler,R. et al. ( (2003) ) The European Bioinformatics Institute's data resources. Nucleic Acids Res., , 31, , 4350.
[Abstract/Free Full Text] . - Miyazaki,S., Sugawara,H., Ikeo,K., Gojobori,T. and Tateno,Y. ( (2004) ) DDBJ in the stream of various biological data. Nucleic Acids Res., , 32, , D31D34.
[Abstract/Free Full Text] . - Benson,D.A., Karsch-Mizrachi,I., Lipman,D.J., Ostell,J. and Wheeler,D.L. ( (2004) ) GenBank: update. Nucleic Acids Res., , 32, , D23D26.
[Abstract/Free Full Text] . - Bairoch,A., Apweiler,R., Wu,C.H., Barker,W.C., Boeckmann,B., Ferro,S., Gasteiger,E., Huang,H., Lopez,R., Magrane,M. et al. ( (2005) ) The Universal Protein Resource (UniProt). Nucleic Acids Res., , 33, , D154D159.
[Abstract/Free Full Text] . - Mulder,N.J., Apweiler,R., Attwood,T.K., Bairoch,A., Barrell,D., Bateman,A., Binns,D., Biswas,M., Bradley,P., Bork,P. et al. ( (2003) ) The InterPro Database, 2003 brings increased coverage and new features. Nucleic Acids Res., , 31, , 315318.
[Abstract/Free Full Text] . - Golovin,A., Oldfield,T.J., Tate,J.G., Velankar,S., Barton,G.J., Boutselakis,H., Dimitropoulos,D., Fillon,J., Hussain,A., Henrick,K. et al. ( (2004) ) E-MSD: an integrated data resource for bioinformatics. Nucleic Acids Res., , 32, , D211D216.
[Abstract/Free Full Text] . - Clamp,M., Andrews,D., Barker,D., Bevan,P., Cameron,G., Chen,Y., Clark,L., Cox,T., Cuff,J., Curwen,V. et al. ( (2003) ) Ensembl 2002: accommodating comparative genomics. Nucleic Acids Res., , 31, , 3842.
[Abstract/Free Full Text] . - Fleischmann,A., Darsow,M., Degtyarenko,K., Fleischmann,W., Boyce,S., Axelsen,K.B., Bairoch,A., Schomburg,D., Tipton,K.F. and Apweiler,R. ( (2004) ) IntEnz, the integrated relational enzyme database. Nucleic Acids Res., , 32, , D434D437.
[Abstract/Free Full Text] . - Hermjakob,H., Montecchi-Palazzi,L., Lewington,C., Mudai,S., Kerrien,S., Orchard,S., Vingron,M., Roechert,B., Roepstorff,P. and Apweiler,R. ( (2004) ) IntAct: an open source molecular interaction database. Nucleic Acids Res., , 32, , D452D455.
[Abstract/Free Full Text] . - Lombard,V., Camon,E.B., Parkinson,H.E., Hingamp,P., Stoesser,G. and Redaschi,N. ( (2002) ) EMBL-Align: a new public nucleotide and amino acid multiple sequence alignment database. Bioinformatics, , 18, , 763764.
[Abstract/Free Full Text] . - Zdobnov,E.M., Lopez,R., Apweiler,R. and Etzold,T. ( (2002) ) The EBI SRS servernew features. Bioinformatics, , 18, , 11491150.
[Abstract/Free Full Text] . - Zdobnov,E.M., Lopez,R., Apweiler,R. and Etzold,T. ( (2002) ) The EBI SRS serverrecent developments. Bioinformatics, , 18, , 368373.
[Abstract/Free Full Text] . - Leinonen,R., Nardone,F., Oyewole,O., Redaschi,N. and Stoehr,P. ( (2003) ) The EMBL sequence version archive. Bioinformatics, , 19, , 18611862.
[Abstract/Free Full Text] . - Pearson,W.R. ( (1994) ) Using the FASTA program to search protein and DNA sequence databases. Methods Mol. Biol., , 24, , 307331.[Medline] .
- Lopez,R., Silventoinen,V., Robinson,S., Kibria,A. and Gish,W. ( (2003) ) WU-Blast2 server at the European Bioinformatics Institute. Nucleic Acids Res., , 31, , 37953798.
[Abstract/Free Full Text] .
This article has been cited by other articles:
![]() |
C. M. Song, S. J. Lim, and J. C. Tong Recent advances in computer-aided drug design Brief Bioinform, September 1, 2009; 10(5): 579 - 591. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Wurdemann, B. J. Tindall, R. Pukall, H. Lunsdorf, C. Strompl, T. Namuth, H. Nahrstedt, M. Wos-Oxley, S. Ott, S. Schreiber, et al. Gordonibacter pamelaeae gen. nov., sp. nov., a new member of the Coriobacteriaceae isolated from a patient with Crohn's disease, and reclassification of Eggerthella hongkongensis Lau et al. 2006 as Paraeggerthella hongkongensis gen. nov., comb. nov. Int J Syst Evol Microbiol, June 1, 2009; 59(6): 1405 - 1415. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. P. Davis, C. G. Murphy, C. A. Saraceni-Richards, M. C. Rosenstein, T. C. Wiegers, and C. J. Mattingly Comparative Toxicogenomics Database: a knowledgebase and discovery tool for chemical-gene-disease networks Nucleic Acids Res., January 1, 2009; 37(suppl_1): D786 - D792. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. J. Sherman, T. Martin, M. Nikolski, C. Cayla, J.-L. Souciet, P. Durrens, and for the Genolevures Consortium Genolevures: protein families and synteny among complete hemiascomycetous yeast proteomes and genomes Nucleic Acids Res., January 1, 2009; 37(suppl_1): D550 - D554. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Robinson, M. J. Waller, S. C. Fail, H. McWilliam, R. Lopez, P. Parham, and S. G. E. Marsh The IMGT/HLA database Nucleic Acids Res., January 1, 2009; 37(suppl_1): D1013 - D1017. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Pichlmaier, V. Marwitz, C. Kuhn, M. Niehaus, G. Klein, C. Bara, A. Haverich, and W.-R. Abraham High prevalence of asymptomatic bacterial colonization of rhythm management devices Europace, September 1, 2008; 10(9): 1067 - 1072. [Abstract] [Full Text] [PDF] |
||||
![]() |
W.-R. Abraham, A. J. Macedo, H. Lunsdorf, R. Fischer, S. Pawelczyk, J. Smit, and M. Vancanneyt Phylogeny by a polyphasic approach of the order Caulobacterales, proposal of Caulobacter mirabilis sp. nov., Phenylobacterium haematophilum sp. nov. and Phenylobacterium conjunctum sp. nov., and emendation of the genus Phenylobacterium Int J Syst Evol Microbiol, August 1, 2008; 58(8): 1939 - 1949. [Abstract] [Full Text] [PDF] |
||||
![]() |
R.-M. Wittich, H.-J. Busse, P. Kampfer, A. J. Macedo, M. Tiirola, M. Wieser, and W.-R. Abraham Sphingomonas fennica sp. nov. and Sphingomonas haloaromaticamans sp. nov., outliers of the genus Sphingomonas Int J Syst Evol Microbiol, August 1, 2007; 57(8): 1740 - 1746. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. C. Choi, A. Hobolth, D. M. Robinson, H. Kishino, and J. L. Thorne Quantifying the Impact of Protein Tertiary Structure on Molecular Evolution Mol. Biol. Evol., August 1, 2007; 24(8): 1769 - 1782. [Abstract] [Full Text] [PDF] |
||||
![]() |
R.-M. Wittich, H.-J. Busse, P. Kampfer, M. Tiirola, M. Wieser, A. J. Macedo, and W.-R. Abraham Sphingobium aromaticiconvertens sp. nov., a xenobiotic-compound-degrading bacterium from polluted river sediment Int J Syst Evol Microbiol, February 1, 2007; 57(2): 306 - 310. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. E. Ashelford, N. A. Chuzhanova, J. C. Fry, A. J. Jones, and A. J. Weightman New Screening Software Shows that Most Recent Large 16S rRNA Gene Clone Libraries Contain Chimeras Appl. Envir. Microbiol., September 1, 2006; 72(9): 5734 - 5741. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. A. Wilson, S. D. Gardner, N. M. Lambie, S. A. Commans, and D. J. Crowther Characterization of the human patatin-like phospholipase family J. Lipid Res., September 1, 2006; 47(9): 1940 - 1949. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Grassot, M. Gouy, G. Perriere, and G. Mouchiroud Origin and Molecular Evolution of Receptor Tyrosine Kinases with Immunoglobulin-Like Domains Mol. Biol. Evol., June 1, 2006; 23(6): 1232 - 1241. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Benson, I. Karsch-Mizrachi, D. J. Lipman, J. Ostell, and D. L. Wheeler GenBank Nucleic Acids Res., January 1, 2006; 34(suppl_1): D16 - D20. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Stamm, J.-J. Riethoven, V. Le Texier, C. Gopalakrishnan, V. Kumanduri, Y. Tang, N. L. Barbosa-Morais, and T. A. Thanaraj ASD: a bioinformatics resource on alternative splicing Nucleic Acids Res., January 1, 2006; 34(suppl_1): D46 - D55. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. H. Wu, R. Apweiler, A. Bairoch, D. A. Natale, W. C. Barker, B. Boeckmann, S. Ferro, E. Gasteiger, H. Huang, R. Lopez, et al. The Universal Protein Resource (UniProt): an expanding universe of protein information Nucleic Acids Res., January 1, 2006; 34(suppl_1): D187 - D191. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Whelan, P. I. W. de Bakker, E. Quevillon, N. Rodriguez, and N. Goldman PANDIT: an evolution-centric database of protein and associated nucleotide domains with inferred trees Nucleic Acids Res., January 1, 2006; 34(suppl_1): D327 - D331. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. M. Markowitz, F. Korzeniewski, K. Palaniappan, E. Szeto, G. Werner, A. Padki, X. Zhao, I. Dubchak, P. Hugenholtz, I. Anderson, et al. The integrated microbial genomes (IMG) system Nucleic Acids Res., January 1, 2006; 34(suppl_1): D344 - D348. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Paulsen and A. von Haeseler INVHOGEN: a database of homologous invertebrate genes Nucleic Acids Res., January 1, 2006; 34(suppl_1): D349 - D353. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Lombardot, R. Kottmann, H. Pfeffer, M. Richter, H. Teeling, C. Quast, and F. O. Glockner Megx.net--database resources for marine ecological genomics Nucleic Acids Res., January 1, 2006; 34(suppl_1): D390 - D393. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Sherman, P. Durrens, F. Iragne, E. Beyne, M. Nikolski, and J.-L. Souciet Genolevures complete genomes provide data and tools for comparative genomics of hemiascomycetous yeasts Nucleic Acids Res., January 1, 2006; 34(suppl_1): D432 - D435. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Krull, S. Pistor, N. Voss, A. Kel, I. Reuter, D. Kronenberg, H. Michael, K. Schwarzer, A. Potapov, C. Choi, et al. TRANSPATH(R): an information resource for storing and visualizing signaling pathways and their pathological aberrations Nucleic Acids Res., January 1, 2006; 34(suppl_1): D546 - D551. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Giudicelli, P. Duroux, C. Ginestoux, G. Folch, J. Jabado-Michaloud, D. Chaume, and M.-P. Lefranc IMGT/LIGM-DB, the IMGT(R) comprehensive database of immunoglobulin and T cell receptor nucleotide sequences Nucleic Acids Res., January 1, 2006; 34(suppl_1): D781 - D784. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. E. Ashelford, N. A. Chuzhanova, J. C. Fry, A. J. Jones, and A. J. Weightman At Least 1 in 20 16S rRNA Sequence Records Currently Held in Public Repositories Is Estimated To Contain Substantial Anomalies Appl. Envir. Microbiol., December 1, 2005; 71(12): 7724 - 7736. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Fadiel, I. Anidi, and K. D. Eichenbaum Farm animal genomics and informatics: an update Nucleic Acids Res., November 7, 2005; 33(19): 6308 - 6318. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Goldovsky, P. Janssen, D. Ahren, B. Audit, I. Cases, N. Darzentas, A. J. Enright, N. Lopez-Bigas, J. M. Peregrin-Alvarez, M. Smith, et al. CoGenT++: an extensive and extensible data environment for computational genomics Bioinformatics, October 1, 2005; 21(19): 3806 - 3810. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Abbott, D. M. Aanensen, K. Rutherford, S. Butcher, and B. G. Spratt WebACT--an online companion for the Artemis Comparison Tool Bioinformatics, September 15, 2005; 21(18): 3665 - 3666. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. George, R. V. Spriggs, G. J. Bartlett, A. Gutteridge, M. W. MacArthur, C. T. Porter, B. Al-Lazikani, J. M. Thornton, and M. B. Swindells Inaugural Article: Effective function annotation through catalytic residue conservation PNAS, August 30, 2005; 102(35): 12299 - 12304. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Pillai, V. Silventoinen, K. Kallio, M. Senger, S. Sobhany, J. Tate, S. Velankar, A. Golovin, K. Henrick, P. Rice, et al. SOAP-based services provided by the European Bioinformatics Institute Nucleic Acids Res., July 1, 2005; 33(suppl_2): W25 - W28. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Wu, M. G. Walker, J. Luo, and L. Wei GBA server: EST-based digital gene expression profiling Nucleic Acids Res., July 1, 2005; 33(suppl_2): W673 - W676. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Masseroli, O. Galati, and F. Pinciroli GFINDer: genetic disease and phenotype location statistical analysis and mining of dynamically annotated gene lists Nucleic Acids Res., July 1, 2005; 33(suppl_2): W717 - W723. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Hekkelman and G. Vriend MRS: a fast and compact retrieval system for biological data Nucleic Acids Res., July 1, 2005; 33(suppl_2): W766 - W769. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Brooksbank, G. Cameron, and J. Thornton The European Bioinformatics Institute's data resources: towards systems biology Nucleic Acids Res., January 1, 2005; 33(suppl_1): D46 - D53. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Robinson, M. J. Waller, P. Stoehr, and S. G. E. Marsh IPD--the Immuno Polymorphism Database Nucleic Acids Res., January 1, 2005; 33(suppl_1): D523 - D526. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||









