Nucleic Acids Research, 2004, Vol. 32, Database issue D50
© 2004 Oxford University Press
HERVd: the Human Endogenous RetroViruses Database: update
Jan Pačes1, Adam Pavlíček1,2, Radek Zika1, Vladimir V. Kapitonov2, Jerzy Jurka2 and Václav Pačes*,1
1 Institute of Molecular Genetics, Academy of Sciences of the Czech Republic, Flemingovo 2, CZ-16637 Prague, Czech Republic and 2 Genetic Information Research Institute, 2081 Landings Drive, Mountain View, CA 94043, USA
*To whom correspondence should be addressed. Tel: +420 2 20183541; Fax: +420 2 24311019; Email: vpaces{at}img.cas.cz
Received September 15, 2003; Accepted September 30, 2003
ABSTRACT
HERVd (http://herv.img.cas.cz) is being extended in two directions. The first is the integration and better classification of families that diverge considerably from typical retroviral genomes, which leads to a more precise identification of members within individual families. The second is better accessibility of the database and its connection with the human genome annotation.
DATABASE DESCRIPTION
The Human Endogenous RetroViruses Database (HERVd) is designed to identify, store, classify and make accessible retrovirus-like elements that are present in the human genome (1). The source data were the output of the human genome project available at NCBI (http://www.ncbi.nlm.nih.gov) (2) and GoldenPath (http://genome.ucsc.edu) (3). Copies of known endogenous retroviruses collected in Repbase Update (4) were detected by RepeatMasker (A. F. Smit and P. Green, unpublished) and processed by the defragmentation algorithm that we developed earlier (1). The database can be searched using several criteria, such as HERV family, chromosomal location or DNA similarity. The sequences, short descriptions and graphic outputs of all entries are available.
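The defragmentation step is not specified here, so the following is only a minimal sketch of the general idea: repeat-detection hits that belong to the same family, lie on the same chromosome and strand, and are separated by a short gap are joined into one element. The `Hit` record, the `defragment` function and the `max_gap` threshold are hypothetical names chosen for illustration, the coordinates are made up, and the algorithm described in (1) may differ substantially.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Hit:
    """One repeat-detection hit (hypothetical record layout)."""
    chrom: str
    start: int
    end: int
    family: str
    strand: str


def defragment(hits: List[Hit], max_gap: int = 1000) -> List[Hit]:
    """Merge hits of the same family/chromosome/strand separated by <= max_gap bp.

    Illustrative assumption only, not the published HERVd algorithm.
    """
    merged: List[Hit] = []
    # Sorting by (chrom, family, strand, start) places candidate fragments
    # of one element next to each other, even if other families intervene.
    ordered = sorted(hits, key=lambda h: (h.chrom, h.family, h.strand, h.start))
    for hit in ordered:
        last = merged[-1] if merged else None
        same_group = (
            last is not None
            and last.chrom == hit.chrom
            and last.family == hit.family
            and last.strand == hit.strand
        )
        if same_group and hit.start - last.end <= max_gap:
            # Extend the previous element to cover this fragment.
            last.end = max(last.end, hit.end)
        else:
            # Start a new element (copy so the input list is not mutated).
            merged.append(Hit(hit.chrom, hit.start, hit.end, hit.family, hit.strand))
    return merged


# Made-up coordinates, for illustration only.
example_hits = [
    Hit("chr1", 100, 500, "HERVK", "+"),
    Hit("chr1", 600, 650, "HERVH", "+"),
    Hit("chr1", 700, 1200, "HERVK", "+"),
    Hit("chr1", 5000, 5600, "HERVK", "+"),
]
elements = defragment(example_hits, max_gap=300)
for element in elements:
    print(element)
```

In this toy input the first two HERVK fragments are joined across the intervening HERVH hit, while the distant third fragment stays a separate element. A real implementation would also have to consider positions within the consensus sequence, which this sketch ignores.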
RECENT DEVELOPMENTS
Our effort over the past year has concentrated on four areas: (i) including nucleotide sequences that have diverged from colinearity with the typical retroviral genome [LTR-gag-pol(pro)-env-LTR], which considerably increases the number of HERV families and the quantity of data; (ii) better classification of HERV families, which increases the quality of the data; (iii) adding both DNA and protein similarity searches; and (iv) creating links to other databases, which improves the accessibility of HERVd and its integration with the human genome annotation.
Data expansion and classification
Classification of HERVs is based on consensus sequences in Repbase Update (deposited by V. V. Kapitonov, A. F. Smit and J. Jurka) and on the published literature, as in the original HERVd (1). New consensus sequences improved the detection of HERVs in the genome. This was especially important for non-autonomous elements that have diverged considerably from the typical retroviral genome. The number of HERV families in the database has more than doubled compared with the original version (1), and the database now contains 150 different families.
Data accessibility
Another improvement is that the database can now be searched with query sequences for DNA and protein similarity using BLAT (5). In addition, we integrated our database with the human genome annotation: for each element, a link to the UCSC Genome Browser (6) is now available.
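As a rough illustration of what a per-element browser link can look like, the sketch below builds a UCSC Genome Browser URL from a chromosomal interval using the browser's `hgTracks` endpoint with its `db` and `position` parameters. The function name `ucsc_link`, the default assembly `hg19` and the example coordinates are assumptions made for this example; the text does not say how HERVd constructs its links or which assembly it targets.

```python
from urllib.parse import urlencode

# Public UCSC Genome Browser track display endpoint.
UCSC_HGTRACKS = "https://genome.ucsc.edu/cgi-bin/hgTracks"


def ucsc_link(chrom: str, start: int, end: int, assembly: str = "hg19") -> str:
    """Return a Genome Browser URL for a 1-based, inclusive interval.

    The assembly default is a placeholder assumption, not HERVd's setting.
    """
    if start < 1 or end < start:
        raise ValueError("expected 1 <= start <= end")
    # Keep ':' unescaped so the position reads as chrom:start-end.
    query = urlencode(
        {"db": assembly, "position": f"{chrom}:{start}-{end}"},
        safe=":",
    )
    return f"{UCSC_HGTRACKS}?{query}"


# Made-up coordinates, for illustration only.
link = ucsc_link("chr7", 1000, 2000)
print(link)
```

Storing only the assembly name and the interval for each element, and generating the URL on demand, keeps such links easy to update when a new genome assembly is adopted.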
ACKNOWLEDGEMENTS
This work was supported by the Center for Integrated Genomics and by grant M023 from the Grant Agency of the Czech Republic.
REFERENCES
- Pačes,J., Pavlíček,A. and Pačes,V. (2002) HERVd: database of human endogenous retroviruses. Nucleic Acids Res., 30, 205–206.
- Wheeler,D.L., Church,D.M., Federhen,S., Lash,A.E., Madden,T.L., Pontius,J.U., Schuler,G.D., Schriml,L.M., Sequeira,E., Tatusova,T.A. et al. (2003) Database resources of the National Center for Biotechnology. Nucleic Acids Res., 31, 28–33.
- Karolchik,D., Baertsch,R., Diekhans,M., Furey,T.S., Hinrichs,A., Lu,Y.T., Roskin,K.M., Schwartz,M., Sugnet,C.W., Thomas,D.J. et al. (2003) The UCSC Genome Browser Database. Nucleic Acids Res., 31, 51–54.
- Jurka,J. (2000) Repbase update: a database and an electronic journal of repetitive elements. Trends Genet., 16, 418–420.
- Kent,W.J. (2002) BLAT - the BLAST-like Alignment Tool. Genome Res., 12, 656–664.
- Kent,W.J., Sugnet,C.W., Furey,T.S., Roskin,K.M., Pringle,T.H., Zahler,A.M. and Haussler,D. (2002) The human genome browser at UCSC. Genome Res., 12, 996–1006.