Nucleic Acids Research Advance Access published online on October 22, 2009
Nucleic Acids Research, doi:10.1093/nar/gkp847
Database Issue |
DDBJ launches a new archive database with analytical tools for next-generation sequence data
Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, Research Organization for Information and Systems, Yata, Mishima 411-8510, Japan
*To whom correspondence should be addressed. Tel: +81 55 981 6859; Fax: +81 55 981 6889; Email: yanakamu{at}genes.nig.ac.jp
Received September 15, 2009. Accepted September 22, 2009.
The DNA Data Bank of Japan (DDBJ) (http://www.ddbj.nig.ac.jp) has collected and released 1 701 110 entries/1 116 138 614 bases between July 2008 and June 2009. A few highlighted data releases from DDBJ were the complete genome sequence of an endosymbiont within protist cells in the termite gut and Cap Analysis Gene Expression tags for human and mouse deposited from the Functional Annotation of the Mammalian cDNA consortium. In this period, we started a novel user announcement service using Really Simple Syndication (RSS) to deliver a list of data released from DDBJ on a daily basis. Comprehensive visualization of a DDBJ release data was attempted by using a word cloud program. Moreover, a new archive for sequencing data from next-generation sequencers, the DDBJ Read Archive (DRA), was launched. Concurrently, for read data registered in DRA, a semi-automatic annotation tool called the DDBJ Read Annotation Pipeline was released as a preliminary step. The pipeline consists of two parts: basic analysis for reference genome mapping and de novo assembly and high-level analysis of structural and functional annotations. These new services will aid users research and provide easier access to DDBJ databases.