Skip Navigation



Nucleic Acids Research Advance Access published online on September 18, 2009

Nucleic Acids Research, doi:10.1093/nar/gkp746
This Article
Right arrow Abstract Freely available
Right arrow Print PDF (2794K) Freely available
Right arrow Screen PDF (416K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Yung, P. Y.
Right arrow Articles by Thomas, T.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Yung, P. Y.
Right arrow Articles by Thomas, T.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author(s) 2009. Published by Oxford University Press.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.


Methods Online

Phylogenetic screening of a bacterial, metagenomic library using homing endonuclease restriction and marker insertion

Pui Yi Yung1, Catherine Burke1, Matt Lewis2, Suhelen Egan1, Staffan Kjelleberg1 and Torsten Thomas1,*

1Centre for Marine Bio-Innovation and School of Biotechnology and Biomolecular Sciences, University of New South Wales, 2035 NSW, Australia and 2J. Craig Venter Institute, 9704 Medical Center Drive, Rockville, MD 20850, USA

*To whom correspondence should be addressed. Tel: +61 2 93853467; Fax: +61 2 93851779; Email: t.thomas{at}unsw.edu.au

Received June 1, 2009. Revised July 8, 2009. Accepted August 24, 2009.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 ACCESSION NUMBERS
 FUNDING
 REFERENCES
 
Metagenomics provides access to the uncultured majority of the microbial world. The approaches employed in this field have, however, had limited success in linking functional genes to the taxonomic or phylogenetic origin of the organism they belong to. Here we present an efficient strategy to recover environmental DNA fragments that contain phylogenetic marker genes from metagenomic libraries. Our method involves the cleavage of 23S ribsosmal RNA (rRNA) genes within pooled library clones by the homing endonuclease I-CeuI followed by the insertion and selection of an antibiotic resistance cassette. This approach was applied to screen a library of 6500 fosmid clones derived from the microbial community associated with the sponge Cymbastela concentrica. Several fosmid clones were recovered after the screen and detailed phylogenetic and taxonomic assignment based on the rRNA gene showed that they belong to previously unknown organisms. In addition, compositional features of these fosmid clones were used to classify and taxonomically assign a dataset of environmental shotgun sequences. Our approach represents a valuable tool for the analysis of rapidly increasing, environmental DNA sequencing information.


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 ACCESSION NUMBERS
 FUNDING
 REFERENCES
 
Metagenomics is defined as the study of the total genomic composition of the biota from a particular environment. In recent years metagenomics has revealed many novel phylogenetic and functional properties of microbial systems (1–4) and has provided insight into the microbial communities of seawater (5–7), soil (8,9), biogas plants (10), indoor air handling unit (11) and in animal- or human-associated niches (12–15). One major aim of metagenomics is to define the functional properties of microorganisms, which are yet to be cultured or difficult to isolate, by directly analyzing their genomes from the environmental sample.

Random shotgun sequencing of environmental DNA has established itself as a core methodology in metagenomics and recent advances in high-throughput sequencing technology have made this approach economically feasible and hence widely used (16). Typically, shotgun sequencing data are functionally annotated on the level of individual sequence reads or assembled contigs. The individual read annotation and poor success in assembling complex (i.e. species-rich) samples (17) often results in functional genes being physically detached from genes containing taxonomic information (such as the 16S rRNA gene). This has limited the progress in revealing the phylotype–function relationships for many uncultivated organisms and several computational and laboratory-based approaches have been developed to overcome this situation (18). Computational methods mainly use binning of DNA fragments based on conserved compositional features (such as oligonucletotide or k-mer frequencies) followed by taxonomic classification of bins. Algorithms like PhyloPythia (19), TETRA (20) or TACOA (21) have been successfully employed; however, their performance is heavily dependent on the availability of reference (or training) sequences that are long enough to provide sufficient compositional information to define a classifier for shorter, unknown fragments (typically a few thousand base pairs in length). These reference fragments should ideally also contain well-established and universal markers for detailed phylogenetic analysis and taxonomic assignment. Fosmids and BAC clones containing phylogenetic markers such as the 16S rRNA gene have often been quoted as fitting these requirements for suitable reference sequences (19,20).

Laboratory-based molecular screens for phylogenetic markers from fosmids or BAC clones have been developed and involve PCR- or fluorescent in situ hybridization-based screening of individually arrayed clones or small pools of clones (around 96 individual clones) (22–25). PCR-based screening of metagenomic libraries clones is frequently employed to identify phylogenetic marker genes (26–30). Potential cross-hybridization of the PCR primers or probes with the host background of the library (e.g. Escherichia coli) limits these approaches to either very specific organisms (e.g. targeting defined species) or to groups distantly related to the library host (e.g. screen for archaeal markers in an E. coli host library). Furthermore, these methods are rather labor intensive, potentially quite costly and often reach their practical limits with increasing clone numbers in the library.

Here we present an efficient phylogenetic screening strategy for metagenomic libraries using homing endonuclease restriction and marker insertion (HERMI) (Figure 1). This approach allows rapid selection of clones containing rRNA genes from large pools by employing a restriction digest with the homing endonuclease I-CeuI. The recognition site of I-CeuI is 26-bp long and cuts very specifically in a conserved part of the intron-free 23S rRNA gene to produce a 4-nucleotide (CTAA) 3' overhang (31). The slow evolutionary rate of the recognition site as well as the tolerance of the enzyme for minor sequence changes means that rRNA genes from a wide range of organisms can usually be cut (32). Hence, applying a I-CeuI digest to pooled metagenomic clones should only linearize clones containing 23S rRNA genes. The linearized clones are subsequently isolated by inserting a selectable marker gene (i.e. antibiotic cassette), followed by retransformation and selection from the library pool (Figure 1).


Figure 1
View larger version (19K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 1. Schematic diagram showing the steps involved in the HERMI process.

 

    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 ACCESSION NUMBERS
 FUNDING
 REFERENCES
 
Bacterial strains and media
Escherichia coli EPI300 (Epicentre, Madison, WI, USA) and DH5{alpha} strains were used throughout this study. Cultures were maintained in Luria Bertani medium plus 10% NaCl (LB10) or on LB10 with 1.5% agar solid media. For clones containing only the pCC1FOS vector backbone the media was supplemented with 12.5 µg/ml chloramphenicol. For pGEM:CeuKan clones 50 µg/ml kanamycin and 100 µg/ml ampicillin and for pCC1FOS:HERMI clones 12.5 µg/ml chloramphenicol and 50 µg/ml kanamycin were used.

Construction of pooled fosmid library
Total DNA was extracted from the microbial community associated with the marine sponge Cymbastela concentrica (Thomas et al., unpublished data). A metagenomic clone library was constructed using the CopyControlTM Fosmid Library Production Kit (Epicentre, Madison, WI, USA) according to the manufacturer’s protocol with minor modifications. Briefly, environmental DNA was size fractionated and DNA with molecular weight of 36-40 kb was gel excised followed by an end-repair reaction. The DNA (~1 µg) was ligated with the fosmid vector pCC1FOS followed by packaging the DNA into lambda phage particles, which were used to infect E. coli EPI300 cells. Transformants were plated onto LB10 agar with chloramphenicol and grown at 37°C for 16 h. The resulting library contains approximately 6500 clones with average insert size of 36 kb (data not shown). The E. coli library clones were pooled followed by the addition of 5 volumes of fresh LB10 broth supplemented with chloramphenicol. Induction of the fosmids to high copy number was done by addition of 0.01% (w/v) arabinose and incubating the cultures at 37°C for 4 h. After induction the cells were collected and the fosmid DNA was extracted using the Illustra plasmidPrep Mini Spin Kit (GE Bio-Science Corp, NJ, USA) according to manufacturer’s instructions.

Construction of a kanamycin cassette with flanking I-CeuI recognition sequences
The kanamycin cassette with flanking I-CeuI recognition sequences was generated via a three-step PCR as direct PCR with primers containing the 26-bp I-CeuI recognition sequence caused severe primer–dimer formation. The first round of PCR includes the amplification of the kanamycin resistance cassette from plasmid pACYC177 (NEB, Beverly, MA, USA) with flanking partial I-CeuI recognition sequences. The PCR conditions as per 50 µl reaction were 10 ng of pACYC177 plasmid, 1 x iProof HF Buffer, 200 µM of each dNTP, 0.5 µM of each primer (KanFCeu/KanRCeu) (Table 1) and 0.05 U/µl iProof DNA polymerase (Bio-Rad, Hercules, CA, USA), 1 min/98°C, followed by 10s/98°C, 30 s/53°C, 1 min/72°C, 30 cycles. The second round of PCRs was performed in two separate reactions to add the entire I-CeuI restriction sequence on each of the ends of the cassette; one reaction used the primer set CeuF/KanRCeu and the other used primer set CeuR/KanFCeu (Table 1). PCR conditions were: 3 ng of DNA template from the first PCR, remaining reaction mix composition as for the first-round PCR, 1 min/98°C, followed by 10 s/98°C, 30 s/45°C, 1 min 30 s/72°C, 30 cycles. The two PCR products were then pooled (final volume of 100 µl) and the 1.1 kb product was gel extracted to purify the kanamycin cassette. The third round of PCR involved hybridizations and fill-in reaction steps to generate a PCR product with the full I-CeuI recognition site on both ends of the kanamycin cassette. The conditions were: ~50 µl of gel-extracted DNA from the second round of PCR, 1 x RedTaq PCR buffer, 10 µM of each dNTP and 1 U of RedTaq DNA polymerase (Promega, Madison, WI, USA), 2 min/94°C, 20 s/94°C, 30 s/45°C, 20 s/72°C, 20 cycles. The final PCR product was cloned into pGEM vector (pGEM®-T Easy Vector System, Promega, Madison, WI, USA) according to manufacturer’s instructions followed by the transformation into E. coli DH5{alpha}. Transformants (named pGEM:CeuKan) with the correct insert (after confirmation via restriction digest with I-CeuI) were selected for miniprep purification. The kanamycin cassette with flanking I-CeuI recognition sequences (CeuKan) was gel purified after digestion of pGEM:CeuKan.


View this table:
[in this window]
[in a new window]

 
Table 1. Primers used in this study

 
Homing endonuclease restriction and marker insertion
One microgram of pooled fosmid DNA was digested with the I-CeuI homing endonuclease in a reaction containing two units of I-CeuI enzyme (NEB), 1 x NEB buffer 4, 0.5 µl of BSA (NEB) and sterile, deionized water up to a volume of 50 µl. The digest was incubated at 37°C for 3 h followed by a heat inactivation at 65°C for 20 min. The digested fosmids were dephosphorylated using two units of Antarctic alkaline phosphatase (NEB) according to manufacturer’s instructions. The fosmids were ligated with the CeuKan cassette in 1:30 (vector:insert) molar ratio overnight at room temperature. The ligation mix was transformed into E. coli EPI300 cells via electroporation and cells were recovered at 37°C for 1 h and then plated onto LB10 agar supplemented with appropriate antibiotics to select for pCC1FOS:HERMI clones. Transformants that grew on the selective agar were purified, fosmid DNA was extracted and subjected to denaturing gradient gel electrophoresis (DGGE) analysis, end sequencing and 16S/23S rRNA gene PCR as described below.

Denaturing gradient gel electrophoresis
The 16S rRNA gene was PCR amplified as described by Muyzer et al. (33) with the universal primer GM5FplusGC and 907RC (Table 1). Various DNA samples were used: (i) Pooled fosmid library DNA, (ii) E. coli genomic DNA and (iii) fosmid DNA of individual HERMI clones. PCR conditions were 10 ng of DNA template, 1 x RedTaq buffer, 0.5 µM of each forward and reverse primers, 200 µM of each dNTP, 300 µg of BSA, 1 U of RedTaq DNA polymerase (Promega, Madison, WI, USA), 3 min/96°C, hot start at 80°C, 30 s/94°C, 30 s/57°C, 1 min 10 s/72°C, 25 cycles. The PCR products were cleaned using the QIA quick PCR purification kit (QIAGEN, Hilden, Germany) and the DNA was examined with a DCode DGGE unit (BIO-RAD, Hercules, CA, USA) using the following parameters: 10% acrylamide gel, a denaturant gradient containing 45–60% urea-formamide, 1x TAE buffer, 75 V at 60°C for 16 h. Bands from the DGGE gel were extracted, dialysed overnight at 4°C with 50 µl of molecular grade water and re-amplified using primers GM5F and 907RC for sequencing.

Sequencing and phylogenetic analysis of fosmid clones
End sequencing of the HERMI clones were performed using the primer pair pEpiFosFor and pEpiFosRev (Table 1). PCR amplification and sequencing using universal primers for 23S and 16S rRNA gene (Table 1) were also performed on selected HERMI clones as described previously (34,35). Briefly, PCR conditions were 3 min/94°C, 1 min/94°C, 1 min/57°C, 3 min/72°C, 30 cycles (for 23S PCR) and 3 min/94°C, 80°C hot start, followed by 30 s/94°C, 1 min/50°C, 3 min/72°C, 25 cycles (for 16S PCR). The PCR products were subjected to sequencing using the same primers. Other sequencing reactions were also performed using the KanFSeq and KanRSeq primers (Table 1) to obtain 23S rRNA gene sequence flanking the kanamycin cassette.

The complete sequence of PCR products were obtained and searched with the BLAST algorithm (36) against the NCBI and Silva database (37) and closest representatives were selected, aligned using the Aligner tool provided in Silva, and imported into the Silva 16S rRNA and 23S rRNA database using the ARB program for phylogenetic tree construction (38). Maximum likelihood trees were constructed with default parameters.

Whole fosmid sequencing and analysis
HERMI fosmid clones were shotgun sequenced as outlined in Rusch et al. (5). Shotgun reads were processed and assembled using Phred/Phrap and the assembly was manually checked in Consed (39). Binning of the fosmid sequences and an assembled shotgun-reads dataset from the bacterial community of C. concentrica (Thomas et al., unpublished data) was done according to the strategy outlined by Woyke et al. (40). Briefly, tetranucleotide patterns were determined using TETRA (20) and exported as normalized Z-scores for fosmids and scaffolds of the shotgun datasets longer than 20 kb. Clustering was performed with Euclidian distance and complete linkage using the software Cluster 3.0 and visualized with JavaTreeView (41).


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 ACCESSION NUMBERS
 FUNDING
 REFERENCES
 
Specificity of the HERMI screen
Applying the HERMI method in a single-tube reaction to a pooled library of approximately 6500 fosmid clones derived from the microbial community associated with the sponge C. concentrica resulted in 52 kanamycin-resistant transformants. End sequencing of these clones rejected 40 as being sister clones leaving 12 for further analysis. PCR amplification and sequencing of the region flanking the CeuKan cassette showed that none of the clones contains more than one CeuKan cassette. In addition, all insertion sites have similarity to the 23S rRNA gene indicating that no unspecific insertion had taken place. The I-CeuI recognition sequences for four of the HERMI clones that contained unique phylotype (HERMI06, 11, 16 and 30) were compared with the I-CeuI recognition sequence described in (32). This demonstrated that HERMI06, 11 and 30 have perfect matches to the recognition sequence (Figure 2), while HERMI16 had base pair insertions between positions 4–5, 5–6, 6–7 and 7–8 (respective to the original recognition sequence). This indicates that base pair insertions in these positions of the recognition sequence do not abolish I-CeuI cleavage activity.


Figure 2
View larger version (34K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 2. Alignment (5' to 3') of the I-CeuI recognition sites with insertion sites of HERMI clones and different sequences from the Silva LSU 23S rRNA database. Black triangle represents the cleavage site for I-CeuI and where the CeuKan marker was inserted for the HERMI clones. Letters underlined in the recognition sequence of HERMI16 indicate basepair insertions into the recognition sequence while letters underlined for other sequences highlight mismatches in positions that have previously been shown to negatively impact I-CeuI enzymatic activity.

 
To evaluate the phylogenetic spectrum of sequences that can be cleaved by I-CeuI, we compared the 26-bp recognition sequence to the entire bacterial 23S rRNA Silva database (37) with various thresholds (no mismatch, 1 mismatch and 2 or more mismatches). Partial sequences (those did not cover the entire region of the I-CeuI recognition sequence), mitochondrial and chloroplast DNA, and sequences belonging to the actinobacteria phyla [previously shown not to be cleaved by I-CeuI (42)] were excluded. We also considered sequences containing mismatches that were previously shown to abolish the activity of the enzyme in our comparison (31). Our analysis showed that only 1.1% of the 23S rRNA gene sequences (59 sequences out of 5525 sequences) contain mismatches that can potentially abolish the enzymatic activity. Phylotypes that might be excluded from our screening approach include a number of species within the genus Chlamydophila and Chlamydia, the unclassified gamma-proteobacteria Candidatus Carsonella ruddii, the Candidate division TM7 group GTL1 and a number of species in the genus Chloroflexi and Roseiflexus (all one mismatch) as well as some species within the Planctomycete, Coxiella burnetii and Francisella tularensis subsp. (two or more mismatches) (Figure 2). This updated, theoretical consideration is consistent with previous observations (43) and given the degree of mismatch tolerance described for homing endonucleases (44,45), we estimate that our HERMI strategy is capable of capturing at least 98% of the known diversity from a wide phylogenetic spectrum of 23S rRNA gene sequences.

Phylogenetic analysis of HERMI clones from C. concentrica
The 12 unique HERMI clones (defined as clones having unique fosmid end sequences) were subjected to 16S and 23S rRNA gene sequencing and their phylogenetic identities were examined. Based on 16S rRNA sequences four unique phylogenetic groups (with a percentage identity cut-off of 99%) were identified from these 12 clones. These four unique 16S rRNA gene sequences (using clone HERMI06, 11, 16 and 30 as representatives) were also found by band sequencing and migration distance to correspond to the four dominant bands of a 16S rRNA gene DGGE analysis of the pooled fosmids. This indicates that we have captured the major phylotypes present in the library (Figure 3). One minor band of the pooled fosmids was found to belong to the group Actinobacteria, which is known not to be cleaved by the I-CeuI enzyme (see above).


Figure 3
View larger version (35K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 3. DGGE gel of the 16S rRNA gene of a pooled fosmid library (lane 1), E. coli genomic DNA (lane 2) and HERMI clones (lanes 3–6). Individual bands were excised and sequenced; the taxonomic assignment of the band is shown next to the arrow.

 
A maximum likelihood tree was constructed for the 16S rRNA sequences of the HERMI clones (Figure 4) and shows that all sequences (except HERMI16, see below) are related to uncultured organisms and at least 7.5% divergent from their nearest cultured bacterium. Taxonomic classification assigned them to novel clades within the {alpha}- and {gamma}-proteobacteria group (HERMI06 and HERMI11) and a Mesorhizobium-related group (HERMI30). The 16S rRNA sequence of HERMI16 is identical to E. coli, but further analysis of this fosmid showed that only the 23S rRNA was cloned and that the 16S rRNA sequence resulted from amplification of E. coli host background. A maximum likelihood tree was also constructed for the 23S rRNA gene of the HERMI clones (Figure 5) and their closest representatives imported from the 23S rRNA Silva database (37). Here the HERMI clones are at least 11% divergent to their closest cultured representatives. The longer branch length in this tree to the next related sequences is a reflection of sparseness of available 23S rRNA gene data, but nevertheless allows us to link the 16S rRNA with 23S rRNA gene phylogeny (Figure 5).


Figure 4
View larger version (20K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 4. Maximum likelihood tree of the 16S rRNA gene for the HERMI clones (bold). and the closest representatives of cultured (from the RDP database, underlined) and uncultured organisms (from Silva SSU database) with accession numbers preceding the name of the organism. Scale bar indicates 0.025 sequence divergence.

 

Figure 5
View larger version (30K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 5. Maximum likelihood tree of the 23S rRNA gene for the HERMI clones (bold) and the closest representatives from the Silva 23S LSU database. Underlined sequence highlights closest cultured representatives. Scale bar indicates 0.025 sequence divergence.

 
Binning and taxonomic assignment of shotgun metagenomic sequences
To illustrate the value of whole fosmid sequences containing 16S or 23S rRNA genes for the assignment of unknown DNA fragments, we clustered the whole sequence of HERMI11 with sequences longer than 20 kb from an assembled shotgun sequence dataset of the bacterial community of C. concentrica. As illustrated in Figure 6, the HERMI11 clone sequence fell into a cluster of unknown sequences (red branches and yellow box) with a high correlation index (0.884). We therefore can now assign the shotgun fragments in this cluster to the novel phylotype within the {gamma}-proteobacteria and link this taxon with 372 predicted protein contained within these DNA sequences.


Figure 6
View larger version (66K):
[in this window]
[in a new window]
[Download PowerPoint slide]
 
Figure 6. Clustering of HERMI11 fosmid sequence with assembled shotgun sequences of unknown taxonomic assignment. Arrow indicates the position of HERMI11 in a cluster of sequences (correlation of 0.884) that is highlighted by red branches and a yellow box. Cyan and red colors in the data matrix represent over- and under-representation of tetranucleotide Z-scores.

 

    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 ACCESSION NUMBERS
 FUNDING
 REFERENCES
 
In this study we have shown that HERMI is a suitable tool for screening a metagenomic library for clones with phylogenetic markers (i.e. the 23S rRNA gene). Handling time for the HERMI procedure is less than 2 h and allows for the simultaneous screening of thousands of clones in a single-tube reaction, hence making our approach faster and more convenient than other phylogenetic marker screens (22–25). We also note that the selectable marker applied in this study (i.e. kanamycin resistance marker) could be replaced with any marker cassette. For example, a green fluorescent protein cassette could be employed allowing for subsequent screening of clones with fluorescence-activated cell sorting (FACS).

Nesbø et al. (42) also recently used I-CeuI to clone fragments containing 23S rRNA genes from environmental DNA samples. Their strategy involved the construction of a specialized vector that allowed for the insertion of DNA digested with a blunt-end restriction enzyme and I-CeuI. However, their approach did not yield full-length 23S and 16S rRNA genes, limiting the phylogenetic and taxonomic value of their clones. The HERMI approach does not have this limitation and, as 16S and 23S rRNA genes are often tightly clustered together, will frequently yield clones with 16S rRNA information, for which more comprehensive databases are currently available. We also note that the linkage of 16S and 23S rRNA information will also help to improve annotation of unknown clades in current 23S rRNA databases, as illustrated by the phylogenetic analysis in our study (Figure 4 and 5). Another benefit of the HERMI strategy is that it can be applied retrospectively to any existing environmental DNA library, many of which have so far only been screened for expressed, functional genes (46–48). Screening those libraries for phlyogenetic markers will generate more information about compositional-features of the DNA from uncultured organisms, which in turn will improve our ability to classify bins of environmental DNA sequences (Figure 6) and hence reveal more phylotype–function relationship.

Sequence analysis of the 26-bp recognition site in the Silva 23S rRNA database indicated that only a small proportion (<2%) of all currently known 23S rRNA genes might not be captured with our approach. Importantly, this limitation can be quantified for each sample prior to the screening, for example, by constructing a 23S or 16S rRNA gene PCR library and determine its phylogenetic composition. This is typically done as a first step for most environmental diversity studies (49) and hence would not impose an additional work load.

Finally, homing endonucleases are not only restricted to cleaving the 23S rRNA genes. Arnould and coworkers have recently demonstrated that a semi-rational design can yield I-CreI mutant homing endonucleases with highly specific recognition patterns, which are distinct from the wild-type activity (50). As such other homing endonuclease mutants are available or could be designed that target other common genes used for phylogenetic analysis (e.g. the 16S rRNA gene, recA), functional genes or non-coding regions, and which could then be applied in a analogous screening strategy as described here. This will further enhance the utility and versatility of the HERMI approach.


    ACCESSION NUMBERS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 ACCESSION NUMBERS
 FUNDING
 REFERENCES
 
GQ160460–GQ160467.


    FUNDING
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 ACCESSION NUMBERS
 FUNDING
 REFERENCES
 
The Gordon and Betty Moore Foundation and the Australian Research Council (LP0668235). Funding for open access charge: internal funds from the Centre for Marine Bio-Innovation.

Conflict of interest statement. None declared.


    ACKNOWLEDGEMENTS
 
The authors thank Robert Friedman and acknowledge the J. Craig Venter Institute (JCVI) Joint Technology Center, under the leadership of Yu-Hui Rogers, for producing whole-fosmid sequence data. The shotgun sequencing of Cymbastela concentrica is available at GenBank under GenomeProject ID 34751 and at the Community Cyberinfrastructure for Advanced Marine Microbial Ecology Research and Analysis (CAMERA) website (http://camera.calit2.net/). The ribosomal RNA gene and fosmid sequences reported here have been deposited in Genbank under accessions GQ160459–GQ160467.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 ACCESSION NUMBERS
 FUNDING
 REFERENCES
 

  1. Ferrer M, Beloqui A, Timmis KN, Golyshin PN. Metagenomics for mining new genetic resources of microbial communities. J. Mol. Microbiol. Biotechnol. (2009) 16:109–123.[CrossRef][Web of Science][Medline]

  2. Handelsman J. Metagenomics: application of genomics to uncultured microorganisms. Microbiol. Mol. Biol. Rev. (2004) 68:669–685.[Abstract/Free Full Text]

  3. Li X, Qin L. Metagenomics-based drug discovery and marine microbial diversity. Trends Biotechnol. (2005) 23:539–543.[CrossRef][Web of Science][Medline]

  4. Streit WR, Schmitz RA. Metagenomics - the key to the uncultured microbes. Curr. Opin. Microbiol. (2004) 7:492–498.[CrossRef][Web of Science][Medline]

  5. Rusch DB, Halpern AL, Sutton G, Heidelberg KB, Williamson S, Yooseph S, Wu D, Eisen JA, Hoffman JM, Remington K, et al. The Sorcerer II Global Ocean Sampling expedition: northwest Atlantic through eastern tropical Pacific. PLoS Biol. (2007) 5:e77:0398–0431.

  6. Venter JC, Remington K, Heidelberg JF, Halpern AL, Rusch D, Eisen JA, Wu D, Paulsen I, Nelson KE, Nelson W, et al. Environmental genome shotgun sequencing of the Sargasso Sea. Nature (2004) 304:66–74.

  7. Martin-Cuadrado AB, Rodriguez-Valera F, Moreira D, Alba JC, Ivars-Martínez E, Henn MR, Talla E, López-García P. Hindsight in the relative abundance, metabolic potential and genome dynamics of uncultivated marine archaea from comparative metagenomic analyses of bathypelagic plankton of different oceanic regions. ISME J. (2008) 2:865–886.[CrossRef][Web of Science][Medline]

  8. Allen HK, Moe LA, Rodbumrer J, Gaarder A, Handelsman J. Functional metagenomics reveals diverse beta-lactamases in a remote Alaskan soil. ISME J. (2009) 3:243–251.[CrossRef][Web of Science][Medline]

  9. Daniel R. The metagenomics of soil. Nat. Rev. Microbiol. (2005) 3:470–478.[CrossRef][Web of Science][Medline]

  10. Schlüter A, Bekel T, Diaz NN, Dondrup M, Eichenlaub R, Gartemann KH, Krahn I, Krause L, Krömeke H, Kruse O, et al. The metagenome of a biogas-producing microbial community of a production-scale biogas plant fermenter analysed by the 454-pyrosequencing technology. J. Biotechnol. (2008) 136:77–90.[CrossRef][Web of Science][Medline]

  11. Tringe SG, Zhang T, Liu X, Yu Y, Lee WH, Yap J, Yao F, Suan ST, Ing SK, Haynes M, et al. The airborne metagenome in an indoor urban environment. PLoS ONE (2008) 3:e1862.[CrossRef][Medline]

  12. Fieseler L, Quaiser A, Schleper C, Hentschel U. Analysis of the first genome fragment from the marine sponge-associated, novel cadidate phylum Poribacteria by environmental genomics. Environ. Microbiol. (2006) 8:612–624.[CrossRef][Medline]

  13. Jones BV, Begley M, Hill C, Gahan CG, Marchesi JR. Functional and comparative metagenomic analysis of bile salt hydrolase activity in the human gut microbiome. Proc. Natl Sci. USA (2008) 105:13580–13585.[Abstract/Free Full Text]

  14. Warnecke F, Luginbühl P, Ivanova N, Ghassemian M, Richardson T, Stege J, Cayouette M, McHardy A, Djordjevic G, Aboushadi N, et al. Metagenomic and functional analysis of hindgut microbiota of a wood-feeding higher termite. Nature (2007) 450:560–565.[CrossRef][Web of Science][Medline]

  15. Dinsdale E, Edwards R, Hall D, Angly F, Breitbart M, Brulc J, Furlan M, Desnues C, Haynes M, Li L, et al. Functional metagenomic profiling of nine biomes. Nature (2008) 452:629–632.[CrossRef][Web of Science][Medline]

  16. Mardis E. Next-generation DNA sequencing methods. Annu. Rev. Genomics Hum Genet. (2008) 9:387–402.[CrossRef][Medline]

  17. Tringe S, von Mering C, Kobayashi A, Salamov A, Chen K, Chang H, Podar M, Short J, Mathur E, Detter J, et al. Comparative metagenomics of microbial communities. Science (2005) 308:554–557.[Abstract/Free Full Text]

  18. Kunin V, Copeland A, Lapidus A, Mavromatis K, Hugenholtz P. A bioinformatician's; guide to metagenomics. Microbiol. Mol. Biol. Rev. (2008) 72:557–578.[Abstract/Free Full Text]

  19. McHardy AC, Martin HG, Tsirigos A, Hugenholtz P, Rigoutsos I. Accurate phylogenetic classification of variable-length DNA fragments Nat. Methods (2007) 4:63–72.

  20. Teeling H, Waldman J, Lombardot T, Bauer M, Glöckner F. TETRA: a web-service and a stand-alone program for the analysis and comparison of tetranucleotide usage patterns in DNA sequences. BMC Bioinformatics (2004) 5:163.[CrossRef][Medline]

  21. Diaz N, Krause L, Goesmann A, Niehaus K, Nattkemper T. TACOA: taxonomic classification of environmental genomic fragments using a kernelized nearest neighbor approach. BMC Bioinformatics (2009) 10:56.[CrossRef][Medline]

  22. Beja O, Suzuki MT, Koonin EV, Aravind L, Hadd A, Nguyen LP, Villacorta R, Amjadi M, Garrigues C, Jovanovich SB, et al. Construction and analysis of bacterial artificial chromosome libraries from a marine microbial assemblage. Environ. Microbiol. (2000) 2:516–529.[CrossRef][Medline]

  23. Moreira D, Rodríguez-Valera F, López-García P. Metagenomic analysis of mesopelagic Antarctic plankton reveals a novel deltaproteobacterial group. Microbiology (2006) 152:505–517.[Abstract/Free Full Text]

  24. Stein JL, Marsh TL, Yu KY, Shizuya H, DeLong EF. Characterization of uncultivated prokaryotes: isolation and analysis of a 40-kilobase-pair genome fragment from a planktonic marine archaeon. J. Bacteriol. (1996) 178:591–599.[Abstract/Free Full Text]

  25. Liles MR, Manske BF, Bintrim SB, Handelsman J, Goodman RM. A census of rRNA genes and linked genomic sequences within a soil metagenomic library. Appl. Environ. Microbiol. (2003) 69:2684–2691.[Abstract/Free Full Text]

  26. Gilbert JA, Mühling M, Joint I. A rare SAR11 fosmid clone confirming genetic variability in the ‘Candidatus Pelagibacter ubique’ genome. ISME J. (2008) 2:106–116.

  27. Meng J, Wang F, Zheng Y, Peng X, Zhou H, Xiao X. An uncultivated crenarchaeota contains functional bacteriochlorophyll a synthase. ISME J. (2009) 3:106–116.[CrossRef][Web of Science][Medline]

  28. Martín-Cuadrado A, López-García P, Alba J, Moreira D, Monticelli L, Strittmatter A, Gottschalk G, Rodríguez-Valera F. Metagenomics of the deep Mediterranean, a warm bathypelagic habitat. PLoS ONE (2007) 2:e914.[CrossRef][Medline]

  29. Quaiser A, López-García P, Zivanovic Y, Henn M, Rodriguez-Valera F, Moreira D. Comparative analysis of genome fragments of Acidobacteria from deep Mediterranean plankton. Environ. Micro. (2008) 10:2704–2717.[CrossRef]

  30. Massana R, Karniol B, Pommier T, Bodaker I, Béjà O. Metagenomic retrieval of a ribosomal DNA repeat array from an uncultured marine alveolate. Environ. Micro. (2008) 10:1335–1343.[CrossRef]

  31. Marshall P, Lemieux C. The I-CeuI endonuclease recognizes a sequnce of 19 base pairs and preferentially cleaves the coding strand of the Chlamydomonas moewusii chloroplast large subunit rRNA gene. Nucleic Acids Res. (1992) 20:6401–6407.[Abstract/Free Full Text]

  32. Marshall P, Lemieux C. Cleavage pattern of the homing endomuclease encoded by the fifth intron in the chloroplast large subunit rRNA-encoding gene of Chlamydomonas eugametos. Gene (1991) 104:241–245.[CrossRef][Web of Science][Medline]

  33. Muyzer G, Brinkhoff T, Nübel U, Santegoeds C, Schäfer H, Wawer C. Molecular Microbial Ecology Manual—Akkermans A.D.L., van Elsas JD, de Bruijn F.J., eds. (1997) Vol. 3.4.4. Dordrecht, The Netherlands: Kluwer. 1–27.

  34. Hunt DE, Klepac-Ceraj V, Acinas SG, Gautier C, Bertilsson S, Polz MF. Evaluation of 23S rRNA PCR primers for use in phylogenetic studies of bacterial diversity. Appl. Environ. Microbiol. (2006) 72:2221–2225.[Abstract/Free Full Text]

  35. Lane DJ. 16S/23S rRNA sequencing. In: Nucleic Acid Techniques in Bacterial Systematics—Stachebrandt E, Goodfellow M, eds. (1991) New York: Wiley, Chichester. 115–175.

  36. Altschul S, Gish W, Miller W, Myers E, Lipman D. Basic local alignment search tool. J. Mol. Biol. (1990) 215:403–410.[CrossRef][Web of Science][Medline]

  37. Pruesse E, Quast C, Knittel K, Fuchs B, Ludwig W, Peplies J, Glöckner FO. SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nucleic Acids Res. (2007) 35:7188–7196.[Abstract/Free Full Text]

  38. Ludwig W, Strunk O, Westram R, Richter L, Meier H, Yadhukumar, Buchner A, Lai T, Steppi S, Jobb G, et al. ARB: a software environment for sequence data. Nucleic Acids Res. (2004) 32:1363–1371.[Abstract/Free Full Text]

  39. Gordon D, Abajian C, Green P. Consed: a graphical tool for sequence finishing. Genome Res. (1998) 8:195–202.[Abstract/Free Full Text]

  40. Woyke T, Teeling H, Ivanova N, Huntemann M, Richter M, Gloeckner F, Boffelli D, Anderson I, Barry K, Shapiro H, et al. Symbiosis insights through metagenomic analysis of a microbial consortium. Nature (2006) 443:950–955.[CrossRef][Medline]

  41. Eisen M, Spellman P, Brown P, Botstein D. Cluster analysis and display of genome-wide expression patterns. Proc. Natl Acad. Sci. USA. (1998) 95:14863–14868.[Abstract/Free Full Text]

  42. Nesbø CL, Boucher Y, Dlutek M, Doolittle WF. Lateral gene transfer and phylogenetic assignment of environmental fosmid clones. Environ. Microbiol. (2005) 7:2011–2026.[Medline]

  43. Liu SH, Hessel A, Sanderson KE. Genomic mapping with I-Ceu I, an intron-encoded endonuclease specific for genes for ribosomal RNA, in Salmonella spp. Escherichia coli, and other bacteria Proc. Natl Acad. Sci. USA (1993) 90:6874–6878.[Abstract/Free Full Text]

  44. Aagaard C, Awayez MJ, Garrett RA. Profile of the DNA recognition site of the archaeal homing endonuclease I-DmoI. Nucleic Acids Res. (1997) 25:1523–1530.[Abstract/Free Full Text]

  45. Jurica MS, Monnat RJ Jr, Stoddard BL. DNA recognition and cleavage by the LAGLIDADG homing endonuclease i-crei. Mol. Cell (1998) 2:469–476.[CrossRef][Web of Science][Medline]

  46. Lim HK, Chung EJ, Kim JC, Choi GJ, Jang KS, Chung YR, Cho KY, Lee SW. Characterization of a forest soil metagenome clone that confers indirubin and indigo production on Escherichia coli. Appl. Environ. Microbiol. (2005) 71:7768–7777.[Abstract/Free Full Text]

  47. Rondon MR, August PR, Bettermann AD, Brady SF, Grossman TH, Liles MR, Loiacono KA, Lynch BA, MacNeil IA, Minor C, et al. Cloning the soil metagenome: a strategy for accessing the genetic and functional diversity of uncultured microorganisms. Appl. Environ. Microbiol. (2000) 66:2541–2517.[Abstract/Free Full Text]

  48. MacNeil IA, Tiong CL, Minor C, August PR, Grossman TH, Loiacono KA, Lynch BA, Phillips T, Narula S, Sundaramoorthi R, et al. Expression and isolation of antimicrobial small molecules from soil DNA libraries. J. Mol. Microbiol. Biotechnol. (2001) 3:301–308.[Web of Science][Medline]

  49. Tringe S, Hugenholtz P. A renaissance for the pioneering 16S rRNA gene. Curr. Opin. Microbiol. (2008) 11:442–446.[CrossRef][Web of Science][Medline]

  50. Arnould S, Chames P, Perez C, Lacroix E, Duclert A, Epinat J, Stricher F, Petit A, Patin A, Guillier S, et al. Engineering of large numbers of highly specific homing endonucleases that induce recombination on novel DNA targets. J. Mol. Biol. (2006) 355:443–458.[CrossRef][Web of Science][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?



This Article
Right arrow Abstract Freely available
Right arrow Print PDF (2794K) Freely available
Right arrow Screen PDF (416K) Freely available
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Commercial Re-use Guidelines
for Open Access NAR Content
Google Scholar
Right arrow Articles by Yung, P. Y.
Right arrow Articles by Thomas, T.
PubMed
Right arrow PubMed Citation
Right arrow Articles by Yung, P. Y.
Right arrow Articles by Thomas, T.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?