| Nucleic Acids Research | Pages |
Consensus-degenerate hybrid oligonucleotide primers for amplification of distantly related sequences
Introduction
Materials And Methods
Primer design
Molecular and sequence analysis
Results
Detection of novel genomes using hybrid primers
Isolation of homologous sequences from a multi-gene family within one genome using hybrid primers
Analysis of hybrid primer utilization
Using the CODEHOP prediction program to isolate gene homologs from different genomes
Discussion
Acknowledgements
References
|
|
Consensus-degenerate hybrid oligonucleotide primers for amplification of distantly related sequences
ABSTRACT
INTRODUCTION
Most applications of the polymerase chain reaction (PCR) are based on designing primers that precisely match a known target sequence. However, in some situations, primers are targeted to unknown sequences, as when trying to isolate genes encoding proteins that belong to known protein families (1-5). In such cases, PCR primer design is usually based on reverse translation of multiply aligned sequences across the conserved regions of proteins (blocks). Various rules of thumb have been applied to this problem, but frequent failures to amplify a desired target sequence are often attributable to inadequate primer design. Primer design can be very difficult because of codon degeneracy and the additional degeneracy needed to represent multiple codons at a position in the alignment. These degeneracies lead to complications in trying to find suitable annealing temperatures and primer lengths. The need to target regions of high sequence conservation containing codons of low degeneracy limits PCR detection of unknown sequences to fairly close relatives, so improvements in primer design have the potential to be widely applicable.
To isolate distantly-related sequences by PCR, two strategies have previously been employed. One is to synthesize a pool of degenerate primers containing most or all of the possible nucleotide sequences implicit in a multiple alignment (Fig. 1A). One problem with this approach is that as the degeneracy increases to accomodate more divergent genes, the concentration of any single primer drops. As a result, the number of primer molecules in a PCR reaction that can prime synthesis during the amplification cycles drops, and these primers are used up early in the reaction. In addition, artifactual amplification occurs because of the dominance of primers in the pool which do not participate in amplification of the targeted gene but are available to prime non-specific synthesis. These problems are exacerbated by the low stringency annealing conditions that may be needed to detect mismatched homologs, especially when using short primers required for short conserved blocks. The result is a weak or undetectable band on a gel that might be no higher than background. The second strategy is to design a single consensus primer across the highly conserved region. The consensus primer is usually derived by choosing the most common nucleotide at every position of multiply aligned nucleotide sequences. Although this technique has been most successful in the isolation of highly conserved gene homologs, primer-to-template mismatches preclude its application to distantly related sequences.
Figure 1. Schematic comparison of standard degenerate PCR (A) with the CODEHOP strategy (B), illustrating regions of mismatch in primer-to-template annealing during early PCR cycles and in primer-to-product annealing during subsequent cycles. Vertical lines indicate nucleotide matches between primer (arrow) and template or synthesized product. The overall degeneracy is the product of degeneracies at each nucleotide position, so that the fraction of precisely hybridizing primers = 1/degeneracy. Here we describe a strategy that overcomes problems of both degenerate and consensus methods for primer design: COnsensus-DEgenerate Hybrid Oligonucleotide Primers (CODEHOP, Fig. 1B). Hybrid primers consist of a relatively short 3[prime] degenerate core and a 5[prime] non-degenerate consensus clamp. Reducing the length of the 3[prime] core to a minimum decreases the total number of individual primers in the degenerate primer pool. Hybridization of the 3[prime] degenerate core with the target template is stabilized by the 5[prime] non-degenerate consensus clamp, which allows higher annealing temperatures without increasing the degeneracy of the pool. Although potential mismatches may occur between the 5[prime] consensus clamp of the primer and the target sequence during the initial PCR cycles, they are situated away from the 3[prime] hydroxyl extension site, and so mismatches between the primer and the target are less disruptive to priming of polymerase extension. Further amplification of primed PCR products during subsequent rounds of primer hybridization and extension is enhanced by the sequence similarity of all primers in the pool; this potentially allows utilization of all primers in the reaction. We demonstrate the CODEHOP strategy by successfully amplifying unknown sequences from a background of genomic DNA. We also describe a program, implemented for the World Wide Web (WWW), for automatically predicting optimal primers that embody the CODEHOP strategy. The practical utility of this program is demonstrated by isolating members of a rapidly evolving family of novel cytosine methyltransferase homologs from diverse plants.
MATERIALS AND METHODS
Primer design
A CODEHOP is degenerate at the 3[prime] core region of length 11-12 bp across four codons of highly conserved amino acids and is non-degenerate at the 5[prime] consensus clamp region of 18-25 bp. Initially, such primers were designed by visual examination of protein multiple alignments made using ClustalW (6). This manual approach employing heuristic rules to identify suitable regions was later superseded by the development of a program that performs an exhaustive search. The CODEHOP program designs a pool of primers containing all possible 11- or 12mers for the 3[prime] degenerate core region and having the most probable nucleotide predicted for each position in the 5[prime] non-degenerate clamp region.
The program consists of the following steps. (i) A set of blocks is input, where a block is an aligned array of amino acid sequence segments without gaps that represents a highly conserved region of homologous proteins (7). A weight is provided for each sequence segment (8), which can be increased to favor the contribution of selected sequences in designing the primer. A codon usage table is chosen for the target genome. (ii) A position-specific scoring matrix is computed for each block using the odds-ratio method (9). (iii) A consensus amino acid residue is selected for each position of the block as the highest scoring amino acid in the matrix. (iv) For each position of the block, the most common codon corresponding to the amino acid chosen in step iii is selected utilizing the user-selected codon usage table (10). This selection is used for the default 5[prime] consensus clamp in step viii. (v) A DNA position-specific scoring matrix is calculated from the amino acid matrix (step ii) and the codon usage table. The DNA matrix has three positions for each position of the amino acid matrix. The score for each amino acid is divided among its codons in proportion to their relative weights from the codon usage table, and the scores for each of the four different nucleotides are combined in each DNA matrix position. Nucleotide positions are treated independently when the scores are combined. As an option, the highest scoring nucleotide residue from each position can replace the most common codons from step iv that are used in the consensus clamp. (vi) The degeneracy is determined at each position of the DNA matrix based on the number of bases found there. As an option, a weight threshold can be specified such that bases that contribute less than a minumum weight are ignored in determining degeneracy. (vii) Possible degenerate core regions are identified by scanning the DNA matrix in the 3[prime] to 5[prime] direction. A core region must start on an invariant 3[prime] nucleotide position, have a length of 11 or 12 positions ending on a codon boundary, and have a maximum degeneracy of 128 (current default). The degeneracy of a region is the product of the number of possible bases in each position. (viii) Candidate degenerate core regions are extended by addition of a 5[prime] consensus clamp from step iv or v. The length of the clamp is controlled by a melting point temperature calculation (11,12) (current default = 60°C) and is usually [sim]20 nucleotides. (ix) Steps vii and viii are repeated on the reverse complement of the DNA matrix from step v for primers corresponding to the opposite DNA strand.
Molecular and sequence analysis
Primers were synthesized either commercially (Oligos Etc) or by the Hutchinson Center Biotechnology facility. Nucleic acids were extracted from macaque and human tissues and cell lines as described (13) and from Arabidopsis leaves using a Qiagen plant DNA kit. A set of crude plant DNAs was a gift from Amy Denton. Each 50 µl amplification reaction was performed using 25 pmol of each primer pool in a thin-walled 0.5 ml microcentrifuge tube in either a Perkin-Elmer 480 or MJ Research PTC100 thermal cycler. Whole PCR products were cloned using the TOPO-TA cloning kit (Invitrogen). Agarose gel analysis and DNA sequencing were performed using standard methods (14,15). Dendrograms were produced using the neighbor-joining and bootstrapping procedures in ClustalW (6) as implemented on the Blocks WWW site (16).
RESULTS
The hybrid primer strategy was tested on problems in which the target sequence for amplification was unknown but could be predicted from multiply aligned protein sequences. In the first test, hybrid primers aimed at identifying a new primate herpes virus were designed from multiple sequence alignments of DNA polymerases from different herpes viruses. The second test used hybrid primers designed from alignments of reverse transcriptases from different retroviral genomes to identify a family of related retroviral elements within the human genome. In these tests, the hybrid primers were manually designed from multiple sequence alignments. The third test utilized the automated CODEHOP prediction program to design optimal primers from BlockMaker-generated alignments (17) of several DNA methyltransferases. Predicted CODEHOPs were used to identify members of a new subfamily of DNA methyltransferases from different plant genomes.
Figure 2. Hybrid primer design strategy for DNA polymerase genes of different herpes viruses. The nucleotide sequences across the conserved YGDTD sequence block from a variety of herpes virus DNA polymerase genes are aligned by codons. The invariant nucleotide positions are shown in shaded boxes. The amino acid sequences encoded at the various positions are shown on top with the YGDTD motif highlighted. The sequences are grouped within the [alpha], [beta] and [gamma] subclasses of herpesviruses in descending order in the figure with the catfish herpes virus as an outlier. The GDTD1B hybrid primer pool was designed as a negative strand primer and is shown underlined. The IUBPAC codes for nucleotide degeneracies are used, and the degenerate positions are indicated (*). The primer pool is 64-fold degenerate, and each primer is 35 bp in length. (hHSV1, human herpes simplex virus 1 GenBank #X14112; hVZV, human varicella virus GenBank #X04370; eHV1, equine herpes virus 1 GenBank #M86664; hHV6, human herpes virus 6 GenBank #M63804; hHV7, human herpes virus 7 GenBank #U43400; hCMV, human cytomegalovirus GenBank #X17403; gpCMV, guinea pig cytomegalovirus, GenBank #L25706; mCMV, mouse cytomegalovirus GenBank #M73549; HVS, herpes virus saimiri #X64346; hEBV, human Epstein-Barr virus GenBank #V015555; iHV, ictalurid (catfish) herpes virus GenBank #M75136). We predicted that macaque retroperitoneal fibromatosis, a tumor similar to Kaposi's sarcoma, might contain a herpes virus homologous to the newly identified Kaposi's sarcoma-associated herpes virus (13). To identify and characterize such an unknown herpes virus, the amino acid sequences of the DNA polymerase genes ([sim]1000 aa) from 11 different herpes virus genomes from the [alpha], [beta] and [gamma] subclasses were multiply aligned. Visual examination of the alignment revealed five blocks that contained invariant regions suitable for primer prediction. Three blocks were chosen for primer design after evaluation of codon degeneracy within the blocks and distance between blocks. Primers were designed from these three regions using all codon possibilities for the 3[prime] degenerate core and the most frequent nucleotide in each position for the 5[prime] consensus clamp. The design strategy is shown for the most conserved sequence block (Fig. 2). As previously described (13), a hemi-nested PCR strategy was developed to use these three primers in two successive amplification reactions at 60°C to detect low amounts of viral DNA in a background of cellular genomic DNA from formalin-fixed paraffin-embedded samples. A PCR product of the correct size was detected on an electrophoretic gel. This product was cloned and sequenced and was shown to correspond to a DNA polymerase gene of a new macaque herpes virus most closely related to the human Kaposi's sarcoma-associated herpes virus (13). The success of the hybrid primer strategy in this example encouraged its refinement and extension to isolate other distantly-related sequences. Figure 3. Hybrid primer design strategy for reverse transcriptase genes from various retroviral sequences. The nucleotide sequences across the conserved LQPG sequence blocks from a variety of retroviral sequences are aligned by codons. The invariant nucleotide positions are shown in open boxes. The amino acid sequences encoded at the various positions are shown on top with the evident LQPG motif highlighted. The sequences are grouped depending on the presence of a `W', `M' or `F' codon immediately following the LQPG block, and the conserved nucleotides within these codons are shown in shaded boxes. The three hybrid primers designed from the `W', `M' and `F' sequence groups are listed below with the degenerate positons indicated (*). (HIV1, human immunodeficiency virus type 1 GenBank #M38432; HIV2, human immunodeficiency virus type 2 GenBank #A05350; SIVAGM, simian immunodeficiency virus strain AGM GenBank #X07805; CAEV, caprine arthritis encephalitis virus GenBank #M33677; OMVV, ovine lentivirus GenBank #M31646; BIV, bovine immunodeficiency virus GenBank#M32690; FIV, feline immunodeficiency virus GenBank #M25381; SMRV, simian sarcoma virus GenBank #M23385; MMTV mouse mammary tumor virus #M15122; RSV, Rous Sarcoma Virus GenBank #J02342; HTLV1, human T-cell lymphotrophic virus 1 GenBank #L36905; HTLV2, human T-cell lymphotrophic virus 2 GenBank #L11456; HERSEQA, human endogenous retrovirus sequence GenBank #M96062; EIA, equine infectious anemia virus GenBank #U01866). To determine the nature and extent of retroviral sequence elements within the human genome, we designed primers to detect unknown reverse transcriptase-like sequences. The amino acid sequences of reverse transcriptase genes from 14 different retroviruses and retroviral sequences were multiply aligned. Two invariant sequence motifs (LPQG) and (YMDD) separated by [sim]40 aa (120 bp) were identified. The LPQG motif could be separated into three different sequence groups based on the identity of the amino acid immediately following the LPQG motif (M, W or F), and so three different hybrid primers were designed. The primer pools were 32-fold degenerate and 29-30 bp in length (Fig. 3). Amplification was performed at 55°C using the different combinations of the upstream hybrid primer pools (LPQGM, LPQGF, and LPQGW) and the downstream primer pool (YMDD), which was 24-fold degenerate and 30 bp in length. Electrophoretic analysis revealed a single band of the expected size in the amplification reactions from the two tissue samples examined using the LPQGM and YMDD primers. No bands were detected using the LPQGF or LPQGW primers. The LPQGM-YMDD reaction mixtures were used for cloning, and 52 individual clones were sequenced, 26 from each of the two tissue sources. Forty-eight of the clones contained amplified products corresponding to reverse transcriptase coding regions, which are closely related to the mouse mammary tumor virus sequences. Twenty-seven different sequences were identified: four of these are possible pseudogenes because of the presence of insertions or deletions within the coding region. A phylogenetic analysis of the multiply aligned sequences (Fig. 4) demonstrates the varied nature of retroviral sequence elements within the human genome. An additional four clones contained artifactual sequences not related to reverse transcriptases. Three of the 27 clones contained a sequence identical to that of AMV reverse transcriptase, the enzyme used for cDNA synthesis, indicating the likely presence of DNA contamination in the enzyme preparation. In summary, our results demonstrate that hybrid primers can be used to isolate diverse members of multi-gene families simultaneously. Figure 4. Alignment (A) and dendrogram (B) of amino acid sequences encoded by multiple endogenous reverse transcriptase-related sequences detected with hybrid primers LPQGM and YMDD from human tissue. Nucleic acids were prepared from paraffin blocks of lesions from Kaposi's sarcoma (clones designated 19) and rheumatoid arthritis (clones designated 15) using xylene washes and proteinase-K digestion as described (13). cDNA was synthesized using AMV reverse transcriptase with the hybrid primer pool (YMDD) predicted from the downstream YMDD motif. Amplification was performed using either of the upstream LPQGM, LPQGW or LPQGF hybrid primers (50 pmol) in combination with the downstream YMDD hybrid primer pool (50 pmol) in 0.067 M Tris buffer (pH 8.8), 4 mM MgCl2, 16 mM (NH4)2SO4, 10 mM 2-mercaptoethanol containing 100 µg bovine serum albumin per ml (16) for 35 cycles (1 min at 94°C, 1 min at 55°C, 1 min at 72°C). A hot start was obtained by initially incubating at 65°C prior to addition of Taq polymerase (2.5 U/50 µl). The amplification products were visualized on a 2.5% agarose gel with ethidium bromide and UV irradiation. The encoded amino acid sequences of series 19 and 15 cloned inserts (GenBank #AF047584-AF047597 and #AF050504-AF050516) are aligned with the corresponding sequences from 10 endogenous and viral reverse transcriptase sequences (RTVL-Hp3, GenPept #423062; HOMORT2 #257757; HERVK10, GenBank #M14123; HUMREVTRAA, #M25766; HUMREVTRAC, #M25768; AMV #S74099). Positions containing insertions or deletions in pseudogenes are indicated (*). Our results can be compared with those obtained in two previous studies using the LPQG and YMDD reverse transcriptase regions for conventional degenerate primer design (2,18). In both studies, gel purification of PCR products was necessary. Nevertheless, in one study, only three of 17 clones were correct (2). In the other study, successful amplification was only obtained using purified viral template (18). In contrast, application of our hybrid primer method to minute amounts of genomic DNA present in formalin-fixed paraffin block sections yielded 48/52 correct clones from unpurified PCR products. Figure 5. Analysis of hybrid primer utilization. The sequences of the hybrid primers, LPQGM and YMDD, incorporated into PCR products during the final amplification reaction of the experiment described in Figure 4, were determined from clones 19-O, -K, -B and -Z which contain a fragment of the retroviral element HEU2742 (GenBank #U27242). Nucleotide and amino acid sequences of the LPQGM (A) and YMDD (B) primer binding sites in HEU2742 are shown. The sequences of hybrid primer pools are aligned with the HEU2742 sequences and the sequences of degenerate codons in the primer pools are in shaded boxes. The direction of polymerase extension is indicated and the downstream YMDD primer is shown as its complement for clarity. Sequences from the incorporated primer for each clone are aligned with that of HEU2742, where identical residues are indicated (.). To determine the utilization of hybrid primers during PCR amplification, we analyzed the sequences across the primers incorporated into four of the clones obtained with the LPQGM and YMDD primers. These four clones (19-B, -K, -O, -Z) corresponded to the human retroviral element HEU2742 whose sequence was available in GenBank. The sequences across the LPQGM and YMDD primer binding sites in HEU2742 were compared with the sequences obtained from the primers incorporated into the four different clones (Fig. 5A and B). In the core regions, the unknown template was found to encode the same invariant amino acid residues present in the alignment used to predict the primer. Consistent with the premise that multiple hybrid primers would participate in amplifying the correct target, six of the eight clone ends had incorporated primers with different sequences. As expected, the sequences corresponding to the 5[prime] consensus region of the cloned primers were identical to one another but differed from the sequence of the HEU2742 template. In the case of the LPQGM primers, the 5[prime] consensus region matched the HEU2742 template sequence at 16/20 nucleotide residues. However, in the case of the YMDD primers, only 4/17 nucleotide residues in the consensus region matched the template. This poorly-matched 5[prime] clamp appears to have stabilized the 3[prime] core during the 55°C annealing step, because even a perfectly-matched core should have melted at 34°C (12). Degenerate PCR primers have been used with limited success for obtaining eukaryotic C5 DNA methyltransferases. For example, the mouse DNA methyltransferase was used to design degenerate PCR primer pools that led to isolation of the Arabidopsis thaliana MET1 gene based on typical low stringency amplification and purification of a gel fragment of the correct size (19). These primers were used in an attempt to obtain DNA methyltransferases from other plants, including oak, salal and rhododendron; however, no bands of the correct size (except for Arabidopsis) were resolved (data not shown). Therefore, we judged that eukaryotic C5 DNA methyltransferases represent a challenging family for isolation of new members by PCR. A program to design consensus-degenerate hybrid oligonucleotide primers (CODEHOP) was written that applies the general rules used to design primers in the previous sections. Program input is a set of blocks and output is a primer map that lists CODEHOPs which fulfill specified stringency criteria. To test the CODEHOP strategy on the higher eukaryotic C5 DNA methyltransferases, all eight available sequences were presented to BlockMaker (17), resulting in a set of six blocks corresponding to the six well-known conserved regions of these proteins (7; 20). Two of the sequences are from the `chromomethylase' subfamily of predicted proteins in A.thaliana and its closest relative, Cardaminopsis arenosa (21). The other six sequences comprise a set of presumed DNA methyltransferase orthologs from animals (sea urchins to humans) and a plant (A.thaliana MET1). To bias the primers towards chromomethylases, the two members of this subfamily were upweighted by an arbitrary factor of four times the sequence weights, which are automatically provided by BlockMaker to reduce redundancy of close relatives (8). Using the C5 DNA methyltransferase blocks as input, three pairs of optimal primers were identified. Two pairs would potentially amplify a sufficiently short region in the known chromomethylase genomic sequences (<500 bp) to be of practical use. For one of the predicted primers, the primer design strategy is shown (Fig. 6). Figure 6. The highlighted CODEHOP, consisting of an 11 residue 3[prime] degenerate core and a 19 residue 5[prime] consensus clamp, was predicted from the alignment shown. (A) Portion of a block alignment of eight sequences. MTCH_ARATH and MTCH_CARAR are chromomethylases from A.thaliana and C.arenosa, respectively; these were given weights four times those assigned by the position-based sequence weighting method (8) in order to bias the hybrid primers towards them. (B) The consensus residues from the amino acid PSSM for the block (which is not shown), and the corresponding most common codons according to the codon usage table for A.thaliana. (C) DNA PSSM with the most degenerate residue and degeneracy value at each position. The best suggested CODEHOP has degeneracy of 16 in the core region and the degenerate residues are underlined; the clamp region is drawn from the most common codons in (B), also underlined. One CODEHOP pair produced complicated patterns of bands in various plant samples and even in the presumed negative control from Drosophila melanogaster, so products were not analyzed in detail. The other CODEHOP pair amplified products of the expected size ([sim]250 bp) using DNAs from A.thaliana, broccoli, rhododendron, salal, stonecrop, oak and barley. The PCR reaction product from each sample was used for cloning into a plasmid vector without purification. Sequence analysis revealed that correct amplification of a putative chromomethylase occurred for A.thaliana (2/2 clones), broccoli (2/2 clones) rhododendron (2/2 clones), salal (1/1 clones), stonecrop (2/2 clones) and oak (1/2 clones) (Fig. 7A). A dendrogram of the translated sequences shows that the branch lengths of the putative chromomethylases from these dicot plants are almost two-fold longer than the branch lengths of animal C5 DNA methyltransferases, ranging from mammals to sea urchins (Fig. 7B). Therefore, this CODEHOP pair successfully amplified chromomethylases that appear to be more diverse than the orthologous set of DNA methyltransferases from vertebrates and echinoderms. Figure 7. Alignment of higher eukaryotic DNA methyltransferases and translated PCR products (in bold) obtained using CODEHOP-designed primers (A) and the corresponding dendrogram showing bootstrap resampling percentages (B). GenBank accession numbers for amplified DNA sequences are AF47322-AF47328. PCR reactions were performed using primers designed by the CODEHOP program with BlockMaker MOTIF-generated blocks from the eight protein sequences listed in Figure 6 as input. The upstream primer was 5[prime]-CATGGTTTGTGGAGGACCTCCNTGYCARGG-3[prime] (Fig. 6) and the downstream primer was 5[prime]-TTGCATCATTCCGAATCTACAYTGRTANYYCAT-3[prime]. A hot-start was obtained by using Ampli-Taq Gold (Perkin-Elmer, 2.5 U/50 µl) and buffer with 4 mM MgCl2 (Perkin-Elmer) with a 9 min pre-heating step at 94°C, followed by 40 cycles (30 s at 94°C, 30 s at 53°C and 30 s at 72°C) and a final 7 min, 72°C incubation. Interestingly, the two broccoli clones came from different chromomethylase-like genomic sequences. The dendrogram indicates that one broccoli sequence is more closely similar to the CMT1 sequences of other mustards, A.thaliana and C.arenosa, than it is to the other plants, as expected. However, the other broccoli sequence, designated CMT2, groups with the other plants. This result was confirmed by using a broccoli CMT2 CODEHOP-based clone to select by filter hybridization an A.thaliana genomic clone containing a CMT2 homolog. Sequencing revealed that A.thaliana CMT2 has an almost identical exon/intron structure to CMT1 and encodes a chromomethylase that aligns with 43% amino acid identity over the full length of CMT1, with a CMT2-specific N-terminal extension (L.Comai, C.M.McCallum and S.Henikoff, unpublished results). One of three clones from barley (a monocot with a 5000 Mb genome) yielded a sequence that is significantly different from A.thaliana MET1 but not from the known animal DNA methyltransferases, which are thought to be orthologous to MET1. The presence of this sequence in the crude barley DNA preparation was confirmed by subsequent amplifications using specific primers internal to the CODEHOP pair. However, these internal primers failed to amplify any specific product from a highly purified barley DNA preparation derived from a different source (data not shown). It therefore appears that the non-plant-like sequence arose from contamination of our first barley sample with an organism unrelated to barley, such as a fungus growing on the barley. Regardless of the source of this sequence, it is interesting that a member of the orthologous set of eukaryotic C5 DNA methyltransferases was identified using primers biased towards chromomethylases, indicating that CODEHOPs are able to amplify DNA methyltransferases from two diverged subfamilies in a background of complex genomic DNA.
Detection of novel genomes using hybrid primers
Isolation of homologous sequences from a multi-gene family within one genome using hybrid primers
Analysis of hybrid primer utilization
Using the CODEHOP prediction program to isolate gene homologs from different genomes
DISCUSSION
Isolation of an unknown sequence related to known sequences is a powerful method for investigating biological function. The sequence of an unknown protein in one organism may be homologous to those of known proteins from different organisms, or may be related to a known protein sequence belonging to a multigene family within an organism. In many cases, low-stringency hybridization or PCR methods have succeeded in obtaining such desired genes. However, as the degree of protein similarity decreases, so does success in gene isolation. When only a single sequence is known, low-stringency hybridization is used, although a fairly long region of similarity may be needed. Moreover, considerable effort is required to determine whether a candidate clone is a correct one. If a family of proteins is available, then consensus or degenerate PCR methods may be used, because regions of high sequence similarity can be identified and utilized in the design of PCR primers. PCR methods are not only faster and easier than low-stringency hybridization, but product size and homogeneity can also be used to judge probable success. However, consensus primers may be too dissimilar to an unknown target to efficiently anneal to the original template, and degenerate primers may be too dissimilar to each other to efficiently amplify the synthesized product. In either case, mismatches in oligonucleotide annealing are typically limiting; however, ignorance of how mismatches affect annealing (22) has resulted in primer designs that are largely subjective and that must be optimized by time-consuming trial-and-error testing.
Our novel hybrid strategy overcomes drawbacks of both consensus and degenerate methods by basing primer design on precisely-matched regions only. We presume that correctly amplified products are initially produced by precise matching of primer to template in the 3[prime] core and later by precise matching of primer to product in the 5[prime] clamp. The CODEHOP algorithm is aimed at minimizing mismatches between the consensus clamp and unknown templates, so that mismatches are unlikely to limit the application of our strategy to challenging problems. It seems more likely that our method is limited by the degeneracy of the 3[prime] core, which our algorithm optimally selects.
The practical utility of the hybrid method is demonstrated by successful amplification of unknown sequences that are too diverged from known sequences to be readily isolated by standard methods. In addition, the hybrid method was successful in amplifying unknown target sequences from sources containing small quantities of degraded nucleic acids, even single viral sequences present in a small minority of cells. In all cases, single PCR products of the correct size were observed by analytical agarose gel electrophoresis, so no gel purification was necessary. Our method was also successful in isolating diverse related products in a single reaction.
Although we rely on stabilization of the 3[prime] core by the presumably mismatched 5[prime] clamp in annealing to template, our data indicate that even poorly-matched clamps can be effective. This suggests that the actual sequence of the clamp is not always important, in which case annealing to template would be stabilized by any 5[prime] extension. It may be that the common practice of adding an arbitrary 5[prime] extension to a degenerate primer in order to introduce a restriction site is inadvertently responsible for many successful amplifications of unknown sequences in the past. Furthermore, the evident effectiveness of a clamp that is mismatched to template suggests that our hybrid strategy can be used for gene isolation when only short peptide sequences are available for primer design. In such cases, the 3[prime] core would correspond to reverse translation of the least degenerate 3-4 amino acid region, and the 5[prime] clamp could extend beyond available sequence with arbitrarily chosen residues.
As sequence databanks grow and more sequences are classified into known families, the conserved protein regions become better delineated; this can aid in PCR primer design. At present, the Blocks Database (v. 9.3) contains 3417 alignment blocks representing 932 protein families, with an average of 23 sequences per family (16). Blocks from relatively similar sequences have been previously used for designing effective degenerate PCR primers (5). However, for more diverged families, there are too few consecutive invariant and highly conserved residues with low codon degeneracy to design efficient degenerate or consensus PCR primers. Because our hybrid strategy requires no more than four consecutive highly conserved amino acids, it can be more generally applied to these diverse protein families.
We have implemented the CODEHOP method as a computer program that is available for interactive use on the WWW. Previous programs have been introduced to design PCR primers to match known templates (11,23-25). When designing primers to unknown templates, other programs have been developed to minimize potential mismatches by identifying regions of low variability and codon degeneracy (26). Unfortunately, no theory or systematic method exists to guide primer design for unknown templates (22). Our new strategy, however, provides guidelines for design of efficient primers by limiting the degeneracy to just the 3[prime] 11-12 nucleotides of a primer and stabilizing annealing with a long consensus clamp. Moreover, the CODEHOP program utilizes all of the information available in the input alignment and takes into account the codon usage of the target genome to aid in primer design. The program first converts protein multiple sequence alignments into scoring matrices that consider sequence redundancy and amino acid conservation. These matrices are then converted to DNA frequency matrices tailored by organism-specific codon usage tables, and these DNA matrices are searched for optimal hybrid primers. Primers are displayed on a map that shows the level of degeneracy of the 3[prime] core and the maximum annealing temperature of the 5[prime] clamp, the length of which is based on the nearest-neighbor free energy method (12).
WWW implementation of the CODEHOP program has allowed it to be directly linked to the BlockMaker site for producing suitable multiple alignments from related protein sequences submitted by the user. The program is used interactively, so that parameters may be varied if needed: users can adjust the desired annealing temperature, the degree of degeneracy and the cut-off frequency level for bases allowed in the 3[prime] core region. Because there are no mismatches between primers and PCR products in the 5[prime] clamp region, stringent annealing conditions may be used, thus minimizing mispriming. We have found that annealing temperatures as high as 65°C can yield correct product, although stepwise reduction of the annealing temperature down to 50°C may lead to successful amplification without unacceptable background if no product is detected initially. A useful feature of the program is the ability to manually modify alignments or weights as desired. For example, reweighting sequences in order to favor certain ones was employed in designing CODEHOP pairs for the preferential amplification of plant chromomethylases relative to other C5 DNA methyltransferases.
We have found that the CODEHOP method can be extended to even more divergent target sequences by using higher degeneracies and purifying PCR products of the anticipated size on high resolution polyacrylamide electrophoretic gels (T.M.R., unpublished results). We are currently testing the use of touchdown PCR (27) and polymerase time-release with the CODEHOP method (S.H., unpublished data). Other possible enhancements might increase the effectiveness of our method, such as changes in the program that would vary the length of the degenerate core or score the consensus clamp. These and other refinements should lead to even more efficient isolation of distantly-related unknown sequences than can be obtained at present.
ACKNOWLEDGEMENTS
This work was supported in part by a grant to T.M.R. from the M.J.Murdock Charitable Trust and by a grant to S.H. from NIH. S.P. is a Howard Hughes Medical Institute Fellow of the Life Sciences Research Foundation.
REFERENCES
This page is run by Oxford University Press, Great Clarendon Street, Oxford OX2 6DP, as part of the OUP Journals
Comments and feedback: www-admin{at}oup.co.uk
Last modification: 24 Mar 1998
Copyright© Oxford University Press, 1998.
This article has been cited by other articles:
![]() |
S. Maksimovic, T. A. Cook, and E. K. Buschbeck Spatial distribution of opsin-encoding mRNAs in the tiered larval retinas of the sunburst diving beetle Thermonectus marmoratus (Coleoptera: Dytiscidae) J. Exp. Biol., December 1, 2009; 212(23): 3781 - 3794. [Abstract] [Full Text] [PDF] |
||||
![]() |
P.-C. Liao, T.-P. Lin, W.-C. Lan, J.-D. Chung, and S.-Y. Hwang Duplication of the class I cytosolic small heat shock protein gene and potential functional divergence revealed by sequence variations flanking the {alpha}-crystallin domain in the genus Rhododendron (Ericaceae) Ann. Bot., November 3, 2009; (2009) mcp272v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Liu, K. Kang, J. Zhang, Q. Ouyang, Z. Zhou, S. Tian, and M. Xing A novel Physarum polycephalum SR protein kinase specifically phosphorylates the RS domain of the human SR protein, ASF/SF2 Acta Biochim Biophys Sin, August 1, 2009; 41(8): 657 - 667. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. G. Healy, K. P. Eaton, P. Limsirichai, J. F. Aldrich, A. K. Plowman, and R. R. King Characterization of {gamma}-Butyrolactone Autoregulatory Signaling Gene Homologs in the Angucyclinone Polyketide WS5995B Producer Streptomyces acidiscabies J. Bacteriol., August 1, 2009; 191(15): 4786 - 4797. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Viljakainen, J. D. Evans, M. Hasselmann, O. Rueppell, S. Tingek, and P. Pamilo Rapid Evolution of Immune Proteins in Social Insects Mol. Biol. Evol., August 1, 2009; 26(8): 1791 - 1801. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. A. Durkin, T. Mock, and E. V. Armbrust Chitin in Diatoms and Its Association with the Cell Wall Eukaryot. Cell, July 1, 2009; 8(7): 1038 - 1050. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Contreras-Moreira, B. Sachman-Ruiz, I. Figueroa-Palacios, and P. Vinuesa primers4clades: a web server that uses phylogenetic trees to design lineage-specific PCR primers for metagenomic and diversity studies Nucleic Acids Res., July 1, 2009; 37(suppl_2): W95 - W100. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Boyce, P. Chilana, and T. M. Rose iCODEHOP: a new interactive program for designing COnsensus-DEgenerate Hybrid Oligonucleotide Primers from multiply aligned protein sequences Nucleic Acids Res., July 1, 2009; 37(suppl_2): W222 - W228. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Huang, P. Shi, Y. Wang, H. Luo, N. Shao, G. Wang, P. Yang, and B. Yao Diversity of Beta-Propeller Phytase Genes in the Intestinal Contents of Grass Carp Provides Insight into the Release of Major Phosphorus from Phytate in Nature Appl. Envir. Microbiol., March 15, 2009; 75(6): 1508 - 1516. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Weller, V. Hecht, J. K. Vander Schoor, S. E. Davidson, and J. J. Ross Light Regulation of Gibberellin Biosynthesis in Pea Is Mediated through the COP1/HY5 Pathway PLANT CELL, March 1, 2009; 21(3): 800 - 813. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. E. Grus and J. Zhang Origin of the Genetic Components of the Vomeronasal System in the Common Ancestor of all Extant Vertebrates Mol. Biol. Evol., February 1, 2009; 26(2): 407 - 419. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Havird, M. M. Miyamoto, K. P. Choe, and D. H. Evans Gene Duplications and Losses within the Cyclooxygenase Family of Teleosts and Other Chordates Mol. Biol. Evol., November 1, 2008; 25(11): 2349 - 2359. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Gillard, V. Devos, M. J.J. Huysman, L. De Veylder, S. D'Hondt, C. Martens, P. Vanormelingen, K. Vannerum, K. Sabbe, V. A. Chepurnov, et al. Physiological and Transcriptomic Evidence for a Close Coupling between Chloroplast Ontogeny and Cell Cycle Progression in the Pennate Diatom Seminavis robusta Plant Physiology, November 1, 2008; 148(3): 1394 - 1411. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Quemeneur, A. Heinrich-Salmeron, D. Muller, D. Lievremont, M. Jauzein, P. N. Bertin, F. Garrido, and C. Joulian Diversity Surveys and Evolutionary Relationships of aoxB Genes in Aerobic Arsenite-Oxidizing Bacteria Appl. Envir. Microbiol., July 15, 2008; 74(14): 4567 - 4573. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Kim, C. Laney, J. Curry, and G. A. Unguez Expression of myogenic regulatory factors in the muscle-derived electric organ of Sternopygus macrurus J. Exp. Biol., July 1, 2008; 211(13): 2172 - 2184. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Elworthy, M. Hargrave, R. Knight, K. Mebus, and P. W. Ingham Expression of multiple slow myosin heavy chain genes reveals a diversity of zebrafish slow twitch muscle fibres with differing requirements for Hedgehog and Prdm1 activity Development, June 15, 2008; 135(12): 2115 - 2126. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Pilhofer, K. Rappl, C. Eckl, A. P. Bauer, W. Ludwig, K.-H. Schleifer, and G. Petroni Characterization and Evolution of Cell Division and Cell Wall Synthesis Genes in the Bacterial Phyla Verrucomicrobia, Lentisphaerae, Chlamydiae, and Planctomycetes and Phylogenetic Comparison with rRNA Genes J. Bacteriol., May 1, 2008; 190(9): 3192 - 3202. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Labes, E. N. Karlsson, O. H. Fridjonsson, P. Turner, G. O. Hreggvidson, J. K. Kristjansson, O. Holst, and P. Schonheit Novel Members of Glycoside Hydrolase Family 13 Derived from Environmental DNA Appl. Envir. Microbiol., March 15, 2008; 74(6): 1914 - 1921. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Voisset, R. A. Weiss, and D. J. Griffiths Human RNA "Rumor" Viruses: the Search for Novel Human Retroviruses in Chronic Disease Microbiol. Mol. Biol. Rev., March 1, 2008; 72(1): 157 - 196. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Gostincar, M. Turk, T. Trbuha, T. Vaupotic, A. Plemenitas, and N. Gunde-Cimerman Expression of fatty-acid-modifying enzymes in the halotolerant black yeast Aureobasidium pullulans (de Bary) G. Arnaud under salt stress. Stud Mycol, January 1, 2008; 61: 51 - 59. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. I. Culley and G. F. Steward New Genera of RNA Viruses in Subtropical Seawater, Inferred from Polymerase Gene Sequences Appl. Envir. Microbiol., September 15, 2007; 73(18): 5937 - 5944. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. L. Villeneuve, L. S. Blake, J. D. Brodin, K. J. Greene, I. Knoebl, A. L. Miracle, D. Martinovic, and G. T. Ankley Transcription of Key Genes Regulating Gonadal Steroidogenesis in Control and Ketoconazole- or Vinclozolin-Exposed Fathead Minnows Toxicol. Sci., August 1, 2007; 98(2): 395 - 407. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Ban, C. Honda, Y. Hatsuyama, M. Igarashi, H. Bessho, and T. Moriguchi Isolation and Functional Analysis of a MYB Transcription Factor Gene that is a Key Regulator for the Development of Red Coloration in Apple Skin Plant Cell Physiol., July 1, 2007; 48(7): 958 - 970. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Pilhofer, G. Rosati, W. Ludwig, K.-H. Schleifer, and G. Petroni Coexistence of Tubulins and ftsZ in Different Prosthecobacter Species Mol. Biol. Evol., July 1, 2007; 24(7): 1439 - 1442. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Hayes, C. Davies, and I. B. Dry Isolation, functional characterization, and expression analysis of grapevine (Vitis vinifera L.) hexose transporters: differential roles in sink and source tissues J. Exp. Bot., June 1, 2007; 58(8): 1985 - 1997. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Hecht, C. L. Knowles, J. K. Vander Schoor, L. C. Liew, S. E. Jones, M. J.M. Lambert, and J. L. Weller Pea LATE BLOOMER1 Is a GIGANTEA Ortholog with Roles in Photoperiodic Flowering, Deetiolation, and Transcriptional Regulation of Circadian Clock Gene Homologs Plant Physiology, June 1, 2007; 144(2): 648 - 661. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-W. Nam and T. J. Kappock Cloning and transcriptional analysis of Crepis alpina fatty acid desaturases affecting the biosynthesis of crepenynic acid J. Exp. Bot., April 1, 2007; 58(6): 1421 - 1432. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-L. Wu, T.-P. Lin, M.-Y. Lin, Y.-P. Cheng, and S.-Y. Hwang Divergent Evolution of the Chloroplast Small Heat Shock Protein Gene in the Genera Rhododendron (Ericaceae) and Machilus (Lauraceae) Ann. Bot., March 1, 2007; 99(3): 461 - 475. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Zluvova, M. Nicolas, A. Berger, I. Negrutiu, and F. Moneger Premature arrest of the male flower meristem precedes sexual dimorphism in the dioecious plant Silene latifolia PNAS, December 5, 2006; 103(49): 18854 - 18859. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. G. Bruce, A. M. Bakke, H. Bielefeldt-Ohmann, J. T. Ryan, M. E. Thouless, C.-C. Tsai, and T. M. Rose High levels of retroperitoneal fibromatosis (RF)-associated herpesvirus in RF lesions in macaques are associated with ORF73 LANA expression in spindleoid tumour cells J. Gen. Virol., December 1, 2006; 87(12): 3529 - 3538. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Cuellar, J. A. Kim, and G. A. Unguez Evidence of post-transcriptional regulation in the maintenance of a partial muscle phenotype by electrogenic cells of S. macrurus FASEB J, December 1, 2006; 20(14): 2540 - 2540. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. A. Ahlgren and G. Rocap Culture Isolation and Culture-Independent Clone Libraries Reveal New Marine Synechococcus Ecotypes with Distinctive Light and N Physiologies Appl. Envir. Microbiol., November 1, 2006; 72(11): 7193 - 7204. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Sano, K. Myojo, and T. Omura Cloning of a Heavy-Metal-Binding Protein Derived from Activated-Sludge Microorganisms Appl. Envir. Microbiol., September 1, 2006; 72(9): 6377 - 6380. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. A. Nix, M. S. Oberste, and M. A. Pallansch Sensitive, Seminested PCR Amplification of VP1 Sequences for Direct Identification of All Enterovirus Serotypes from Original Clinical Specimens. J. Clin. Microbiol., August 1, 2006; 44(8): 2698 - 2704. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Li, J. Ju, H. Osada, and B. Shen Utilization of the Methoxymalonyl-Acyl Carrier Protein Biosynthesis Locus for Cloning of the Tautomycin Biosynthetic Gene Cluster from Streptomyces spiroverticillatus J. Bacteriol., June 1, 2006; 188(11): 4148 - 4152. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. A. Barthelson, P. Sundareshan, D. W. Galbraith, and R. L. Woosley Development of a comprehensive detection method for medicinal and toxic plant species Am. J. Botany, April 1, 2006; 93(4): 566 - 574. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. A. Paul, S. L. Quackenbush, C. Sutton, R. N. Casey, P. R. Bowser, and J. W. Casey Identification and Characterization of an Exogenous Retrovirus from Atlantic Salmon Swim Bladder Sarcomas J. Virol., March 15, 2006; 80(6): 2941 - 2948. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. S. Bulmer and R. H. Crozier Variation in Positive Selection in Termite GNBPs and Relish Mol. Biol. Evol., February 1, 2006; 23(2): 317 - 326. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. W. Odom and G. R. Vasta Characterization of a Binary Tandem Domain F-type Lectin from Striped Bass (Morone saxatilis) J. Biol. Chem., January 20, 2006; 281(3): 1698 - 1713. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. A. Mena and P. S. Daugherty Automated design of degenerate codon libraries Protein Eng. Des. Sel., December 1, 2005; 18(12): 559 - 561. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Scheffer, C. Chen, P. Heidrich, M. B. Dickman, and P. Tudzynski A CDC42 Homologue in Claviceps purpurea Is Involved in Vegetative Differentiation and Is Essential for Pathogenicity Eukaryot. Cell, July 1, 2005; 4(7): 1228 - 1238. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Fredslund, L. Schauser, L. H. Madsen, N. Sandal, and J. Stougaard PriFi: using a multiple alignment of related sequences to find primers for amplification of homologs Nucleic Acids Res., July 1, 2005; 33(suppl_2): W516 - W520. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Leber, L. Kaderali, A. Schonhuth, and R. Schrader A fractional programming approach to efficient DNA melting temperature calculation Bioinformatics, May 15, 2005; 21(10): 2375 - 2382. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Hecht, F. Foucher, C. Ferrandiz, R. Macknight, C. Navarro, J. Morin, M. E. Vardy, N. Ellis, J. P. Beltran, C. Rameau, et al. Conservation of Arabidopsis Flowering Genes in Model Legumes Plant Physiology, April 1, 2005; 137(4): 1420 - 1434. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Fourquin, M. Vinauger-Douard, B. Fogliani, C. Dumas, and C. P. Scutt Evidence that CRABS CLAW and TOUSLED have conserved their roles in carpel development since the ancestor of the extant angiosperms PNAS, March 22, 2005; 102(12): 4649 - 4654. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Igawa, T. Ochiai-Fukuda, N. Takahashi-Ando, S. Ohsato, T. Shibata, I. Yamaguchi, and M. Kimura New TAXI-type Xylanase Inhibitor Genes are Inducible by Pathogens and Wounding in Hexaploid Wheat Plant Cell Physiol., October 15, 2004; 45(10): 1347 - 1360. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. L. Cooper and S. Henikoff Adaptive Evolution of the Histone Fold Domain in Centromeric Histones Mol. Biol. Evol., September 1, 2004; 21(9): 1712 - 1718. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Cho, S.-W. Jin, A. Cohen, and R. E. Ellis A Phylogeny of Caenorhabditis Reveals Frequent Loss of Introns During Nematode Evolution Genome Res., July 1, 2004; 14(7): 1207 - 1220. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Aruga, A. M. Pajor, K. Nakamura, L. Liu, O. W. Moe, P. A. Preisig, and R. J. Alpern OKP cells express the Na-dicarboxylate cotransporter NaDC-1 Am J Physiol Cell Physiol, July 1, 2004; 287(1): C64 - C72. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. C. Davidson, J. H. Nett, E. Renfer, H. Li, T. A. Stadheim, B. J. Miller, R. G. Miele, S. R. Hamilton, B.-K. Choi, T. I. Mitchell, et al. Functional analysis of the ALG3 gene encoding the Dol-P-Man: Man5GlcNAc2-PP-Dol mannosyltransferase enzyme of P. pastoris Glycobiology, May 1, 2004; 14(5): 399 - 407. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Louis, S. H. Duncan, S. I. McCrae, J. Millar, M. S. Jackson, and H. J. Flint Restricted Distribution of the Butyrate Kinase Pathway among Butyrate-Producing Bacteria from the Human Colon J. Bacteriol., April 1, 2004; 186(7): 2099 - 2106. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. A. Reeves and R. G. Olmstead Evolution of the TCP Gene Family in Asteridae: Cladistic and Network Approaches to Understanding Regulatory Gene Family Diversification and Its Impact on Morphological Evolution Mol. Biol. Evol., December 1, 2003; 20(12): 1997 - 2009. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Olsson, M. Thelander, and H. Ronne A Novel Type of Chloroplast Stromal Hexokinase Is the Major Glucose-phosphorylating Enzyme in the Moss Physcomitrella patens J. Biol. Chem., November 7, 2003; 278(45): 44439 - 44447. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. K. Greenwood, P. C. Butler, R. B. White, U. DeMarco, D. Pearce, and R. D. Fernald Multiple Corticosteroid Receptors in a Teleost Fish: Distinct Sequences, Expression Patterns, and Transcriptional Activities Endocrinology, October 1, 2003; 144(10): 4226 - 4236. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. G. Becker, J. Schweitzer, J. Feldner, T. Becker, and M. Schachner Tenascin-R as a Repellent Guidance Molecule for Developing Optic Axons in Zebrafish J. Neurosci., July 16, 2003; 23(15): 6232 - 6237. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Whitby, A. Stossel, C. Gamache, J. Papin, M. Bosch, A. Smith, D. H. Kedes, G. White, R. Kennedy, and D. P. Dittmer Novel Kaposi's Sarcoma-Associated Herpesvirus Homolog in Baboons J. Virol., July 15, 2003; 77(14): 8159 - 8165. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Rose, J. Henikoff, and S. Henikoff CODEHOP (COnsensus-DEgenerate Hybrid Oligonucleotide Primer) PCR primer design Nucleic Acids Res., July 1, 2003; 31(13): 3763 - 3766. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Provencher, G. LaPointe, S. Sirois, M.-R. Van Calsteren, and D. Roy Consensus-Degenerate Hybrid Oligonucleotide Primers for Amplification of Priming Glycosyltransferase Genes of the Exopolysaccharide Locus in Strains of the Lactobacillus casei Group Appl. Envir. Microbiol., June 1, 2003; 69(6): 3299 - 3307. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Zuber, M. J. Hynes, and A. Andrianopoulos The G-Protein {alpha}-Subunit GasC Plays a Major Role in Germination in the Dimorphic Fungus Penicillium marneffei Genetics, June 1, 2003; 164(2): 487 - 499. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. M. Rose, J. T. Ryan, E. R. Schultz, B. W. Raden, and C.-C. Tsai Analysis of 4.3 Kilobases of Divergent Locus B of Macaque Retroperitoneal Fibromatosis-Associated Herpesvirus Reveals a Close Similarity in Gene Sequence and Genome Organization to Kaposi's Sarcoma-Associated Herpesvirus J. Virol., May 1, 2003; 77(9): 5084 - 5097. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Kempf, M. E. Kadin, A. M. Dvorak, C. C. Lord, G. Burg, N. L. Letvin, and I. J. Koralnik Endogenous retroviral elements, but not exogenous retroviruses, are detected in CD30-positive lymphoproliferative disorders of the skin Carcinogenesis, February 1, 2003; 24(2): 301 - 306. [Full Text] [PDF] |
||||
![]() |
Y.-Q. Cheng, G.-L. Tang, and B. Shen Identification and Localization of the Gene Cluster Encoding Biosynthesis of the Antitumor Macrolactam Leinamycin in Streptomyces atroolivaceus S-140 J. Bacteriol., December 15, 2002; 184(24): 7013 - 7024. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. K. Sluis, L. A. Sayavedra-Soto, and D. J. Arp Molecular analysis of the soluble butane monooxygenase from 'Pseudomonas butanovora' Microbiology, November 1, 2002; 148(11): 3617 - 3629. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. D. McMahon, M. A. Dojka, N. R. Pace, D. Jenkins, and J. D. Keasling Polyphosphate Kinase from Activated Sludge Performing Enhanced Biological Phosphorus Removal Appl. Envir. Microbiol., October 1, 2002; 68(10): 4971 - 4978. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. S. Ahn, A. Konno, J. A. Gebe, A. Aruffo, M. J. Hamilton, Y. H. Park, and W. C. Davis Scavenger receptor cysteine-rich domains 9 and 11 of WC1 are receptors for the WC1 counter receptor J. Leukoc. Biol., August 1, 2002; 72(2): 382 - 390. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. J. L. Bell, A. Sunna, M. D. Gibbs, N. C. Curach, H. Nevalainen, and P. L. Bergquist Prospecting for novel lipase genes using PCR Microbiology, August 1, 2002; 148(8): 2283 - 2291. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Sanchez, J. R Lopez-Lopez, M T. Perez-Garcia, G. Sanz-Alfayate, A. Obeso, M. D Ganfornina, and C. Gonzalez Molecular identification of Kv{alpha} subunits that contribute to the oxygen-sensitive K+ current of chemoreceptor cells of the rabbit carotid body J. Physiol., July 15, 2002; 542(2): 369 - 382. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Wong, G. Butler, and K. H. Wolfe Gene order evolution and paleopolyploidy in hemiascomycete yeasts PNAS, July 9, 2002; 99(14): 9272 - 9277. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. S. Vangnai, D. J. Arp, and L. A. Sayavedra-Soto Two Distinct Alcohol Dehydrogenases Participate in Butane Metabolism by Pseudomonas butanovora J. Bacteriol., April 1, 2002; 184(7): 1916 - 1924. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. B. Utaker, K. Andersen, A. Aakra, B. Moen, and I. F. Nes Phylogeny and Functional Expression of Ribulose 1,5-Bisphosphate Carboxylase/Oxygenase from the Autotrophic Ammonia-Oxidizing Bacterium Nitrosospira sp.Isolate 40KI J. Bacteriol., January 15, 2002; 184(2): 468 - 478. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Waskiewicz, H. A. Rikhof, R. E. Hernandez, and C. B. Moens Zebrafish Meis functions to stabilize Pbx proteins and regulate hindbrain patterning Development, November 1, 2001; 128(21): 4139 - 4151. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. V. Armbrust and H. M. Galindo Rapid Evolution of a Sexual Reproduction Gene in Centric Diatoms of the Genus Thalassiosira Appl. Envir. Microbiol., August 1, 2001; 67(8): 3501 - 3513. [Abstract] [Full Text] [PDF] |
||||
![]() |
P.-J. Chen, S. Cho, S.-W. Jin, and R. E. Ellis Specification of Germ Cell Fates by FOG-3 Has Been Conserved During Nematode Evolution Genetics, August 1, 2001; 158(4): 1513 - 1525. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. M. Papa, N. M. Springer, M. G. Muszynski, R. Meeley, and S. M. Kaeppler Maize Chromomethylase Zea methyltransferase2 Is Required for CpNpG Methylation PLANT CELL, August 1, 2001; 13(8): 1919 - 1928. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Bartee, F. Malagnac, and J. Bender Arabidopsis cmt3 chromomethylase mutations block non-CG methylation and silencing of an endogenous gene Genes & Dev., July 15, 2001; 15(14): 1753 - 1758. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Ehrenshaft and M. E. Daub Isolation of PDX2, a Second Novel Gene in the Pyridoxine Biosynthesis Pathway of Eukaryotes, Archaebacteria, and a Subset of Eubacteria J. Bacteriol., June 1, 2001; 183(11): 3383 - 3390. [Abstract] [Full Text] |
||||
![]() |
Y. Kubo, S. Okazaki, T. Anzai, and H. Fujiwara Structural and Phylogenetic Analysis of TRAS, Telomeric Repeat-Specific Non-LTR Retrotransposon Families in Lepidopteran Insects Mol. Biol. Evol., May 1, 2001; 18(5): 848 - 857. [Abstract] [Full Text] |
||||
![]() |
S. Miyazawa, K. Azumi, and M. Nonaka Cloning and Characterization of Integrin {{alpha}} Subunits from the Solitary Ascidian, Halocynthia roretzi J. Immunol., February 1, 2001; 166(3): 1710 - 1715. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. A. Serikawa, D. M. Porterfield, and D. F. Mandoli Asymmetric Subcellular mRNA Distribution Correlates with Carbonic Anhydrase Activity in Acetabularia acetabulum Plant Physiology, February 1, 2001; 125(2): 900 - 911. [Abstract] [Full Text] |
||||
![]() |
C. McAnulla, C. A. Woodall, I. R. McDonald, A. Studer, S. Vuilleumier, T. Leisinger, and J. C. Murrell Chloromethane Utilization Gene Cluster from Hyphomicrobium chloromethanicum Strain CM2T and Development of Functional Gene Probes To Detect Halomethane-Degrading Bacteria Appl. Envir. Microbiol., January 1, 2001; 67(1): 307 - 316. [Abstract] [Full Text] |
||||
![]() |
S. van Beek and F. G. Priest Decarboxylation of Substituted Cinnamic Acids by Lactic Acid Bacteria Isolated during Malt Whisky Fermentation Appl. Envir. Microbiol., December 1, 2000; 66(12): 5322 - 5328. [Abstract] [Full Text] |
||||
![]() |
C. Handschin, M. Podvinec, and U. A. Meyer CXR, a chicken xenobiotic-sensing orphan nuclear receptor, is related to both mammalian pregnane X receptor (PXR) and constitutive androstane receptor (CAR) PNAS, September 26, 2000; 97(20): 10769 - 10774. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. R. Schultz, G. W. Rankin Jr., M.-P. Blanc, B. W. Raden, C.-C. Tsai, and T. M. Rose Characterization of Two Divergent Lineages of Macaque Rhadinoviruses Related to Kaposi's Sarcoma-Associated Herpesvirus J. Virol., May 15, 2000; 74(10): 4919 - 4928. [Abstract] [Full Text] |
||||
![]() |
X. Cao, N. M. Springer, M. G. Muszynski, R. L. Phillips, S. Kaeppler, and S. E. Jacobsen Conserved plant genes with similarity to mammalian de novo DNA methyltransferases PNAS, April 25, 2000; 97(9): 4979 - 4984. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Xiong, V. K. Singh, G. Cabrera, and R. K. Jayaswal Molecular characterization of the ferric-uptake regulator, Fur, from Staphylococcus aureus Microbiology, March 1, 2000; 146(3): 659 - 668. [Abstract] [Full Text] |
||||
![]() |
J. Essner, W. Branford, J Zhang, and H. Yost Mesendoderm and left-right brain, heart and gut development are differentially regulated by pitx2 isoforms Development, January 3, 2000; 127(5): 1081 - 1093. [Abstract] [PDF] |
||||
![]() |
J. G. Henikoff, E. A. Greene, S. Pietrokovski, and S. Henikoff Increased coverage of protein families with the Blocks Database servers Nucleic Acids Res., January 1, 2000; 28(1): 228 - 230. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. M. Lindroth, X. Cao, J. P. Jackson, D. Zilberman, C. M. McCallum, S. Henikoff, and S. E. Jacobsen Requirement of CHROMOMETHYLASE3 for Maintenance of CpXpG Methylation Science, June 15, 2001; 292(5524): 2077 - 2080. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. M. Roper, E. Raux, A. A. Brindley, H. L. Schubert, S. E. Gharbia, H. N. Shah, and M. J. Warren The Enigma of Cobalamin (Vitamin B12) Biosynthesis in Porphyromonas gingivalis. IDENTIFICATION AND CHARACTERIZATION OF A FUNCTIONAL CORRIN PATHWAY J. Biol. Chem., December 15, 2000; 275(51): 40316 - 40323. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. A. Yoder, M. G. Mueller, S. Wei, B. C. Corliss, D. M. Prather, T. Willis, R. T. Litman, J. Y. Djeu, and G. W. Litman Immune-type receptor genes in zebrafish share genetic and functional properties with genes encoded by the mammalian leukocyte receptor cluster PNAS, June 5, 2001; 98(12): 6771 - 6776. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||





































