Article |
dsCheck: highly sensitive off-target search software for double-stranded RNA-mediated RNA interference
1Department of Biophysics and Biochemistry, Graduate School of Science, University of Tokyo 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan 2Undergraduate Program for Bioinformatics and Systems Biology, School of Science, University of Tokyo 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan 3Department of Computational Biology, Graduate School of Frontier Sciences, University of Tokyo 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8562, Japan
*To whom correspondence should be addressed. Tel: +81 47 136 3984; Fax: +81 47 136 3977; Email: moris{at}k.u-tokyo.ac.jp
Received February 14, 2005. Revised March 21, 2005. Accepted March 21, 2005.
| ABSTRACT |
|---|
|
|
|---|
Off-target effects are one of the most serious problems in RNA interference (RNAi). Here, we present dsCheck (http://dsCheck.RNAi.jp/), web-based online software for estimating off-target effects caused by the long double-stranded RNA (dsRNA) used in RNAi studies. In the biochemical process of RNAi, the long dsRNA is cleaved by Dicer into short-interfering RNA (siRNA) cocktails. The software simulates this process and investigates individual 19 nt substrings of the long dsRNA. Subsequently, the software promptly enumerates a list of potential off-target gene candidates based on the order of off-target effects using its novel algorithm, which significantly improves both the efficiency and the sensitivity of the homology search. The website not only provides a rigorous off-target search to verify previously designed dsRNA sequences but also presents off-target minimized dsRNA design, which is essential for reliable experiments in RNAi-based functional genomics.
| INTRODUCTION |
|---|
|
|
|---|
RNA interference (RNAi) is now widely used to knockdown gene expression in a sequence-specific manner, making it a powerful tool for studying gene function (13). The process of RNAi is mediated by double-stranded RNA (dsRNA) that contains a sequence homologous to the target mRNA. Long dsRNA introduced into the cell is cleaved by the enzyme Dicer into short-interfering RNA (siRNA) followed by incorporation into the RNA-induced silencing complex (RISC), which is responsible for target mRNA degradation (4).
One of the most serious problems in RNAi is off-target silencing effects (5). Off-target silencing effects are caused by siRNA (introduced directly into cells, or produced in vivo from long dsRNA) that has sequence similarities with unrelated genes. In Caenorhabditis elegans, Drosophila or plants, RNAi experiments are usually performed using long dsRNAs. In these cases, there is a high risk of cross-suppression or co-suppression between closely related genes that share a highly conserved region.
To minimize the possibility of off-target effects, it is necessary to perform an off-target search to design dsRNA or siRNA that has limited sequence similarities with unrelated genes. Recently, fast and sensitive off-target search software for siRNA design has been reported (6,7), but commonly used siRNA design servers are not useful in performing off-target searches for long dsRNAs. DEQOR server uses BLAST to perform off-target searches for endoribonuclease-prepared siRNAs (8), although BLAST frequently fails to identify off-targets (6). Therefore, we have developed a new web-based online software system, dsCheck, to provide fast and accurate off-target searches for long dsRNA sequences. The software dices the input sequence into an siRNA cocktail and performs an exhaustive scan for each siRNA to find off-target gene candidates, simulating the biochemical process of dsRNA-mediated RNAi in vivo. dsCheck also provides efficient design of off-target minimized dsRNA by avoiding regions that share a considerable number of diced siRNAs with a specific off-target gene, and monitoring the total number of off-target hits. The software should be especially useful for checking whether previously designed dsRNAs have off-target gene candidates, as well as for designing target-specific dsRNA when off-target effects are suspected.
| METHODS |
|---|
|
|
|---|
Off-target search strategies for long dsRNA
The key idea of the program follows the biochemical process of dsRNA-mediated RNAi shown in Figure 1A. The input dsRNA sequence is diced into 19 nt substrings of an siRNA cocktail, and an exhaustive off-target search is performed for all individual siRNAs using the siDirect engine, which makes it possible to enumerate the complete set of off-targets in a reasonable amount of time (7). In dsCheck, the in silico dicing size is set to 19, as a complete match at the 19 nt double-stranded region of an siRNA is sufficient for the target mRNA degradation. For example, an input 500 bp dsRNA sequence is processed into 482 substrings each 19 nt in length, which are subjected to the off-target search individually. In the next step, all the hits with a complete match (i.e. 19/19 matches), one mismatch (18/19 matches) or two mismatches (17/19 matches) are counted individually for every off-target gene candidate and sorted in descending lexicographic order for the output.
|
Figure 1B shows a typical output for a 1497 bp query sequence of the Drosophila POU domain protein, pdm2 (NM_078834 [GenBank] , coding region). The result shows significant hits against pdm2 (two splicing variants: NM_078834 [GenBank] and NM_165017 [GenBank] ), and two unrelated genes, nub (NM_057311 [GenBank] ) and vvl (NM_079224 [GenBank] ). These proteins share the highly conserved POU domain shown in Figure 1C, indicating a high risk of cross-suppression by dsRNA targeting this region.
Designing off-target minimized dsRNA sequences
To design off-target minimized dsRNA sequences, one approach would be to suppose that the off-target effects are caused by a considerable number of collaborative hits by diced siRNAs on the same gene, and to select a region that minimizes the maximum number of collaborative off-target hits, which are defined as complete or partial matches of multiple 19 nt substrings against the same off-target gene. According to this criterion, dsCheck starts by selecting a region that minimizes the maximum number of complete match collaborative off-target hits. If multiple regions are optimal, it also examines the maximum number of partial match collaborative off-target hits to select the best one. If the complete match, collaborative hits on a sequence exceed 80% of the total number of diced 19 nt substrings, dsCheck regards the sequence as the intended target gene.
Some dsRNA sequences include 19 nt substrings that may react with a large number of off-target genes, which differs from the collaborative silencing effects acting on a single off-target gene. An additional criterion is necessary to evaluate the silencing effect of one siRNA sequence on many off-targets, although the effect may not be as serious as the collaborative silencing effect, as the concentration of single siRNA is low in diced siRNA cocktails. One reasonable measure would be the total number of off-target hits for each 19 nt substring of designed dsRNA. To attract attention to this risk, dsCheck displays a warning if the total number of off-target hits exceeds a specified threshold.
Figure 2 illustrates how dsCheck designs target-specific dsRNA for the Drosophila pdm2 gene (NM_078834 [GenBank] , coding region). Given that the length of dsRNA is 100 bp, dsCheck returns the positions 424523 for the target-specific region that successfully avoids the collaborative silencing effects on the major off-target genes nub (NM_057311 [GenBank] ) and vvl (NM_079224 [GenBank] ).
|
Efficacy of each diced siRNA
In mammalian RNAi, the efficacy of each siRNA varies widely depending on its sequence; hence, several groups have reported guidelines for the selection of siRNAs (912). However, in Drosophila cells, it is reported that most, if not all, siRNA sequences may act as effective silencers (9). Incorporation of siRNA efficacy prediction may run the risk of underestimating off-target effects in non-mammalian RNAi. Therefore, all siRNA sequences are treated equally in dsCheck.
Database maintenance
Currently, off-target searches can be performed against the Drosophila, C.elegans, Arabidopsis and Oryza sativa mRNA sequences stored in the NCBI RefSeq database (13). Since off-target searches demand a substantial number of mRNA sequences that are likely to cover the entire set of transcripts, we plan to incorporate additional species when ample cDNA collections are available.
| ACKNOWLEDGEMENTS |
|---|
This work was supported in part by the Special Coordination Fund for Promoting Science and Technology to K.S., the Leading Project for Biosimulation to S.M. and Grants-in-Aid for Scientific Research to K.U.-T., K.S. and S.M. from the Ministry of Education, Culture, Sports, Science and Technology of Japan. Funding to pay the Open Access publication charges for this article was provided by the Ministry of Education, Culture, Sports, Science and Technology of Japan.
Conflict of interest statement. None declared.
| Footnotes |
|---|
Correspondence may also be addressed to Kumiko Ui-Tei. Tel: +81 3 5841 3044; Fax: +81 3 5841 3044; Email: ktei{at}biochem.s.u-tokyo.ac.jp
The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors
| REFERENCES |
|---|
|
|
|---|
- Fire, A., Xu, S., Montgomery, M.K., Kostas, S.A., Driver, S.E., Mello, C.C. (1998) Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans Nature, 391, 806811[CrossRef][Medline] .
- Dykxhoorn, D.M., Novina, C.D., Sharp, P.A. (2003) Killing the messenger: short RNAs that silence gene expression Nature Rev. Mol. Cell Biol., 4, 457467[CrossRef][Web of Science][Medline] .
- Mello, C.C. and Conte, D., Jr. (2004) Revealing the world of RNA interference Nature, 431, 338342[CrossRef][Medline] .
- Meister, G. and Tuschl, T. (2004) Mechanisms of gene silencing by double-stranded RNA Nature, 431, 343349[CrossRef][Medline] .
- Jackson, A.L. and Linsley, P.S. (2004) Noise amidst the silence: off-target effects of siRNAs? Trends Genet., 20, 521524[CrossRef][Web of Science][Medline] .
- Naito, Y., Yamada, T., Ui-Tei, K., Morishita, S., Saigo, K. (2004) siDirect: highly effective, target-specific siRNA design software for mammalian RNA interference Nucleic Acids Res., 32, W124W129
[Abstract/Free Full Text] . - Yamada, T. and Morishita, S. (2005) Accelerated off-target search algorithm for siRNA Bioinformatics, 21, 13161324
[Abstract/Free Full Text] . - Henschel, A., Buchholz, F., Habermann, B. (2004) DEQOR: a web-based tool for the design and quality control of siRNAs Nucleic Acids Res., 32, W113W120
[Abstract/Free Full Text] . - Ui-Tei, K., Naito, Y., Takahashi, F., Haraguchi, T., Ohki-Hamazaki, H., Juni, A., Ueda, R., Saigo, K. (2004) Guidelines for the selection of highly effective siRNA sequences for mammalian and chick RNA interference Nucleic Acids Res., 32, 936948
[Abstract/Free Full Text] . - Reynolds, A., Leake, D., Boese, Q., Scaringe, S., Marshall, W.S., Khvorova, A. (2004) Rational siRNA design for RNA interference Nat. Biotechnol., 22, 326330[CrossRef][Web of Science][Medline] .
- Amarzguioui, M. and Prydz, H. (2004) An algorithm for selection of functional siRNA sequences Biochem. Biophys. Res. Commun., 316, 10501058[CrossRef][Web of Science][Medline] .
- Chalk, A.M., Wahlestedt, C., Sonnhammer, E.L. (2004) Improved and automated prediction of effective siRNA Biochem. Biophys. Res. Commun., 319, 264274[CrossRef][Web of Science][Medline] .
- Pruitt, K.D., Tatusova, T., Maglott, D.R. (2005) NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins Nucleic Acids Res., 33, D501D504
[Abstract/Free Full Text] .
This article has been cited by other articles:
![]() |
X. Wang, Y. Wu, and B. Zhou Dietary zinc absorption is mediated by ZnT1 in Drosophila melanogaster FASEB J, August 1, 2009; 23(8): 2650 - 2661. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Dahlgren, H.-Y. Zhang, Q. Du, M. Grahn, G. Norstedt, C. Wahlestedt, and Z. Liang Analysis of siRNA specificity on targets with double-nucleotide mismatches Nucleic Acids Res., May 1, 2008; 36(9): e53 - e53. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Xu, S.-H. Mirmalek-Sani, F. Lin, J. Zhang, and R. O.C. Oreffo Adipocyte differentiation induced using nonspecific siRNA controls in cultured human mesenchymal stem cells RNA, August 1, 2007; 13(8): 1179 - 1183. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Matsumoto, M. Ukai-Tadenuma, R. G. Yamada, J. Houl, K. D. Uno, T. Kasukawa, B. Dauwalder, T. Q. Itoh, K. Takahashi, R. Ueda, et al. A functional genomics strategy reveals clockwork orange as a transcriptional regulator in the Drosophila circadian clock Genes & Dev., July 1, 2007; 21(13): 1687 - 1700. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. Flockhart, M. Booker, A. Kiger, M. Boutros, S. Armknecht, N. Ramadan, K. Richardson, A. Xu, N. Perrimon, and B. Mathey-Prevot FlyRNAi: the Drosophila RNAi screening center database Nucleic Acids Res., January 1, 2006; 34(suppl_1): D489 - D494. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||





