Published online 24 June 2005
Article |
Human microRNA prediction through a probabilistic co-learning model of sequence and structure
1Graduate Program in Bioinformatics, Seoul National University Seoul 151-744, Korea 2Center for Bioinformation Technology (CBIT), Seoul National University Seoul 151-744, Korea 3Biointelligence Laboratory, School of Computer Science and Engineering, Seoul National University Seoul 151-744, Korea 4Department of Biological Sciences, Seoul National University Seoul 151-744, Korea
*To whom correspondence should be addressed. Tel: +82 2 880 1847; Fax: +82 2 875 2240; Email: btzhang{at}bi.snu.ac.kr
Received January 10, 2005. Revised May 27, 2005. Accepted June 6, 2005.
MicroRNAs (miRNAs) are small regulatory RNAs of
22 nt. Although hundreds of miRNAs have been identified through experimental complementary DNA cloning methods and computational efforts, previous approaches could detect only abundantly expressed miRNAs or close homologs of previously identified miRNAs. Here, we introduce a probabilistic co-learning model for miRNA gene finding, ProMiR, which simultaneously considers the structure and sequence of miRNA precursors (pre-miRNAs). On 5-fold cross-validation with 136 referenced human datasets, the efficiency of the classification shows 73% sensitivity and 96% specificity. When applied to genome screening for novel miRNAs on human chromosomes 16, 17, 18 and 19, ProMiR effectively searches distantly homologous patterns over diverse pre-miRNAs, detecting at least 23 novel miRNA gene candidates. Importantly, the miRNA gene candidates do not demonstrate clear sequence similarity to the known miRNA genes. By quantitative PCR followed by RNA interference against Drosha, we experimentally confirmed that 9 of the 23 representative candidate genes express transcripts that are processed by the miRNA biogenesis enzyme Drosha in HeLa cells, indicating that ProMiR may successfully predict miRNA genes with at least 40% accuracy. Our study suggests that the miRNA gene family may be more abundant than previously anticipated, and confer highly extensive regulatory networks on eukaryotic cells.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
A. Oulas, A. Boutla, K. Gkirtzou, M. Reczko, K. Kalantidis, and P. Poirazi Prediction of novel microRNA genes in cancer-associated genomic regions--a combined computational and experimental approach Nucleic Acids Res., June 1, 2009; 37(10): 3276 - 3287. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. D. Mendes, A. T. Freitas, and M.-F. Sagot Current tools for the identification of miRNA genes and their targets Nucleic Acids Res., May 1, 2009; 37(8): 2419 - 2433. [Abstract] [Full Text] [PDF] |
||||
![]() |
Z. Ghosh, B. Mallick, and J. Chakrabarti Cellular versus viral microRNAs in host-virus interaction Nucleic Acids Res., March 1, 2009; 37(4): 1035 - 1048. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Xu, X. Zhou, and W. Zhang MicroRNA prediction with a novel ranking algorithm based on random walks Bioinformatics, July 1, 2008; 24(13): i50 - i58. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Terai, T. Komori, K. Asai, and T. Kin miRRim: A novel system to find conserved miRNAs with high sensitivity and specificity RNA, December 1, 2007; 13(12): 2081 - 2090. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Meyer A practical guide to the art of RNA gene prediction Brief Bioinform, November 1, 2007; 8(6): 396 - 414. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Jiang, H. Wu, W. Wang, W. Ma, X. Sun, and Z. Lu MiPred: classification of real and pseudo microRNA precursors using random forest prediction model with combined features Nucleic Acids Res., July 13, 2007; 35(suppl_2): W339 - W344. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. L. S. Ng and S. K. Mishra De novo SVM classification of precursor microRNAs from genomic pseudo hairpins using global and intrinsic folding measures Bioinformatics, June 1, 2007; 23(11): 1321 - 1330. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Piriyapongsa, L. Marino-Ramirez, and I. K. Jordan Origin and Evolution of Human microRNAs From Transposable Elements Genetics, June 1, 2007; 176(2): 1323 - 1337. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. NG Kwang Loong and S. K. Mishra Unique folding of precursor microRNAs: Quantitative evidence and implications for de novo identification RNA, February 1, 2007; 13(2): 170 - 187. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. A. Helvik, O. Snove Jr, and P. Saetrom Reliable prediction of Drosha processing sites improves microRNA gene prediction Bioinformatics, January 15, 2007; 23(2): 142 - 149. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Osada and T. Takahashi MicroRNAs in biological processes and carcinogenesis Carcinogenesis, January 1, 2007; 28(1): 2 - 12. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Berezikov, G. van Tetering, M. Verheul, J. van de Belt, L. van Laake, J. Vos, R. Verloop, M. van de Wetering, V. Guryev, S. Takada, et al. Many novel mammalian microRNA candidates identified by extensive cloning and RAKE analysis Genome Res., October 1, 2006; 16(10): 1289 - 1298. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-W. Nam, J. Kim, S.-K. Kim, and B.-T. Zhang ProMiR II: a web server for the probabilistic prediction of clustered, nonclustered, conserved and nonconserved microRNAs. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W455 - W458. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Yousef, M. Nebozhyn, H. Shatkay, S. Kanterakis, L. C. Showe, and M. K. Showe Combining multi-species genomic data for microRNA identification using a Naive Bayes classifier Bioinformatics, June 1, 2006; 22(11): 1325 - 1334. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. He, L. Cai, G. Skogerbo, W. Deng, T. Liu, X. Zhu, Y. Wang, D. Jia, Z. Zhang, Y. Tao, et al. Profiling Caenorhabditis elegans non-coding RNA expression with a combined microarray Nucleic Acids Res., May 31, 2006; 34(10): 2976 - 2983. [Abstract] [Full Text] [PDF] |
||||






