Published online 25 September 2005
Article |
Analysis of the transcriptome of the protozoan Theileria parva using MPSS reveals that the majority of genes are transcriptionally active in the schizont stage
The International Livestock Research Institute (ILRI) PO Box 30709, Nairobi, Kenya 1Department of Computer Science, University of Exeter North Park Road, Exeter EX4 4QF, UK 2Department of Biochemistry and Microbiology Petch Building, Ring Road University of Victoria Victoria BC V8W 3P6, Canada 3Department of Computer Science, University of Manchester Oxford Road, Manchester M13 9PL, UK 4The Institute for Genomic Research (TIGR) 9712 Medical Center Drive, Rockville, MD 20850, USA
*To whom correspondence should be addressed. Tel: +254 20 4223002; Email: r.bishop{at}cgiar.org
Received June 2, 2005. Revised July 19, 2005. Accepted August 19, 2005.
Massively parallel signature sequencing (MPSS) was used to analyze the transcriptome of the intracellular protozoan Theileria parva. In total 1 095 000, 20 bp sequences representing 4371 different signatures were generated from T.parva schizonts. Reproducible signatures were identified within 73% of potentially detectable predicted genes and 83% had signatures in at least one MPSS cycle. A predicted leader peptide was detected on 405 expressed genes. The quantitative range of signatures was 452 256 transcripts per million (t.p.m.). Rare transcripts (<50 t.p.m.) were detected from 36% of genes. Sequence signatures approximated a lognormal distribution, as in microarray. Transcripts were widely distributed throughout the genome, although only 47% of 138 telomere-associated open reading frames exhibited signatures. Antisense signatures comprised 13.8% of the total, comparable with Plasmodium. Eighty five predicted genes with antisense signatures lacked a sense signature. Antisense transcripts were independently amplified from schizont cDNA and verified by sequencing. The MPSS transcripts per million for seven genes encoding schizont antigens recognized by bovine CD8 T cells varied 1000-fold. There was concordance between transcription and protein expression for heat shock proteins that were very highly expressed according to MPSS and proteomics. The data suggests a low level of baseline transcription from the majority of protein-coding genes.