Nucleic Acids Research, 1984, Vol. 12, No. 21 7965-7973
© 1984
MOLECULAR BIOLOGY |
Nucleotide sequence of a Sequence of a sendai virus genome region covering the entire M gene and the 3' proximal 1013 nudeotides of the F gene
Dept. Physiol. Chem., Tokyo Metropolitan Inst. Med. Sci. Honkomagome, Bunkyo-ku, Tokyo 113 1Dept. Microbiol., Fac. Med., Univ. Tokyo Hongo, Bunkyo-ku, Tokyo 113, Japan 2Dept. Viral Infection, Inst. Med. Sci., Univ. Tokyo Shirokanedai, Minato-ku, Tokyo 108, Japan
*To whom correspondence should be addressed
Received September 24, 1984. Accepted October 8, 1984.
We determined the sequence of the 2,138 nucleotides in the Sendai virus genome just following the 3' proximal 3,686 nucleotides which we had previously reported (Nucleic Acids Res. 11, 73177330, 1983). This covers the entire third gene of 1,173 nucleotides and the 3' proximal 1,013 nucleotides of the fourth gene. Like the NP and P+C genes, both the third and fourth genes start from consensus sequence Rl (3'-UCCCAC(or UA)UUUC) at the 3' end and the third gene terminates with consensus sequence R2 (3'-AUUCUUUUU) at the 5' end. The third gene was identified as M, and the deduced 348 amino acids indicated that the M protein is rich in basic residues and has hydrophobic domains near the C-terminal. The fourth gene, although sequencing is not complete yet, was identified as F, since a large open reading frame found in the gene contains the characteristic sequence of 20 amino acids located at the N-terminal of the F1 protein. Analyses of the amino acid sequence suggested that the structure of the F gene product is NH2-signal peptide-F2-F1-COOH.