Nucleic Acids Research, 1982, Vol. 10, No. 24 8323-8339
© 1982
MOLECULAR BIOLOGY |
Statistical significance of symmetrical and repetitive segments in DNA
Departments of Mathematics and Biochemistry, University of Maine , Orono ME 04473, USA
* To whom inquiries about programs should be addressed
Received July 27, 1982. Accepted October 18, 1982.
Methods of computer analysis for the recurrence of symmetrical and repetitive elements in large numbers of DNA sequences are described, together with derivations of appropriate quantitative criteria for the evaluation of the statistical significance of these elements in DNAs of different base composition. Examples of some extraordinary variations in the occurence of symmetrical and repetitive elements are provided, many of which are new.
Special consideration is devoted to a determination of the statistical significance of a two-fold palindrome at the origin of replication. A computer search of 14 independently determined DNA sequences containing an origin of replication locus indicates each contains a large two-fold palindrome. The average length of this palindrome is 28±6 base pairs, of which 22 contribute to the palindrome symmetry. The probability of occurrence of such a palindrome is only 1/26000, while the probability of occurrence in all 14 different species is (1/26000)14.