Distribution of distances between symmetric words in the human genome: analysis of regular peaks

Bibliographic Details
Main Author: Bastos, Carlos A. C. (author)
Other Authors: Afreixo, Vera (author), Rodrigues, João M. O. S. (author), Pinho, Armando J. (author), Silva, Raquel M. (author)
Format: article
Language: eng
Published: 2020
Subjects:
Online Access: http://hdl.handle.net/10773/27706
Country: Portugal
Oai: oai:ria.ua.pt:10773/27706
Description
Summary: Finding DNA sites with high potential for the formation of hairpin/cruciform structures is an important task. Previous works studied the distances between adjacent reversed complement words (symmetric word pairs) and also between non-adjacent ones. They observed that for some words a few distances were favoured (peaks) and that some distributions showed strong peak regularity. The present work extends those studies by improving the detection and characterization of peak regularities in the symmetric word pair distance distributions of the human genome. It also analyzes the location of the sequences that give rise to the observed strong peak periodicity in the distance distributions. The results may indicate genomic sites with potential for the formation of hairpin/cruciform structures.
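
To make the notion of a symmetric word pair distance concrete, here is a minimal Python sketch. It is not the authors' implementation. It assumes that an "adjacent" pair is an occurrence of a word followed by the next occurrence of its reversed complement, with no other occurrence of the word in between, and that distance is measured between start positions. The function names are hypothetical.

```python
from collections import Counter

# Complement table for the four DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(word):
    """Return the reversed complement of a DNA word, e.g. ACC -> GGT."""
    return word.translate(COMPLEMENT)[::-1]


def symmetric_pair_distances(seq, word):
    """Count distances between `word` and the next occurrence of its
    reversed complement (assumed definition of an adjacent pair)."""
    k = len(word)
    rc = reverse_complement(word)
    distances = Counter()
    last = None  # start of the most recent unmatched occurrence of `word`
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer == rc and last is not None:
            distances[i - last] += 1  # distance between start positions
            last = None
        if kmer == word:
            last = i
    return distances


if __name__ == "__main__":
    print(symmetric_pair_distances("ACCTTGGTAAACC", "ACC"))
```

Under these assumptions, distances whose counts stand well above their neighbours in the resulting histogram would correspond to the peaks described in the summary.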