Exploring the Exceptional Genomic Word Symmetry among Regions

Research Article

Austin J Proteomics Bioinform & Genomics. 2015; 2(1): 1011.

Exploring the Exceptional Genomic Word Symmetry among Regions

Afreixo V1,2,3,4*, Rodrigues JMOS4,5 and Bastos CAC4,5

¹Department of Mathematics, University of Aveiro, Portugal

²Institute for Research in Biomedicine, University of Aveiro, Portugal

³Center for Research and Development in Mathematics and Applications, University of Aveiro, Portugal

4Institute of Electronics and Telematics Engineering of Aveiro, University of Aveiro, Portugal

5Department of Electronics Telecommunications and Informatics, University of Aveiro, Portugal

*Corresponding author: Vera Afreixo, Department of Mathematics, University of Aveiro, Aveiro, Portugal

Received: February 13, 2015; Accepted: October 05, 2015; Published: October 10, 2015

Abstract

The second Chargaff’s parity rule and its extensions are reported as a universal phenomenon in DNA sequences [1-3]. However, parity of the frequencies of reverse complementary oligonucleotides could be a mere consequence of the single nucleotide parity rule, if nucleotide independence is assumed. Exceptional symmetry has been proposed as a meaningful measure of the extension of the second parity rule to oligonucleotides (symmetry above that expected in independence contexts) [4]. For short genomic word length (lower than 13) the global exceptional symmetry was detected in long and short organism genomes [5]. But there are some issues that have not been explored: Is the local symmetry behavior distinct from the symmetry in the full organism? What is the variation of the exceptional symmetry along the sequence? To explore the exceptional genomic word symmetry along the genome sequences, we propose a sliding window method to extract the values of exceptional symmetry (for all words or by word groups). We compare the exceptional symmetry effect distribution in real genomes against control scenarios, testing the differences and performing a residual analysis. We compare and sort the word groups taking into account the exceptional symmetry variation along the sequence and the exceptional symmetry effect distribution by word group.

Introduction

Chargaff’s first parity rule states that, in any sequence of doublestranded DNA molecules, the total number of complementary nucleotides is exactly equal. Chargaff’s second parity rule states that those quantities are almost equal in a single strand of DNA.

%A≅T and %C ≅G

The extensions to the second parity rule state that, in each DNA strand, the proportion of an oligonucleotide should be similar to that of its reversed complement [1].