Paper: | MLSP-P6.1 |
Session: | Biomedical and Other Applications |
Time: | Friday, May 19, 16:30 - 18:30 |
Presentation: |
Poster
|
Topic: |
Machine Learning for Signal Processing: Bioinformatics Applications |
Title: |
Exploring Three-Base Periodicity for DNA Compression and Modeling |
Authors: |
Paulo J. S. G. Ferreira, António J. R. Neves, Vera Afreixo, Armando J. Pinho, University of Aveiro, Portugal |
Abstract: |
To explore the three-base periodicity often found in protein-coding DNA regions, we introduce a DNA model based on three deterministic states, where each state implements a finite-context model. The results obtained show compression gains in relation to the single finite-context model counterpart. Additionally, and potentially more interesting than the compression gain on its own, is the observation that the entropy associated to each of the three states differs and that this variation is not the same among the organisms analyzed. |