TY - GEN
T1 - Emergence of new structure from non-stationary analysis of genomic sequences
AU - Bouaynaya, Nidhal
AU - Schonfeld, Dan
PY - 2008
Y1 - 2008
N2 - In this paper, we will bring to bear new tools to analyze non-stationary signals that have emerged in the statistical and signal processing community over the past few years. The emergence of these new methods will be used to shed new light and help resolve the issues of (i) the existence of long-range correlations in DNA sequences and (ii) whether they are present in both coding and non-coding segments or only in the latter. It turns out that the statistical differences between coding and non-coding segments are much more subtle than previously thought using stationary analysis. In particular, both coding and non-coding sequences exhibit long-range correlations, as asserted by a 1/f β(n) evolutionary (i.e., time-dependent) spectrum. However, we will use an index of randomness, which we derive from the Hilbert-Huang Transform, to demonstrate that coding sequences, although not random as previously suspected, are often "more random" (i.e., more white) than non-coding sequences. Moreover, the study of the evolution of the rate of change of these time-dependent parameters in homologous gene families shows a sudden jump around the rat, which might be related to the well-known supercharged evolution of this rodent.
AB - In this paper, we will bring to bear new tools to analyze non-stationary signals that have emerged in the statistical and signal processing community over the past few years. The emergence of these new methods will be used to shed new light and help resolve the issues of (i) the existence of long-range correlations in DNA sequences and (ii) whether they are present in both coding and non-coding segments or only in the latter. It turns out that the statistical differences between coding and non-coding segments are much more subtle than previously thought using stationary analysis. In particular, both coding and non-coding sequences exhibit long-range correlations, as asserted by a 1/f β(n) evolutionary (i.e., time-dependent) spectrum. However, we will use an index of randomness, which we derive from the Hilbert-Huang Transform, to demonstrate that coding sequences, although not random as previously suspected, are often "more random" (i.e., more white) than non-coding sequences. Moreover, the study of the evolution of the rate of change of these time-dependent parameters in homologous gene families shows a sudden jump around the rat, which might be related to the well-known supercharged evolution of this rodent.
UR - https://www.scopus.com/pages/publications/51549119397
UR - https://www.scopus.com/pages/publications/51549119397#tab=citedBy
U2 - 10.1109/GENSIPS.2008.4555666
DO - 10.1109/GENSIPS.2008.4555666
M3 - Conference contribution
AN - SCOPUS:51549119397
SN - 9781424423729
T3 - GENSIPS'08 - 6th IEEE International Workshop on Genomic Signal Processing and Statistics
BT - GENSIPS'08 - 6th IEEE International Workshop on Genomic Signal Processing and Statistics
T2 - 6th IEEE International Workshop on Genomic Signal Processing and Statistics, GENSIPS'08
Y2 - 8 June 2008 through 10 June 2008
ER -