cWINNOWER algorithm for finding fuzzy DNA motifs

S Liang, MP Samanta, BA Biegel - Journal of bioinformatics and …, 2004 - World Scientific
Journal of bioinformatics and computational biology, 2004World Scientific
The cWINNOWER algorithm detects fuzzy motifs in DNA sequences rich in proteinbinding
signals. A signal is defined as any short nucleotide pattern having up to d mutations differing
from a motif of length l. The algorithm finds such motifs if a clique consisting of a suffciently
large number of mutated copies of the motif (ie, the signals) is present in the DNA sequence.
The cWINNOWER algorithm substantially improves the sensitivity of the winnower method of
Pevzner and Sze by imposing a consensus constraint, enabling it to detect much weaker …
The cWINNOWER algorithm detects fuzzy motifs in DNA sequences rich in proteinbinding signals. A signal is defined as any short nucleotide pattern having up to d mutations differing from a motif of length l. The algorithm finds such motifs if a clique consisting of a suffciently large number of mutated copies of the motif (i.e., the signals) is present in the DNA sequence. The cWINNOWER algorithm substantially improves the sensitivity of the winnower method of Pevzner and Sze by imposing a consensus constraint, enabling it to detect much weaker signals. We studied the minimum detectable clique size qc as a function of sequence length N for random sequences. We found that qc increases linearly with N for a fast version of the algorithm based on counting threemember sub-cliques. Imposing consensus constraints reduces qc by a factor of three in this case, which makes the algorithm dramatically more sensitive. Our most sensitive algorithm, which counts four-member sub-cliques, needs a minimum of only 13 signals to detect motifs in a sequence of length N=12,000 for (l,d)=(15,4).
World Scientific
以上显示的是最相近的搜索结果。 查看全部搜索结果