A normalized Levenshtein distance metric

L Yujian, L Bo - IEEE transactions on pattern analysis and …, 2007 - ieeexplore.ieee.org
Although a number of normalized edit distances presented so far may offer good
performance in some applications, none of them can be regarded as a genuine metric …

Using the miraEST assembler for reliable and automated mRNA transcript assembly and SNP detection in sequenced ESTs

B Chevreux, T Pfisterer, B Drescher, AJ Driesel… - Genome …, 2004 - genome.cshlp.org
We present an EST sequence assembler that specializes in reconstruction of pristine mRNA
transcripts, while at the same time detecting and classifying single nucleotide …

Tight hardness results for LCS and other sequence similarity measures

A Abboud, A Backurs… - 2015 IEEE 56th Annual …, 2015 - ieeexplore.ieee.org
Two important similarity measures between sequences are the longest common
subsequence (LCS) and the dynamic time warping distance (DTWD). The computations of …

Computational complexity of algorithms for sequence comparison, short-read assembly and genome alignment

S Baichoo, CA Ouzounis - Biosystems, 2017 - Elsevier
A multitude of algorithms for sequence comparison, short-read assembly and whole-
genome alignment have been developed in the general context of molecular biology, to …

Eliminating fuzzy duplicates in data warehouses

R Ananthakrishna, S Chaudhuri, V Ganti - VLDB'02: Proceedings of the …, 2002 - Elsevier
This chapter develops an algorithm for eliminating duplicates in
dimensional tables in a data warehouse, which are usually associated with hierarchies. It …

Consequences of faster alignment of sequences

A Abboud, VV Williams, O Weimann - … , July 8-11, 2014, Proceedings, Part I …, 2014 - Springer
The Local Alignment problem is a classical problem with applications in biology.
Given two input strings and a scoring function on pairs of letters, one is asked to find the …

[BOOK][B] Handbook of computational molecular biology

S Aluru - 2005 - taylorfrancis.com
The enormous complexity of biological systems at the molecular level must be answered
with powerful computational methods. Computational biology is a young field, but has seen …

On the similarity metric and the distance metric

S Chen, B Ma, K Zhang - Theoretical Computer Science, 2009 - Elsevier
Similarity and dissimilarity measures are widely used in many research areas and
applications. When a dissimilarity measure is used, it is normally required to be a distance …

Getting more from less: algorithms for rapid protein identification with multiple short peptide sequences

AJ Mackey, TAJ Haystead, WR Pearson - Molecular & Cellular Proteomics, 2002 - ASBMB
We describe two novel sequence similarity search algorithms, FASTS and FASTF, that use
multiple short peptide sequences to identify homologous sequences in protein or DNA …

Towards a reliable objective function for multiple sequence alignments

JD Thompson, F Plewniak, R Ripp, JC Thierry… - Journal of molecular …, 2001 - Elsevier
Multiple sequence alignment is a fundamental tool in a number of different domains in
modern molecular biology, including functional and evolutionary studies of a protein family …