An efficient uniform-cost normalized edit distance algorithm- 学术资源搜索

An efficient uniform-cost normalized edit distance algorithm

AN Arslan, O Egecioglu - 6th International Symposium on …, 1999 - ieeexplore.ieee.org

6th International Symposium on String Processing and Information …, 1999•ieeexplore.ieee.org

A common model for computing the similarity of two strings X and Y of lengths m, and n respectively with m/spl ges/n, is to transform X into Y through a sequence of three types of edit operations: insertion, deletion, and substitution. The model assumes a given cost function which assigns a non-negative real weight to each edit operation. The amortized weight for a given edit sequence is the ratio of its weight to its length, and the minimum of this ratio over all edit sequences is the normalized edit distance. Existing algorithms for normalized edit distance computation with proven complexity bounds require O(mn/sup 2/) time in the worst-case. We give an O(mn log n)-time algorithm for the problem when the cost function is uniform, i.e., the weight of each edit operation is constant within the same type, except substitutions can have different weights depending on whether they are matching or non-matching.

ieeexplore.ieee.org

展开收起

被引用次数：29 相关文章所有 4 个版本

以上显示的是最相近的搜索结果。查看全部搜索结果

高级搜索

QQ 群

An efficient uniform-cost normalized edit distance algorithm

引用