Fast kernels for string and tree matching

SVN Vishwanathan, AJ Smola - 2004 - direct.mit.edu
This algorithm can be extended in various ways to provide linear time prediction cost in the
length of the sequence to be classified. We demonstrate extensions in the case of position …

[PDF][PDF] Fast string kernels using inexact matching for protein sequences.

C Leslie, R Kuang, K Bennett - Journal of Machine Learning Research, 2004 - jmlr.org
We describe several families of k-mer based string kernels related to the recently presented
mismatch kernel and designed for use with support vector machines (SVMs) for …

Efficient approximate and dynamic matching of patterns using a labeling paradigm

SC Sahinalp, U Vishkin - … of 37th Conference on Foundations of …, 1996 - ieeexplore.ieee.org
A key approach in string processing algorithmics has been the labeling paradigm which is
based on assigning labels to some of the substrings of a given string. If these labels are …

Efficient tree pattern matching

SR Kosaraju - 30th Annual Symposium on Foundations of Computer …, 1989 - computer.org
A classic open problem on tree pattern matching is whether the naive O (mn)-step algorithm
for finding all the occurrences of a pattern tree of size m in a text tree of size n can be …

Faster algorithms for string matching problems: Matching the convolution bound

P Indyk - Proceedings 39th Annual Symposium on Foundations …, 1998 - ieeexplore.ieee.org
In this paper we give a randomized O (nlogn)-time algorithm for the string matching with
don't cares problem. This improves the Fischer-Paterson bound from 1974 and answers the …

Transforming strings to vector spaces using prototype selection

B Spillmann, M Neuhaus, H Bunke, E Pękalska… - Structural, Syntactic, and …, 2006 - Springer
A common way of expressing string similarity in structural pattern recognition is the edit
distance. It allows one to apply the k NN rule in order to classify a set of strings. However …

[PDF][PDF] VGRAM: Improving Performance of Approximate Queries on String Collections Using Variable-Length Grams.

C Li, B Wang, X Yang - VLDB, 2007 - researchgate.net
Many applications need to solve the following problem of approximate string matching: from
a collection of strings, how to find those similar to a given string, or the strings in another …

[图书][B] String searching algorithms

GA Stephen - 1994 - books.google.com
String searching is a subject of both theoretical and practical interest in computer science.
This book presents a bibliographic overview of the field and an anthology of detailed …

[PDF][PDF] The stringdist package for approximate string matching.

MPJ Van der Loo - R J., 2014 - journal.r-project.org
Comparing text strings in terms of distance functions is a common and fundamental task in
many statistical text-processing applications. Thus far, string distance functionality has been …

A faster algorithm for approximate string matching

R Baeza-Yates, G Navarro - Annual Symposium on Combinatorial Pattern …, 1996 - Springer
We present a new algorithm for on-line approximate string matching. The algorithm is based
on the simulation of a non-deterministic finite automaton built from the pattern and using the …