Alignment-free inference of hierarchical and reticulate phylogenomic relationships

G Bernard, CX Chan, Y Chan, XY Chua… - Briefings in …, 2019 - academic.oup.com
We are amidst an ongoing flood of sequence data arising from the application of high-
throughput technologies, and a concomitant fundamental revision in our understanding of …

[图书][B] Kernel methods in computational biology

B Schölkopf, K Tsuda, JP Vert - 2004 - books.google.com
A detailed overview of current research in kernel methods and their application to
computational biology. Modern machine learning techniques are proving to be extremely …

Sketching and sublinear data structures in genomics

G Marçais, B Solomon, R Patro… - Annual Review of …, 2019 - annualreviews.org
Large-scale genomics demands computational methods that scale sublinearly with the
growth of data. We review several data structures and sketching techniques that have been …

[图书][B] Handbook of data structures and applications

DP Mehta, S Sahni - 2004 - taylorfrancis.com
Although there are many advanced and specialized texts and handbooks on algorithms,
until now there was no book that focused exclusively on the wide variety of data structures …

Fast kernels for string and tree matching

SVN Vishwanathan, AJ Smola - 2004 - direct.mit.edu
This algorithm can be extended in various ways to provide linear time prediction cost in the
length of the sequence to be classified. We demonstrate extensions in the case of position …

Reducing the space requirement of suffix trees

S Kurtz - Software: Practice and Experience, 1999 - Wiley Online Library
We show that suffix trees store various kinds of redundant information. We exploit these
redundancies to obtain more space efficient representations. The most space efficient of our …

Text document clustering based on frequent word meaning sequences

Y Li, SM Chung, JD Holt - Data & Knowledge Engineering, 2008 - Elsevier
Most of existing text clustering algorithms use the vector space model, which treats
documents as bags of words. Thus, word sequences in the documents are ignored, while …

Efficient phrase-based document similarity for clustering

H Chim, X Deng - IEEE Transactions on knowledge and data …, 2008 - ieeexplore.ieee.org
In this paper, we propose a phrase-based document similarity to compute the pair-wise
similarities of documents based on the suffix tree document (STD) model. By mapping each …

[图书][B] Handbook of computational molecular biology

S Aluru - 2005 - taylorfrancis.com
The enormous complexity of biological systems at the molecular level must be answered
with powerful computational methods. Computational biology is a young field, but has seen …

[图书][B] Advanced data structures

P Brass - 2008 - helloplanetcpp.wordpress.com
This book is a graduate-level textbook on data structures. A data structure is a method1 to
realize a set of operations on some data. The classical example is to keep track of a set of …