The theory and practice of genome sequence assembly

JT Simpson, M Pop - Annual review of genomics and human …, 2015 - annualreviews.org
The current genomic revolution was made possible by joint advances in genome
sequencing technologies and computational approaches for analyzing sequence data. The …

Discovering functional motifs in long noncoding RNAs

CJ Ross, I Ulitsky - Wiley Interdisciplinary Reviews: RNA, 2022 - Wiley Online Library
Long noncoding RNAs (lncRNAs) are products of pervasive transcription that closely
resemble messenger RNAs on the molecular level, yet function through largely unknown …

Towards automated log parsing for large-scale log data analysis

P He, J Zhu, S He, J Li, MR Lyu - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
Logs are widely used in system management for dependability assurance because they are
often the only data available that record detailed system runtime behaviors in production …

[BOOK][B] The algorithm design manual

SS Skiena - 1998 - Springer
This newly expanded and updated second edition of the best-selling classic continues to
take the "mystery" out of designing algorithms, and analyzing their efficacy and efficiency …

Tight hardness results for LCS and other sequence similarity measures

A Abboud, A Backurs… - 2015 IEEE 56th Annual …, 2015 - ieeexplore.ieee.org
Two important similarity measures between sequences are the longest common
subsequence (LCS) and the dynamic time warping distance (DTWD). The computations of …

A survey of longest common subsequence algorithms

L Bergroth, H Hakonen, T Raita - … International Symposium on …, 2000 - ieeexplore.ieee.org
The aim of this paper is to give a comprehensive comparison of well-known longest common
subsequence algorithms (for two input strings) and study their behaviour in various …

[PDF] Data compression via textual substitution

JA Storer, TG Szymanski - Journal of the ACM (JACM), 1982 - dl.acm.org
A general model for data compression which includes most data compression systems in the
literature as special cases is presented. Macro schemes are based on the principle of …

LogSig: Generating system events from raw textual logs

L Tang, T Li, CS Perng - Proceedings of the 20th ACM international …, 2011 - dl.acm.org
Modern computing systems generate large amounts of log data. System administrators or
domain experts utilize the log data to understand and optimize system behaviors. Most …

Generalized random shapelet forests

I Karlsson, P Papapetrou, H Boström - Data mining and knowledge …, 2016 - Springer
Shapelets are discriminative subsequences of time series, usually embedded in shapelet-
based decision trees. The enumeration of time series shapelets is, however, computationally …

Towards trajectory anonymization: a generalization-based approach

ME Nergiz, M Atzori, Y Saygin - … of the SIGSPATIAL ACM GIS 2008 …, 2008 - dl.acm.org
Trajectory datasets are becoming more and more popular due to the massive usage of GPS
and other location-based devices and services. In this paper, we address privacy issues …