KABOOM! A new suffix array based algorithm for clustering expression data

W Li, L Fu, B Niu, S Wu, J Wooley - Briefings in bioinformatics, 2012 - academic.oup.com

The rapid advances of high-throughput sequencing technologies dramatically prompted
metagenomic studies of microbial communities that exist at various environments …

被引用次数：506 相关文章所有 16 个版本

[PDF] frontiersin.org

Large differences in gene expression responses to drought and heat stress between elite barley cultivar Scarlett and a Spanish landrace

CP Cantalapiedra, MJ García-Pereira… - Frontiers in plant …, 2017 - frontiersin.org

Drought causes important losses in crop production every season. Improvement for drought
tolerance could take advantage of the diversity held in germplasm collections, much of …

被引用次数：70 相关文章所有 8 个版本

[PDF] oup.com

A bioinformatician's guide to the forefront of suffix array construction algorithms

AMS Shrestha, MC Frith, P Horton - Briefings in bioinformatics, 2014 - academic.oup.com

The suffix array and its variants are text-indexing data structures that have become
indispensable in the field of bioinformatics. With the uninitiated in mind, we provide an …

被引用次数：58 相关文章所有 16 个版本

[PDF] springer.com

gsufsort: constructing suffix arrays, LCP arrays and BWTs for string collections

FA Louza, GP Telles, S Gog, N Prezza… - Algorithms for Molecular …, 2020 - Springer

Background The construction of a suffix array for a collection of strings is a fundamental task
in Bioinformatics and in many other applications that process strings. Related data …

被引用次数：18 相关文章所有 16 个版本

Hadooping the genome: The impact of big data tools on biology

H Stevens - BioSocieties, 2016 - Springer

This essay examines the consequences of the so-called 'big data'technologies in
biomedicine. Analyzing algorithms and data structures used by biologists can provide …

被引用次数：21 相关文章所有 2 个版本

[PDF] psu.edu

Scalable and Versatile k-mer Indexing for High-Throughput Sequencing Data

N Välimäki, E Rivals - International Symposium on Bioinformatics …, 2013 - Springer

Abstract Philippe et al.(2011) proposed a data structure called Gk arrays for indexing and
querying large collections of high-throughput sequencing data in main-memory. The data …

被引用次数：16 相关文章所有 10 个版本

Diagaf: A more accurate and efficient pre-alignment filter for sequence alignment

C Yu, Y Zhao, C Zhao, H Ma… - IEEE/ACM Transactions …, 2021 - ieeexplore.ieee.org

Sequence alignment is an essential step in computational genomics. More accurate and
efficient sequence pre-alignment methods that run before conducting expensive …

被引用次数：3 相关文章所有 5 个版本

An overview of string processing applications to data analytics

H Koponen, N Mhaskar… - 2021 Reconciling Data …, 2021 - ieeexplore.ieee.org

Data analytics may conveniently be divided into four stages: preparation, preprocessing,
analysis, and post-processing. Especially in the second and third of these, where the data is …

被引用次数：3 相关文章

[PDF] researchgate.net

Efficient soft relational clustering based on randomized search applied to selection of bio-basis for amino acid sequence analysis

MA Mahfouz, MA Ismail - 2012 Seventh International …, 2012 - ieeexplore.ieee.org

Protein sequence clustering is a process that aims to identify sets of homologous proteins in
a protein database. In this paper, two efficient soft c-mediods clustering algorithms for …

被引用次数：7 相关文章所有 3 个版本

[PDF] uni-mainz.de

Accelerating bioinformatics applications on CUDA-enabled multi-GPU systems

R Kobus - 2023 - openscience.ub.uni-mainz.de

A wide range of bioinformatics applications have to deal with a continuously growing
amount of data generated by high-throughput sequencing techniques. Exclusively CPU …

高级搜索

QQ 群