Gapped BLAST and PSI-BLAST: a new generation of protein database search programs

SF Altschul, TL Madden, AA Schäffer… - Nucleic acids …, 1997 - academic.oup.com
The BLAST programs are widely used tools for searching protein and DNA databases for
sequence similarities. For protein comparisons, a variety of definitional, algorithmic and …

DNA binding sites: representation and discovery

GD Stormo - Bioinformatics, 2000 - academic.oup.com
The purpose of this article is to provide a brief history of the development and application of
computer algorithms for the analysis and prediction of DNA binding sites. This problem can …

Springer series in statistics

P Bickel, P Diggle, S Fienberg, U Gather, I Olkin… - Principles and Theory …, 2009 - Springer
The idea for this book came from the time the authors spent at the Statistics and Applied
Mathematical Sciences Institute (SAMSI) in Research Triangle Park in North Carolina …

[图书][B] Biological sequence analysis: probabilistic models of proteins and nucleic acids

R Durbin, SR Eddy, A Krogh, G Mitchison - 1998 - books.google.com
Probabilistic models are becoming increasingly important in analysing the huge amount of
data being produced by large-scale DNA-sequencing efforts such as the Human Genome …

[图书][B] Monte Carlo strategies in scientific computing

JS Liu, JS Liu - 2001 - Springer
This book provides a self-contained and up-to-date treatment of the Monte Carlo method
and develops a common framework under which various Monte Carlo techniques can be" …

Detecting subtle sequence signals: a Gibbs sampling strategy for multiple alignment

CE Lawrence, SF Altschul, MS Boguski, JS Liu… - science, 1993 - science.org
A wealth of protein and DNA sequence data is being generated by genome projects and
other sequencing efforts. A crucial barrier to deciphering these sequences and …

Identifying DNA and protein patterns with statistically significant alignments of multiple sequences.

GZ Hertz, GD Stormo - Bioinformatics (Oxford, England), 1999 - academic.oup.com
MOTIVATION: Molecular biologists frequently can obtain interesting insight by aligning a set
of related DNA, RNA or protein sequences. Such alignments can be used to determine …

Weight matrix descriptions of four eukaryotic RNA polymerase II promoter elements derived from 502 unrelated promoter sequences

P Bucher - Journal of molecular biology, 1990 - Elsevier
Optimized weight matrices defining four major eukaryotic promoter elements, the TATA-box,
cap signal, CCAAT-, and GC-box, are presented; they were derived by comparative …

Amino acid substitution matrices from an information theoretic perspective

SF Altschul - Journal of molecular biology, 1991 - Elsevier
Protein sequence alignments have become an important tool for molecular biologists. Local
alignments are frequently constructed with the aid of a “substitution score matrix” that …

Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF

VR Iyer, CE Horak, CS Scafe, D Botstein, M Snyder… - Nature, 2001 - nature.com
Proteins interact with genomic DNA to bring the genome to life; and these interactions also
define many functional features of the genome. SBF and MBF are sequence-specific …