[HTML][HTML] The language of proteins: NLP, machine learning & protein sequences

D Ofer, N Brandes, M Linial - Computational and Structural Biotechnology …, 2021 - Elsevier
Natural language processing (NLP) is a field of computer science concerned with automated
text and language analysis. In recent years, following a series of breakthroughs in deep and …

Protein flexibility in the light of structural alphabets

P Craveur, AP Joseph, J Esque, TJ Narwani… - Frontiers in molecular …, 2015 - frontiersin.org
Protein structures are valuable tools to understand protein function. Nonetheless, proteins
are often considered as rigid macromolecules while their structures exhibit specific flexibility …

A structure-informed atlas of human-virus interactions

G Lasso, SV Mayer, ER Winkelmann, T Chu, O Elliot… - Cell, 2019 - cell.com
While knowledge of protein-protein interactions (PPIs) is critical for understanding virus-host
relationships, limitations on the scalability of high-throughput methods have hampered their …

Sequence-structure-function relationships in the microbial protein universe

J Koehler Leman, P Szczerbiak, PD Renfrew… - Nature …, 2023 - nature.com
For the past half-century, structural biologists relied on the notion that similar protein
sequences give rise to similar structures and functions. While this assumption has driven …

Fast protein structure comparison through effective representation learning with contrastive graph neural networks

C Xia, SH Feng, Y Xia, X Pan… - PLoS computational …, 2022 - journals.plos.org
Protein structure alignment algorithms are often time-consuming, resulting in challenges for
large-scale protein structure similarity-based retrieval. There is an urgent need for more …

[HTML][HTML] Beyond sequence: Structure-based machine learning

J Durairaj, D de Ridder, ADJ van Dijk - Computational and Structural …, 2023 - Elsevier
Recent breakthroughs in protein structure prediction demarcate the start of a new era in
structural bioinformatics. Combined with various advances in experimental structure …

Tertiary alphabet for the observable protein structural universe

CO Mackenzie, J Zhou… - Proceedings of the …, 2016 - National Acad Sciences
Here, we systematically decompose the known protein structural universe into its basic
elements, which we dub tertiary structural motifs (TERMs). A TERM is a compact backbone …

A survey of computational methods for protein function prediction

A Shehu, D Barbará, K Molloy - Big data analytics in genomics, 2016 - Springer
Rapid advances in high-throughout genome sequencing technologies have resulted in
millions of protein-encoding gene sequences with no functional characterization. Automated …

A sweep of earth's virome reveals host-guided viral protein structural mimicry and points to determinants of human disease

G Lasso, B Honig, SD Shapira - Cell Systems, 2021 - cell.com
Viruses deploy genetically encoded strategies to coopt host machinery and support viral
replicative cycles. Here, we use protein structure similarity to scan for molecular mimicry …

Rapid search for tertiary fragments reveals protein sequence–structure relationships

J Zhou, G Grigoryan - Protein Science, 2015 - Wiley Online Library
Finding backbone substructures from the Protein Data Bank that match an arbitrary query
structural motif, composed of multiple disjoint segments, is a problem of growing relevance …