SaprotHub: Making Protein Modeling Accessible to All Biologists

J Su, Z Li, C Han, Y Zhou, Y He, J Shan, X Zhou… - bioRxiv, 2024 - biorxiv.org
Training and deploying deep learning models pose challenges for users without machine
learning (ML) expertise. SaprotHub offers a user-friendly platform that democratizes the …

Tokenized and Continuous Embedding Compressions of Protein Sequence and Structure

AX Lu, W Yan, KK Yang, V Gligorijevic, K Cho… - bioRxiv, 2024 - biorxiv.org
Existing protein machine learning representations typically model either the sequence or
structure distribution, with the other modality implicit. The latent space of sequence-to …

Bio2Token: All-atom tokenization of any biomolecular structure with Mamba

A Liu, A Elaldi, N Russell, O Viessmann - arXiv preprint arXiv:2410.19110, 2024 - arxiv.org
Efficient encoding and representation of large 3D molecular structures with high fidelity is
critical for biomolecular design applications. Despite this, many representation learning …