VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling

S Li, Z Wang, Z Liu, D Wu, C Tan, J Zheng… - arXiv preprint arXiv …, 2024 - arxiv.org
Similar to natural language models, pre-trained genome language models are proposed to
capture the underlying intricacies within genomes with unsupervised sequence modeling …

ChatNT: A Multimodal Conversational Agent for DNA, RNA and Protein Tasks

G Richard, BP de Almeida, H Dalla-Torre, C Blum… - bioRxiv, 2024 - biorxiv.org
Language models are thriving, powering conversational agents that assist and empower
humans to solve a number of tasks. Recently, these models were extended to support …

Multi-modal Transfer Learning between Biological Foundation Models

JJ Garau-Luis, P Bordes, L Gonzalez, M Roller… - arXiv preprint arXiv …, 2024 - arxiv.org
Biological sequences encode fundamental instructions for the building blocks of life, in the
form of DNA, RNA, and proteins. Modeling these sequences is key to understand disease …