Structure-aware protein self-supervised learning

C Chen, J Zhou, F Wang, X Liu, D Dou - Bioinformatics, 2023 - academic.oup.com
Motivation Protein representation learning methods have shown great potential to many
downstream tasks in biological applications. A few recent studies have demonstrated that …

Bidirectional learning for offline infinite-width model-based optimization

C Chen, Y Zhang, J Fu, XS Liu… - Advances in Neural …, 2022 - proceedings.neurips.cc
In offline model-based optimization, we strive to maximize a black-box objective function by
only leveraging a static dataset of designs and their scores. This problem setting arises in …

Importance-aware co-teaching for offline model-based optimization

Y Yuan, CS Chen, Z Liu… - Advances in Neural …, 2024 - proceedings.neurips.cc
Offline model-based optimization aims to find a design that maximizes a property of interest
using only an offline dataset, with applications in robot, protein, and molecule design …

Parallel-mentoring for offline model-based optimization

CS Chen, C Beckham, Z Liu… - Advances in Neural …, 2024 - proceedings.neurips.cc
We study offline model-based optimization to maximize a black-box objective function with a
static dataset of designs and scores. These designs encompass a variety of domains …

Gradient-based bi-level optimization for deep learning: A survey

C Chen, X Chen, C Ma, Z Liu, X Liu - arXiv preprint arXiv:2207.11719, 2022 - arxiv.org
Bi-level optimization, especially the gradient-based category, has been widely used in the
deep learning community including hyperparameter optimization and meta-knowledge …

Bootstrapped training of score-conditioned generator for offline design of biological sequences

M Kim, F Berto, S Ahn, J Park - Advances in Neural …, 2024 - proceedings.neurips.cc
We study the problem of optimizing biological sequences, eg, proteins, DNA, and RNA, to
maximize a black-box score function that is only evaluated in an offline dataset. We propose …

Protein representation learning via knowledge enhanced primary structure reasoning

HY Zhou, Y Fu, Z Zhang, B Cheng… - The Eleventh International …, 2023 - openreview.net
Protein representation learning has primarily benefited from the remarkable development of
language models (LMs). Accordingly, pre-trained protein models also suffer from a problem …

Importance-aware adaptive dataset distillation

G Li, R Togo, T Ogawa, M Haseyama - Neural Networks, 2024 - Elsevier
Herein, we propose a novel dataset distillation method for constructing small informative
datasets that preserve the information of the large original datasets. The development of …

Diffusion models as constrained samplers for optimization with unknown constraints

L Kong, Y Du, W Mu, K Neklyudov, V De Bortol… - arXiv preprint arXiv …, 2024 - arxiv.org
Addressing real-world optimization problems becomes particularly challenging when
analytic objective functions or constraints are unavailable. While numerous studies have …

Advances of Deep Learning in Protein Science: A Comprehensive Survey

B Hu, C Tan, L Wu, J Zheng, J Xia, Z Gao, Z Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Protein representation learning plays a crucial role in understanding the structure and
function of proteins, which are essential biomolecules involved in various biological …