Machine learning for synthetic data generation: a review

Y Lu, M Shen, H Wang, X Wang, C van Rechem… - arXiv preprint arXiv …, 2023 - arxiv.org
Machine learning heavily relies on data, but real-world applications often encounter various
data-related issues. These include data of poor quality, insufficient data points leading to …

Machine learning-aided generative molecular design

Y Du, AR Jamasb, J Guo, T Fu, C Harris… - Nature Machine …, 2024 - nature.com
Abstract Machine learning has provided a means to accelerate early-stage drug discovery
by combining molecule generation and filtering steps in a single architecture that leverages …

Trialbench: Multi-modal artificial intelligence-ready clinical trial datasets

J Chen, Y Hu, Y Wang, Y Lu, X Cao, M Lin, H Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
Clinical trials are pivotal for developing new medical treatments, yet they typically pose
some risks such as patient mortality, adverse events, and enrollment failure that waste …

Semignn-ppi: Self-ensembling multi-graph neural network for efficient and generalizable protein-protein interaction prediction

Z Zhao, P Qian, X Yang, Z Zeng, C Guan… - arXiv preprint arXiv …, 2023 - arxiv.org
Protein-protein interactions (PPIs) are crucial in various biological processes and their study
has significant implications for drug development and disease diagnosis. Existing deep …

Uncertainty quantification on clinical trial outcome prediction

T Chen, N Hao, Y Lu, C Van Rechem - arXiv preprint arXiv:2401.03482, 2024 - arxiv.org
The importance of uncertainty quantification is increasingly recognized in the diverse field of
machine learning. Accurately assessing model prediction uncertainty can help provide …

Smiles-mamba: Chemical mamba foundation models for drug admet prediction

B Xu, Y Lu, C Li, L Yue, X Wang, N Hao, T Fu… - arXiv preprint arXiv …, 2024 - arxiv.org
In drug discovery, predicting the absorption, distribution, metabolism, excretion, and toxicity
(ADMET) properties of small-molecule drugs is critical for ensuring safety and efficacy …

Uncertainty quantification and interpretability for clinical trial approval prediction

Y Lu, T Chen, N Hao, C Van Rechem, J Chen… - Health Data …, 2024 - spj.science.org
Background: Clinical trial is a crucial step in the development of a new therapy (eg,
medication) and is remarkably expensive and time-consuming. Forecasting the approval of …

Trialenroll: Predicting clinical trial enrollment success with deep & cross network and large language models

L Yue, S Xing, J Chen, T Fu - arXiv preprint arXiv:2407.13115, 2024 - arxiv.org
Clinical trials need to recruit a sufficient number of volunteer patients to demonstrate the
statistical power of the treatment (eg, a new drug) in curing a certain disease. Clinical trial …

Biomamba: A pre-trained biomedical language representation model leveraging mamba

L Yue, S Xing, Y Lu, T Fu - arXiv preprint arXiv:2408.02600, 2024 - arxiv.org
The advancement of natural language processing (NLP) in biology hinges on models' ability
to interpret intricate biomedical literature. Traditional models often struggle with the complex …

A Survey on Large Language Models for Critical Societal Domains: Finance, Healthcare, and Law

ZZ Chen, J Ma, X Zhang, N Hao, A Yan… - arXiv preprint arXiv …, 2024 - arxiv.org
In the fast-evolving domain of artificial intelligence, large language models (LLMs) such as
GPT-3 and GPT-4 are revolutionizing the landscapes of finance, healthcare, and law …