[HTML][HTML] A review of ensemble learning and data augmentation models for class imbalanced problems: combination, implementation and evaluation

AA Khan, O Chaudhari, R Chandra - Expert Systems with Applications, 2023 - Elsevier
Class imbalance (CI) in classification problems arises when the number of observations
belonging to one class is lower than the other. Ensemble learning combines multiple models …

Tabular data: Deep learning is not all you need

R Shwartz-Ziv, A Armon - Information Fusion, 2022 - Elsevier
A key element in solving real-life data science problems is selecting the types of models to
use. Tree ensemble models (such as XGBoost) are usually recommended for classification …

A performance-driven benchmark for feature selection in tabular deep learning

V Cherepanova, R Levin, G Somepalli… - Advances in …, 2024 - proceedings.neurips.cc
Academic tabular benchmarks often contain small sets of curated features. In contrast, data
scientists typically collect as many features as possible into their datasets, and even …

Switchtab: Switched autoencoders are effective tabular learners

J Wu, S Chen, Q Zhao, R Sergazinov, C Li… - Proceedings of the …, 2024 - ojs.aaai.org
Self-supervised representation learning methods have achieved significant success in
computer vision and natural language processing (NLP), where data samples exhibit explicit …

Recontab: Regularized contrastive representation learning for tabular data

S Chen, J Wu, N Hovakimyan, H Yao - arXiv preprint arXiv:2310.18541, 2023 - arxiv.org
Representation learning stands as one of the critical machine learning techniques across
various domains. Through the acquisition of high-quality features, pre-trained embeddings …

iPiDA-GCN: Identification of piRNA-disease associations based on Graph Convolutional Network

J Hou, H Wei, B Liu - PLOS Computational Biology, 2022 - journals.plos.org
Motivation Piwi-interacting RNAs (piRNAs) play a critical role in the progression of various
diseases. Accurately identifying the associations between piRNAs and diseases is important …

The diversified ensemble neural network

S Zhang, M Liu, J Yan - Advances in Neural Information …, 2020 - proceedings.neurips.cc
Ensemble is a general way of improving the accuracy and stability of learning models,
especially for the generalization ability on small datasets. Compared with tree-based …

Transfer learning with deep tabular models

R Levin, V Cherepanova, A Schwarzschild… - arXiv preprint arXiv …, 2022 - arxiv.org
Recent work on deep learning for tabular data demonstrates the strong performance of deep
tabular models, often bridging the gap between gradient boosted decision trees and neural …

[HTML][HTML] Databases and computational methods for the identification of piRNA-related molecules: A survey

C Guo, X Wang, H Ren - Computational and Structural Biotechnology …, 2024 - Elsevier
Piwi-interacting RNAs (piRNAs) are a class of small non-coding RNAs (ncRNAs) that play
important roles in many biological processes and major cancer diagnosis and treatment …

Precision machine learning

EJ Michaud, Z Liu, M Tegmark - Entropy, 2023 - mdpi.com
We explore unique considerations involved in fitting machine learning (ML) models to data
with very high precision, as is often required for science applications. We empirically …