A closer look at deep learning on tabular data

HJ Ye, SY Liu, HR Cai, QL Zhou, DC Zhan - arXiv preprint arXiv …, 2024 - arxiv.org
Tabular data is prevalent across various domains in machine learning. Although Deep
Neural Network (DNN)-based methods have shown promising performance comparable to …

Modern neighborhood components analysis: A deep tabular baseline two decades later

HJ Ye, HH Yin, DC Zhan - arXiv preprint arXiv:2407.03257, 2024 - arxiv.org
The growing success of deep learning in various domains has prompted investigations into
its application to tabular data, where deep models have shown promising results compared …

Better by default: Strong pre-tuned mlps and boosted trees on tabular data

D Holzmüller, L Grinsztajn, I Steinwart - arXiv preprint arXiv:2407.04491, 2024 - arxiv.org
For classification and regression on tabular data, the dominance of gradient-boosted
decision trees (GBDTs) has recently been challenged by often much slower deep learning …

Tabular deep learning: a comparative study applied to multi-task genome-wide prediction

Y Fan, P Waldmann - BMC bioinformatics, 2024 - Springer
Purpose More accurate prediction of phenotype traits can increase the success of genomic
selection in both plant and animal breeding studies and provide more reliable disease risk …

A survey on deep tabular learning

S Somvanshi, S Das, SA Javed, G Antariksa… - arXiv preprint arXiv …, 2024 - arxiv.org
Tabular data, widely used in industries like healthcare, finance, and transportation, presents
unique challenges for deep learning due to its heterogeneous nature and lack of spatial …

TabM: Advancing Tabular Deep Learning with Parameter-Efficient Ensembling

Y Gorishniy, A Kotelnikov, A Babenko - arXiv preprint arXiv:2410.24210, 2024 - arxiv.org
Deep learning architectures for supervised learning on tabular data range from simple
multilayer perceptrons (MLP) to sophisticated Transformers and retrieval-augmented …

TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks

I Rubachev, N Kartashev, Y Gorishniy… - arXiv preprint arXiv …, 2024 - arxiv.org
Advances in machine learning research drive progress in real-world applications. To ensure
this progress, it is important to understand the potential pitfalls on the way from a novel …

[PDF][PDF] Comparative evaluation of anomaly detection methods for fraud detection in online credit card payments

H Thimonier, F Popineau, A Rimmel… - … on Information and …, 2024 - library.oapen.org
This study explores the application of anomaly detection (AD) methods in imbalanced
learning tasks, focusing on fraud detection using real online credit card payment data. We …

Learning a Decision Tree Algorithm with Transformers

Y Zhuang, L Liu, C Singh, J Shang, J Gao - arXiv preprint arXiv …, 2024 - arxiv.org
Decision trees are renowned for their interpretability capability to achieve high predictive
performance, especially on tabular data. Traditionally, they are constructed through …

A data-centric perspective on evaluating machine learning models for tabular data

A Tschalzev, S Marton, S Lüdtke, C Bartelt… - arXiv preprint arXiv …, 2024 - arxiv.org
Tabular data is prevalent in real-world machine learning applications, and new models for
supervised learning of tabular data are frequently proposed. Comparative studies assessing …