Challenges and applications in generative AI for clinical tabular data in physiology

C Umesh, M Mahendra, S Bej, O Wolkenhauer… - … -European Journal of …, 2024 - Springer
Recent advancements in generative approaches in AI have opened up the prospect of
synthetic tabular clinical data generation. From filling in missing values in real-world data …

Estimation of peanut southern blight severity in hyperspectral data using the synthetic minority oversampling technique and fractional-order differentiation

H Sun, L Zhou, M Shu, J Zhang, Z Feng, H Feng… - Agriculture, 2024 - mdpi.com
Southern blight significantly impacts peanut yield, and its severity is exacerbated by high-
temperature and high-humidity conditions. The mycelium attached to the plant's interior …

Convex space learning for tabular synthetic data generation

M Mahendra, C Umesh, S Bej, K Schultz… - arXiv preprint arXiv …, 2024 - arxiv.org
Generating synthetic samples from the convex space of the minority class is a popular
oversampling approach for imbalanced classification problems. Recently, deep-learning …

GQEO: Nearest neighbor graph-based generalized quadrilateral element oversampling for class-imbalance problem

Q Dai, L Wang, J Zhang, W Ding, L Chen - Neural Networks, 2025 - Elsevier
The class imbalance problem is one of the difficult factors affecting the performance of
traditional classifiers. The oversampling technique is the most common way to solve the …

imFTP: Deep Imbalance Learning via Fuzzy Transition and Prototypical Learning

Y Hou, W Ding, C Zhang - Information Sciences, 2024 - Elsevier
Although many methods have been proposed for tackling the class-imbalance problem, they
still suffer from the insufficient feature representative capability and the overfitting problem …

A multimodal data generation method for imbalanced classification with dual-discriminator constrained diffusion model and adaptive sample selection strategy

Q Li, X Gao, H Lu, B Li, F Zhai, T Wang, Z Meng, Y Hao - Information Fusion, 2025 - Elsevier
Data-level methods often suffer from mode collapse when the minority class has multiple
distribution patterns. Some studies have tried addressing the problem using similarity …

Preserving logical and functional dependencies in synthetic tabular data

C Umesh, K Schultz, M Mahendra, S Bej… - arXiv preprint arXiv …, 2024 - arxiv.org
Dependencies among attributes are a common aspect of tabular data. However, whether
existing tabular data generation algorithms preserve these dependencies while generating …

Frugal Generative Modeling for Tabular Data

A Lacan, B Hanczar, M Sebag - Joint European Conference on Machine …, 2024 - Springer
This paper presents a generative modeling approach called Gmda designed for tabular
data, adapted to its arbitrary feature correlation structure. The generative model is trained so …

Weighted Neural Network for Imbalanced Dataset with Undersampling

Y Motai, SY Alaba, N Siddique, E Benli… - Available at SSRN … - papers.ssrn.com
Class imbalance presents a significant challenge in machine learning, particularly in
applications where accurate detection of minority classes is essential. This study presents a …

A Novel Adaptive Hyperspherical Oversampling Method Based on Extended Natural Neighborhood for Imbalanced Classification

Y Zhou, X Yue, J Li, X Liu, W Sun, J Li - Available at SSRN 4978376 - papers.ssrn.com
Classifying imbalanced datasets remains a significant challenge for classifiers, with
oversampling techniques being a widely used solution. However, many existing …