Two-sample testing using deep learning

M Kirchler, S Khorasani, M Kloft… - … Conference on Artificial …, 2020 - proceedings.mlr.press
We propose a two-sample testing procedure based on learned deep neural network
representations. To this end, we define two test statistics that perform an asymptotic location …

Statistical tests and identifiability conditions for pooling and analyzing multisite datasets

HH Zhou, V Singh, SC Johnson… - Proceedings of the …, 2018 - National Acad Sciences
When sample sizes are small, the ability to identify weak (but scientifically interesting)
associations between a set of predictors and a response may be enhanced by pooling …

Few-shot text and image classification via analogical transfer learning

W Liu, X Chang, Y Yan, Y Yang… - ACM Transactions on …, 2018 - dl.acm.org
Learning from very few samples is a challenge for machine learning tasks, such as text and
image classification. Performance of such task can be enhanced via transfer of helpful …

Conditional recurrent flow: conditional generation of longitudinal samples with applications to neuroimaging

SJ Hwang, Z Tao, WH Kim… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
We develop a conditional generative model for longitudinal image datasets based on
sequential invertible neural networks. Longitudinal image acquisitions are common in …

When can Multi-Site Datasets be Pooled for Regression? Hypothesis Tests, -consistency and Neuroscience Applications

HH Zhou, Y Zhang, VK Ithapu… - International …, 2017 - proceedings.mlr.press
Many studies in biomedical and health sciences involve small sample sizes due to logistic or
financial constraints. Often, identifying weak (but scientifically interesting) associations …

Craft: Cross-modal Aligned Features Improve Robustness of Prompt Tuning

J Sun, R Sharma, VS Lokhande, C Chen - arXiv preprint arXiv:2407.15894, 2024 - arxiv.org
Prompt Tuning has emerged as a prominent research paradigm for adapting vision-
language models to various downstream tasks. However, recent research indicates that …

Testing the equality of distributions using integrated maximum mean discrepancy

T Ding, Z Li, Y Zhang - Journal of Statistical Planning and Inference, 2025 - Elsevier
Comparing and testing for the homogeneity of two independent random samples is a
fundamental statistical problem with many applications across various fields. However …

[HTML][HTML] Distance assessment and analysis of high-dimensional samples using variational autoencoders

M Inacio, R Izbicki, B Gyires-Tóth - Information Sciences, 2021 - Elsevier
An important question in many machine learning applications is whether two samples arise
from the same generating distribution. Although an old topic in Statistics, simple …

Unsupervised domain adaptation for multi-center autism spectrum disorder identification

Y Jiang, Z Li, D Zhang - … & Communications, Cloud & Big Data …, 2019 - ieeexplore.ieee.org
Improving autism spectrum disorder (ASD) diagnosis through effective utilization of multi-
center data has attracted increasing attention recently. However, most previous studies do …

Cross-media retrieval with semantics clustering and enhancement

M Zhan, L Li, Q Huang, Y Liu - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
Cross-media retrieval, which uses a text query to search for images and vice-versa, has
attracted a wide attention in recent years. The mostly existing cross-media retrieval methods …