Model reprogramming: Resource-efficient cross-domain machine learning

PY Chen - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
In data-rich domains such as vision, language, and speech, deep learning prevails to deliver
high-performance task-specific models and can even learn general task-agnostic …

Prompt tuning of deep neural networks for speaker-adaptive visual speech recognition

M Kim, HI Kim, YM Ro - IEEE Transactions on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Visual Speech Recognition (VSR) aims to infer speech into text depending on lip
movements alone. As it focuses on visual information to model the speech, its performance …

Large-Scale Public Data Improves Differentially Private Image Generation Quality

R Wu, C Guo, K Chaudhuri - arXiv preprint arXiv:2309.00008, 2023 - arxiv.org
Public data has been frequently used to improve the privacy-accuracy trade-off of
differentially private machine learning, but prior work largely assumes that this data come …

Towards efficient task-driven model reprogramming with foundation models

S Xu, J Yao, R Luo, S Zhang, Z Lian, M Tan… - arXiv preprint arXiv …, 2023 - arxiv.org
Vision foundation models exhibit impressive power, benefiting from the extremely large
model capacity and broad training data. However, in practice, downstream scenarios may …

Meta Input: How to Leverage Off-the-Shelf Deep Neural Networks

M Kim, Y Yu, S Park, YM Ro - arXiv preprint arXiv:2210.13186, 2022 - arxiv.org
These days, although deep neural networks (DNNs) have achieved a noticeable progress in
a wide range of research area, it lacks the adaptability to be employed in the real-world …