Improved input reprogramming for GAN conditioning

PY Chen - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org

In data-rich domains such as vision, language, and speech, deep learning prevails to deliver
high-performance task-specific models and can even learn general task-agnostic …

Cited by 53 - Related articles - All 4 versions

[PDF] arxiv.org

Prompt tuning of deep neural networks for speaker-adaptive visual speech recognition

M Kim, HI Kim, YM Ro - IEEE Transactions on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Visual Speech Recognition (VSR) aims to infer speech into text depending on lip
movements alone. As it focuses on visual information to model the speech, its performance …

Cited by 19 - Related articles - All 2 versions

[PDF] arxiv.org

Large-Scale Public Data Improves Differentially Private Image Generation Quality

R Wu, C Guo, K Chaudhuri - arXiv preprint arXiv:2309.00008, 2023 - arxiv.org

Public data has been frequently used to improve the privacy-accuracy trade-off of
differentially private machine learning, but prior work largely assumes that this data come …

Cited by 1 - Related articles - All 3 versions

[PDF] arxiv.org

Towards efficient task-driven model reprogramming with foundation models

S Xu, J Yao, R Luo, S Zhang, Z Lian, M Tan… - arXiv preprint arXiv …, 2023 - arxiv.org

Vision foundation models exhibit impressive power, benefiting from the extremely large
model capacity and broad training data. However, in practice, downstream scenarios may …

Cited by 6 - Related articles - All 2 versions

[PDF] arxiv.org

Meta Input: How to Leverage Off-the-Shelf Deep Neural Networks

M Kim, Y Yu, S Park, YM Ro - arXiv preprint arXiv:2210.13186, 2022 - arxiv.org

These days, although deep neural networks (DNNs) have achieved a noticeable progress in
a wide range of research area, it lacks the adaptability to be employed in the real-world …
