关注
Ethan Mendes
标题
引用次数
引用次数
年份
Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game
S Toyer, O Watkins, E Mendes, J Svegliato, L Bailey, T Wang, I Ong, ...
ICLR 2024 (Spotlight), 2023
282023
Can Language Models be Instructed to Protect Personal Information?
Y Chen*, E Mendes*, S Das, W Xu, A Ritter
arXiv preprint arXiv:2310.02224, 2023
142023
Human-in-the-loop Evaluation for Early Misinformation Detection: A Case Study of COVID-19 Treatments
E Mendes, Y Chen, A Ritter, W Xu
ACL 2023, 2022
72022
Defending Against Imperceptible Audio Adversarial Examples Using Proportional Additive Gaussian Noise
E Mendes, K Hogan
52020
Granular Privacy Control for Geolocation with Vision Language Models
E Mendes, Y Chen, J Hays, S Das, W Xu, A Ritter
arXiv preprint arXiv:2407.04952, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–5