V2A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection

X Zhang, Y Xu, R Li, J Yu, W Li, Z Xu… - Proceedings of the 32nd …, 2024 - dl.acm.org
AI-generated video has revolutionized short video production, filmmaking, and personalized
media, making video local editing an essential tool. However, this progress also blurs the …

360dvd: Controllable panorama video generation with 360-degree video diffusion model

Q Wang, W Li, C Mou, X Cheng… - Proceedings of the …, 2024 - openaccess.thecvf.com
Panorama video recently attracts more interest in both study and application courtesy of its
immersive experience. Due to the expensive cost of capturing 360-degree panoramic videos …

Fakeshield: Explainable image forgery detection and localization via multi-modal large language models

Z Xu, X Zhang, R Li, Z Tang, Q Huang… - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid development of generative AI is a double-edged sword, which not only facilitates
content creation but also makes image manipulation easier and more difficult to detect …

Resvr: Joint rescaling and viewport rendering of omnidirectional images

W Li, S Zhao, B Chen, X Cheng, J Li, L Zhang… - Proceedings of the …, 2024 - dl.acm.org
With the advent of virtual reality technology, omnidirectional image (ODI) rescaling
techniques are increasingly embraced to reduce transmitted and stored file sizes while …

Robust watermarking using generative priors against image editing: From benchmarking to advances

S Lu, Z Zhou, J Lu, Y Zhu, AWK Kong - arXiv preprint arXiv:2410.18775, 2024 - arxiv.org
Current image watermarking methods are vulnerable to advanced image editing techniques
enabled by large-scale text-to-image models. These models can distort embedded …

U Can't Gen This? A Survey of Intellectual Property Protection Methods for Data in Generative AI

T Šarčević, A Karlowicz, R Mayer… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Generative AI (GAI) models have the unparalleled ability to generate text, images,
audio, and other forms of media that are increasingly indistinguishable from human …

Are Watermarks Bugs for Deepfake Detectors? Rethinking Proactive Forensics

X Wu, X Liao, B Ou, Y Liu, Z Qin - arXiv preprint arXiv:2404.17867, 2024 - arxiv.org
AI-generated content has accelerated the topic of media synthesis, particularly Deepfake,
which can manipulate our portraits for positive or malicious purposes. Before releasing …

Proactive schemes: A survey of adversarial attacks for social good

V Asnani, X Yin, X Liu - arXiv preprint arXiv:2409.16491, 2024 - arxiv.org
Adversarial attacks in computer vision exploit the vulnerabilities of machine learning models
by introducing subtle perturbations to input data, often leading to incorrect predictions or …

Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding

X Cheng, Z Zhu, B Yang, X Zhuang, H Li… - Findings of the …, 2024 - aclanthology.org
Owing to the scarcity of labeled training data, Spoken Language Understanding (SLU) is still
a challenging task in low-resource languages. Therefore, zero-shot cross-lingual SLU …

Geometry cloak: Preventing tgs-based 3d reconstruction from copyrighted images

Q Song, Z Luo, KC Cheung, S See, R Wan - arXiv preprint arXiv …, 2024 - arxiv.org
Single-view 3D reconstruction methods like Triplane Gaussian Splatting (TGS) have
enabled high-quality 3D model generation from just a single image input within seconds …