Editguard: Versatile image watermarking for tamper localization and copyright protection

V²A-Mark: Versatile Deep Visual-Audio Watermarking for Manipulation Localization and Copyright Protection

X Zhang, Y Xu, R Li, J Yu, W Li, Z Xu… - Proceedings of the 32nd …, 2024 - dl.acm.org

AI-generated video has revolutionized short video production, filmmaking, and personalized
media, making video local editing an essential tool. However, this progress also blurs the …

被引用次数：10 相关文章所有 2 个版本

[PDF] thecvf.com

360dvd: Controllable panorama video generation with 360-degree video diffusion model

Q Wang, W Li, C Mou, X Cheng… - Proceedings of the …, 2024 - openaccess.thecvf.com

Panorama video recently attracts more interest in both study and application courtesy of its
immersive experience. Due to the expensive cost of capturing 360-degree panoramic videos …

被引用次数：13 相关文章所有 3 个版本

[PDF] arxiv.org

Fakeshield: Explainable image forgery detection and localization via multi-modal large language models

Z Xu, X Zhang, R Li, Z Tang, Q Huang… - arXiv preprint arXiv …, 2024 - arxiv.org

The rapid development of generative AI is a double-edged sword, which not only facilitates
content creation but also makes image manipulation easier and more difficult to detect …

被引用次数：3 相关文章所有 2 个版本

Resvr: Joint rescaling and viewport rendering of omnidirectional images

W Li, S Zhao, B Chen, X Cheng, J Li, L Zhang… - Proceedings of the …, 2024 - dl.acm.org

With the advent of virtual reality technology, omnidirectional image (ODI) rescaling
techniques are increasingly embraced to reduce transmitted and stored file sizes while …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

Robust watermarking using generative priors against image editing: From benchmarking to advances

S Lu, Z Zhou, J Lu, Y Zhu, AWK Kong - arXiv preprint arXiv:2410.18775, 2024 - arxiv.org

Current image watermarking methods are vulnerable to advanced image editing techniques
enabled by large-scale text-to-image models. These models can distort embedded …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

U Can't Gen This? A Survey of Intellectual Property Protection Methods for Data in Generative AI

T Šarčević, A Karlowicz, R Mayer… - arXiv preprint arXiv …, 2024 - arxiv.org

Large Generative AI (GAI) models have the unparalleled ability to generate text, images,
audio, and other forms of media that are increasingly indistinguishable from human …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

Are Watermarks Bugs for Deepfake Detectors? Rethinking Proactive Forensics

X Wu, X Liao, B Ou, Y Liu, Z Qin - arXiv preprint arXiv:2404.17867, 2024 - arxiv.org

AI-generated content has accelerated the topic of media synthesis, particularly Deepfake,
which can manipulate our portraits for positive or malicious purposes. Before releasing …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

Proactive schemes: A survey of adversarial attacks for social good

V Asnani, X Yin, X Liu - arXiv preprint arXiv:2409.16491, 2024 - arxiv.org

Adversarial attacks in computer vision exploit the vulnerabilities of machine learning models
by introducing subtle perturbations to input data, often leading to incorrect predictions or …

被引用次数：1 相关文章所有 3 个版本

[PDF] aclanthology.org

Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding

X Cheng, Z Zhu, B Yang, X Zhuang, H Li… - Findings of the …, 2024 - aclanthology.org

Owing to the scarcity of labeled training data, Spoken Language Understanding (SLU) is still
a challenging task in low-resource languages. Therefore, zero-shot cross-lingual SLU …

被引用次数：1 相关文章所有 4 个版本

[PDF] arxiv.org

Geometry cloak: Preventing tgs-based 3d reconstruction from copyrighted images

Q Song, Z Luo, KC Cheung, S See, R Wan - arXiv preprint arXiv …, 2024 - arxiv.org

Single-view 3D reconstruction methods like Triplane Gaussian Splatting (TGS) have
enabled high-quality 3D model generation from just a single image input within seconds …

被引用次数：1 相关文章所有 3 个版本

高级搜索

QQ 群