Backdoor attacks and countermeasures on deep learning: A comprehensive review

Y Gao, BG Doan, Z Zhang, S Ma, J Zhang, A Fu… - arXiv preprint arXiv …, 2020 - arxiv.org
This work provides the community with a timely comprehensive review of backdoor attacks
and countermeasures on deep learning. According to the attacker's capability and affected …

I know what you trained last summer: A survey on stealing machine learning models and defences

D Oliynyk, R Mayer, A Rauber - ACM Computing Surveys, 2023 - dl.acm.org
Machine-Learning-as-a-Service (MLaaS) has become a widespread paradigm, making
even the most complex Machine Learning models available for clients via, e.g., a pay-per …

The false promise of imitating proprietary LLMs

A Gudibande, E Wallace, C Snell, X Geng, H Liu… - arXiv preprint arXiv …, 2023 - arxiv.org
An emerging method to cheaply improve a weaker language model is to finetune it on
outputs from a stronger model, such as a proprietary system like ChatGPT (e.g., Alpaca, Self …

CATER: Intellectual property protection on text generation APIs via conditional watermarks

X He, Q Xu, Y Zeng, L Lyu, F Wu… - Advances in Neural …, 2022 - proceedings.neurips.cc
Previous works have validated that text generation APIs can be stolen through imitation
attacks, causing IP violations. In order to protect the IP of text generation APIs, recent work …

A survey on ChatGPT: AI-generated contents, challenges, and solutions

Y Wang, Y Pan, M Yan, Z Su… - IEEE Open Journal of the …, 2023 - ieeexplore.ieee.org
With the widespread use of large artificial intelligence (AI) models such as ChatGPT,
AI-generated content (AIGC) has garnered increasing attention and is leading a paradigm shift …

M4: Multi-generator, multi-domain, and multi-lingual black-box machine-generated text detection

Y Wang, J Mansurov, P Ivanov, J Su… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have demonstrated remarkable capability to generate fluent
responses to a wide variety of user queries. However, this has also raised concerns about …

A recipe for watermarking diffusion models

Y Zhao, T Pang, C Du, X Yang, NM Cheung… - arXiv preprint arXiv …, 2023 - arxiv.org
Recently, diffusion models (DMs) have demonstrated their advantageous potential for
generative tasks. Widespread interest exists in incorporating DMs into downstream …

Protecting intellectual property of language generation APIs with lexical watermark

X He, Q Xu, L Lyu, F Wu, C Wang - … of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org
Nowadays, due to the breakthrough in natural language generation (NLG), including
machine translation, document summarization, image captioning, etc., NLG models have …

A survey of deep neural network watermarking techniques

Y Li, H Wang, M Barni - Neurocomputing, 2021 - Elsevier
Protecting the Intellectual Property Rights (IPR) associated to Deep Neural
Networks (DNNs) is a pressing need pushed by the high costs required to train such …

Thieves on Sesame Street! Model extraction of BERT-based APIs

K Krishna, GS Tomar, AP Parikh, N Papernot… - arXiv preprint arXiv …, 2019 - arxiv.org
We study the problem of model extraction in natural language processing, in which an
adversary with only query access to a victim model attempts to reconstruct a local copy of …