Authors
Tribhuvanesh Orekondy, Bernt Schiele, Mario Fritz
Publication date
2019/6
Conference
Computer Vision and Pattern Recognition (CVPR)
Description
Machine Learning (ML) models are increasingly deployed in the wild to perform a wide range of tasks. In this work, we ask to what extent an adversary can steal functionality of such "victim" models based solely on blackbox interactions: image in, predictions out. In contrast to prior work, we study complex victim blackbox models, and an adversary lacking knowledge of train/test data used by the model, its internals, and semantics over model outputs. We formulate model functionality stealing as a two-step approach: (i) querying a set of input images to the blackbox model to obtain predictions; and (ii) training a "knockoff" with queried image-prediction pairs. We make multiple remarkable observations: (a) querying random images from a different distribution than that of the blackbox training data results in a well-performing knockoff; (b) this is possible even when the knockoff is represented using a different architecture; and (c) our reinforcement learning approach additionally improves query sample efficiency in certain settings and provides performance gains. We validate model functionality stealing on a range of datasets and tasks, as well as show that a reasonable knockoff of an image analysis API could be created for as little as $30.
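A minimal sketch of the two-step pipeline the abstract describes, assuming a PyTorch setup. Everything concrete here is a hypothetical stand-in rather than the authors' released code: a local model plays the victim blackbox, torchvision's FakeData plays the out-of-distribution query images, and names such as `query_victim` are illustrative only.

```python
import torch
import torch.nn.functional as F
import torchvision
from torch.utils.data import DataLoader

# Hypothetical stand-in blackbox: in practice this would be a remote API
# (image in, predictions out); here a local model plays the victim.
victim = torchvision.models.resnet18(num_classes=10).eval()

def query_victim(images: torch.Tensor) -> torch.Tensor:
    """Query the blackbox and return its probability vectors."""
    with torch.no_grad():
        return F.softmax(victim(images), dim=1)

# Step (i): query images drawn from a *different* distribution than the
# victim's training data. FakeData is a placeholder query distribution.
transfer_set = torchvision.datasets.FakeData(
    size=512, image_size=(3, 224, 224), num_classes=10,
    transform=torchvision.transforms.ToTensor())
loader = DataLoader(transfer_set, batch_size=32, shuffle=True)

# Step (ii): train a knockoff -- possibly a different architecture than
# the victim -- on the collected image-prediction pairs.
knockoff = torchvision.models.resnet34(num_classes=10)
optimizer = torch.optim.SGD(knockoff.parameters(), lr=0.01, momentum=0.9)

knockoff.train()
for images, _ in loader:                 # ground-truth labels are never used
    soft_labels = query_victim(images)   # blackbox predictions as soft labels
    logits = knockoff(images)
    # Distillation-style cross-entropy against the victim's soft labels.
    loss = -(soft_labels * F.log_softmax(logits, dim=1)).sum(dim=1).mean()
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

Note that only image-prediction pairs cross the API boundary, which is why a random, architecture-mismatched knockoff can still recover the victim's functionality; the paper's reinforcement-learning variant replaces the random query sampling above with an adaptive policy to improve sample efficiency.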
Total citations
[Citations-per-year chart, 2018–2024]
Scholar articles
T Orekondy, B Schiele, M Fritz - Proceedings of the IEEE/CVF conference on computer …, 2019