查看文章

arxiv.org 中的 [PDF]

Learned disentangled latent representations for scalable image coding for humans and machines

作者

Ezgi Özyılkan*, Mateen Ulhaq*, Hyomin Choi, Fabien Racapé

发表日期

2023/3/21

研讨会论文

2023 Data Compression Conference (DCC)

页码范围

42-51

出版商

IEEE

简介

As an increasing amount of image and video content will be analyzed by machines, there is demand for a new codec paradigm that is capable of compressing visual input primarily for the purpose of computer vision inference, while secondarily supporting input reconstruction. In this work, we propose a learned compression architecture that can be used to build such a codec. We introduce a novel variational formulation that explicitly takes feature data relevant to the desired inference task as input at the encoder side. As such, our learned scalable image codec encodes and transmits two disentangled latent representations for object detection and input reconstruction. We note that compared to relevant benchmarks, our proposed scheme yields a more compact latent representation that is specialized for the inference task. Our experiments show that our proposed system achieves a bit rate savings of 40.6% on the …

引用总数

被引用次数：7

2022202320241 3 3

学术搜索中的文章

Learned disentangled latent representations for scalable image coding for humans and machines

E Özyılkan, M Ulhaq, H Choi, F Racapé - 2023 Data Compression Conference (DCC), 2023

被引用次数：7 相关文章所有 6 个版本