Deep learning-based compressed domain multimedia for man and machine: a taxonomy and application to point cloud classification

A Seleem, AFR Guarda, NMM Rodrigues… - IEEE Access, 2023 - ieeexplore.ieee.org
In the current golden age of multimedia, human visualization is no longer the single main
target, with the final consumer often being a machine which performs some processing or …

Point cloud geometry and color coding in a learning-based ecosystem for jpeg coding standards

AFR Guarda, NMM Rodrigues… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Despite its novelty, learning-based coding for images and point clouds is already
outperforming some of the best long-standing conventional codecs. In addition to its rising …

Zero-shot image classification via visual–semantic feature decoupling

X Sun, Y Tian, H Li - Multimedia Systems, 2024 - Springer
Zero-shot image classification refers to the use of labeled images to train a classification
model that can correctly classify images of unseen categories. Traditional zero-shot methods …

The Image Calculator: 10x Faster Image-AI Inference by Replacing JPEG with Self-designing Storage Format

U Sirin, S Idreos - Proceedings of the ACM on Management of Data, 2024 - dl.acm.org
Numerous applications today rely on artificial intelligence over images. Image AI is,
however, extremely expensive. In particular, the inference cost of image AI dominates the …

Conditional encoder-based adaptive deep image compression with classification-driven semantic awareness

Z Lei, W Zhang, X Hong, J Shi, M Su, C Lin - Electronics, 2023 - mdpi.com
This paper proposes a new algorithm for adaptive deep image compression (DIC) that can
compress images for different purposes or contexts at different rates. The algorithm can …

The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine

AFR Guarda, NMM Rodrigues, F Pereira - arXiv preprint arXiv:2409.08130, 2024 - arxiv.org
Efficient point cloud coding has become increasingly critical for multiple applications such as
virtual reality, autonomous driving, and digital twin systems, where rich and interactive 3D …

[PDF][PDF] Human-Machine Collaborative Image and Video Compression: A Survey

H Li, X Zhang, S Wang, S Wang… - APSIPA Transactions on …, 2024 - nowpublishers.com
Traditional image and video compression methods are designed to maintain the quality of
human visual perception, which makes it necessary to reconstruct the image or video before …

Deep learning-based compressed domain point cloud classification

A Seleem, AFR Guarda… - … Conference on Image …, 2023 - ieeexplore.ieee.org
Deep learning (DL) based tools have recently reached performance levels similar to state-of-
the-art hand-crafted methods for Point Cloud (PC) coding and classification. In 2022, JPEG …

Enhanced multi-branch learning for long-tailed image recognition

J Wang, Z Guo, D Yi, Y Hua, Q Meng - Multimedia Systems, 2025 - Springer
Due to the severe class imbalance between head classes and tail classes of long-tailed
data, deep learning algorithms face significant challenges when dealing with long-tailed …

SS-CMT: a label independent cross-modal transferable adversarial video attack with sparse strategy

S Zhang, Z Cui, F Li, X Han, Z Huang - Multimedia Systems, 2024 - Springer
Deep neural networks are vulnerable to adversarial examples which are generated by
adding carefully crafted perturbations on benign examples. Some research works explore …