Intellicise wireless networks from semantic communications: A survey, research issues, and challenges

P Zhang, W Xu, Y Liu, X Qin, K Niu… - … Surveys & Tutorials, 2024 - ieeexplore.ieee.org
Information and communication technology (ICT) has been an essential part of modern
society. However, the current communication systems are not sufficient to meet the demands …

Overview of intelligent video coding: from model-based to learning-based approaches

S Ma, J Gao, R Wang, J Chang, Q Mao, Z Huang… - Visual Intelligence, 2023 - Springer
Intelligent video coding (IVC), which dates back to the late 1980s with the concept of
encoding videos with knowledge and semantics, includes visual content compact …

MISC: Ultra-low bitrate image semantic compression driven by large multimodal model

C Li, G Lu, D Feng, H Wu, Z Zhang, X Liu… - … on Image Processing, 2024 - ieeexplore.ieee.org
With the evolution of storage and communication protocols, ultra-low bitrate image
compression has become a highly demanding topic. However, all existing compression …

Rethinking semantic image compression: Scalable representation with cross-modality transfer

P Zhang, S Wang, M Wang, J Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
This article proposes the scalable cross-modality compression (SCMC) paradigm, in which
the image compression problem is further cast into a representation task by hierarchically …

Semantic-aware visual decomposition for image coding

J Chang, J Zhang, J Li, S Wang, Q Mao, C Jia… - International Journal of …, 2023 - Springer
In this paper, we propose a novel image coding framework with semantic-aware visual
decomposition towards extremely low bitrate compression. In particular, an input image is …

Cross modal compression with variable rate prompt

J Gao, J Li, C Jia, S Wang, S Ma… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Traditional image/video compression compresses the highly redundant visual data while
preserving signal fidelity. Recently, cross modal compression (CMC) is proposed to …

Generative Visual Compression: A Review

B Chen, S Yin, P Chen, S Wang, Y Ye - arXiv preprint arXiv:2402.02140, 2024 - arxiv.org
Artificial Intelligence Generated Content (AIGC) is leading a new technical revolution for the
acquisition of digital content and impelling the progress of visual compression towards …

Rate-Distortion Optimized Cross Modal Compression with Multiple Domains

J Gao, C Jia, Z Huang, S Wang, S Ma… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Cross-modal compression (CMC) aims to compress highly redundant visual data into
compact, common, and human-comprehensible domains, such as text, to preserve semantic …

When video coding meets multimodal large language models: A unified paradigm for video coding

P Zhang, J Li, M Wang, N Sebe, S Kwong… - arXiv preprint arXiv …, 2024 - arxiv.org
Existing codecs are designed to eliminate intrinsic redundancies to create a compact
representation for compression. However, strong external priors from Multimodal Large …

Large Language Models and Artificial Intelligence Generated Content Technologies Meet Communication Networks

J Guo, M Wang, H Yin, B Song, Y Chi… - IEEE Internet of …, 2024 - ieeexplore.ieee.org
Artificial intelligence generated content (AIGC) technologies, with a predominance of large
language models (LLMs), have demonstrated remarkable performance improvements in …