Cross modal compression: Towards human-comprehensible semantic compression

Intellicise wireless networks from semantic communications: A survey, research issues, and challenges

P Zhang, W Xu, Y Liu, X Qin, K Niu… - … Surveys & Tutorials, 2024 - ieeexplore.ieee.org

Information and communication technology (ICT) has been an essential part of modern
society. However, the current communication systems are not sufficient to meet the demands …

被引用次数：10 相关文章

[PDF] springer.com

Overview of intelligent video coding: from model-based to learning-based approaches

S Ma, J Gao, R Wang, J Chang, Q Mao, Z Huang… - Visual Intelligence, 2023 - Springer

Intelligent video coding (IVC), which dates back to the late 1980s with the concept of
encoding videos with knowledge and semantics, includes visual content compact …

被引用次数：20 相关文章所有 2 个版本

Misc: Ultra-low bitrate image semantic compression driven by large multimodal model

C Li, G Lu, D Feng, H Wu, Z Zhang, X Liu… - … on Image Processing, 2024 - ieeexplore.ieee.org

With the evolution of storage and communication protocols, ultra-low bitrate image
compression has become a highly demanding topic. However, all existing compression …

被引用次数：9 相关文章所有 3 个版本

Rethinking semantic image compression: Scalable representation with cross-modality transfer

P Zhang, S Wang, M Wang, J Li… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

This article proposes the scalable cross-modality compression (SCMC) paradigm, in which
the image compression problem is further cast into a representation task by hierarchically …

被引用次数：23 相关文章所有 5 个版本

Semantic-aware visual decomposition for image coding

J Chang, J Zhang, J Li, S Wang, Q Mao, C Jia… - International Journal of …, 2023 - Springer

In this paper, we propose a novel image coding framework with semantic-aware visual
decomposition towards extremely low bitrate compression. In particular, an input image is …

被引用次数：7 相关文章所有 4 个版本

Cross modal compression with variable rate prompt

J Gao, J Li, C Jia, S Wang, S Ma… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Traditional image/video compression compresses the highly redundant visual data while
preserving signal fidelity. Recently, cross modal compression (CMC) is proposed to …

被引用次数：4 相关文章所有 2 个版本

[PDF] arxiv.org

Generative Visual Compression: A Review

B Chen, S Yin, P Chen, S Wang, Y Ye - arXiv preprint arXiv:2402.02140, 2024 - arxiv.org

Artificial Intelligence Generated Content (AIGC) is leading a new technical revolution for the
acquisition of digital content and impelling the progress of visual compression towards …

被引用次数：8 相关文章所有 2 个版本

Rate-Distortion Optimized Cross Modal Compression with Multiple Domains

J Gao, C Jia, Z Huang, S Wang, S Ma… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Cross-modal compression (CMC) aims to compress highly redundant visual data into
compact, common, and human-comprehensible domains, such as text, to preserve semantic …

被引用次数：2 相关文章

[PDF] arxiv.org

When video coding meets multimodal large language models: A unified paradigm for video coding

P Zhang, J Li, M Wang, N Sebe, S Kwong… - arXiv preprint arXiv …, 2024 - arxiv.org

Existing codecs are designed to eliminate intrinsic redundancies to create a compact
representation for compression. However, strong external priors from Multimodal Large …

被引用次数：1 相关文章所有 3 个版本

[PDF] arxiv.org

Large Language Models and Artificial Intelligence Generated Content Technologies Meet Communication Networks

J Guo, M Wang, H Yin, B Song, Y Chi… - IEEE Internet of …, 2024 - ieeexplore.ieee.org

Artificial intelligence generated content (AIGC) technologies, with a predominance of large
language models (LLMs), have demonstrated remarkable performance improvements in …

高级搜索

QQ 群