Learning based multi-modality image and video compression

You can ground earlier than see: An effective and efficient pipeline for temporal sentence grounding in compressed videos

X Fang, D Liu, P Zhou, G Nan - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Given an untrimmed video, temporal sentence grounding (TSG) aims to locate a target
moment semantically according to a sentence query. Although previous respectable works …

被引用次数：30 相关文章所有 7 个版本

[PDF] arxiv.org

Towards automated urban planning: When generative and chatgpt-like ai meets urban planning

D Wang, CT Lu, Y Fu - arXiv preprint arXiv:2304.03892, 2023 - arxiv.org

The two fields of urban planning and artificial intelligence (AI) arose and developed
separately. However, there is now cross-pollination and increasing interest in both fields to …

被引用次数：41 相关文章所有 3 个版本

[PDF] arxiv.org

Generative Visual Compression: A Review

B Chen, S Yin, P Chen, S Wang, Y Ye - arXiv preprint arXiv:2402.02140, 2024 - arxiv.org

Artificial Intelligence Generated Content (AIGC) is leading a new technical revolution for the
acquisition of digital content and impelling the progress of visual compression towards …

被引用次数：3 相关文章所有 2 个版本

[PDF] thecvf.com

Non-semantics suppressed mask learning for unsupervised video semantic compression

Y Tian, G Lu, G Zhai, Z Gao - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

Most video compression methods aim to improve the decoded video visual quality, instead
of particularly guaranteeing the semantic-completeness, which deteriorates downstream …

被引用次数：5 相关文章所有 3 个版本

[PDF] google.com

Clsa: a contrastive learning framework with selective aggregation for video rescaling

Y Tian, Y Yan, G Zhai, L Chen… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Video rescaling has recently drawn extensive attention for its practical applications such as
video compression. Compared to video super-resolution, which focuses on upscaling …

被引用次数：13 相关文章所有 6 个版本

[PDF] researchgate.net

Rethinking object saliency ranking: A novel whole-flow processing paradigm

M Song, L Li, D Wu, W Song… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Existing salient object detection methods are capable of predicting binary maps that
highlight visually salient regions. However, these methods are limited in their ability to …

被引用次数：5 相关文章所有 8 个版本

[PDF] arxiv.org

SMC++: Masked Learning of Unsupervised Video Semantic Compression

Y Tian, G Lu, G Zhai - arXiv preprint arXiv:2406.04765, 2024 - arxiv.org

Most video compression methods focus on human visual perception, neglecting semantic
preservation. This leads to severe semantic loss during the compression, hampering …

Rethinking Video Error Concealment: A Benchmark Dataset

B Zheng, M Wang - 2023 IEEE International Conference on …, 2023 - ieeexplore.ieee.org

Error concealment is an important technique to restore a damaged video bistream. Although
data-driven in-painting methods can be directly applied to video error concealment, existing …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding

P Zhang, J Li, M Wang, N Sebe, S Kwong… - arXiv preprint arXiv …, 2024 - arxiv.org

Existing codecs are designed to eliminate intrinsic redundancies to create a compact
representation for compression. However, strong external priors from Multimodal Large …

Your Camera Improves Your Point Cloud Compression

Y Lin, T Xu, Z Zhu, Y Li, Z Wang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

LiDAR point cloud compression is important for autonomous driving as it consumes a lot of
storage and bandwidth. Although the fusion of camera and LiDAR for vision perception has …

被引用次数：2 相关文章

高级搜索

QQ 群