You can ground earlier than see: An effective and efficient pipeline for temporal sentence grounding in compressed videos

X Fang, D Liu, P Zhou, G Nan - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Given an untrimmed video, temporal sentence grounding (TSG) aims to locate a target
moment semantically according to a sentence query. Although previous respectable works …

Towards automated urban planning: When generative and chatgpt-like ai meets urban planning

D Wang, CT Lu, Y Fu - arXiv preprint arXiv:2304.03892, 2023 - arxiv.org
The two fields of urban planning and artificial intelligence (AI) arose and developed
separately. However, there is now cross-pollination and increasing interest in both fields to …

Generative Visual Compression: A Review

B Chen, S Yin, P Chen, S Wang, Y Ye - arXiv preprint arXiv:2402.02140, 2024 - arxiv.org
Artificial Intelligence Generated Content (AIGC) is leading a new technical revolution for the
acquisition of digital content and impelling the progress of visual compression towards …

Non-semantics suppressed mask learning for unsupervised video semantic compression

Y Tian, G Lu, G Zhai, Z Gao - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Most video compression methods aim to improve the decoded video visual quality, instead
of particularly guaranteeing the semantic-completeness, which deteriorates downstream …

Clsa: a contrastive learning framework with selective aggregation for video rescaling

Y Tian, Y Yan, G Zhai, L Chen… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Video rescaling has recently drawn extensive attention for its practical applications such as
video compression. Compared to video super-resolution, which focuses on upscaling …

Rethinking object saliency ranking: A novel whole-flow processing paradigm

M Song, L Li, D Wu, W Song… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Existing salient object detection methods are capable of predicting binary maps that
highlight visually salient regions. However, these methods are limited in their ability to …

SMC++: Masked Learning of Unsupervised Video Semantic Compression

Y Tian, G Lu, G Zhai - arXiv preprint arXiv:2406.04765, 2024 - arxiv.org
Most video compression methods focus on human visual perception, neglecting semantic
preservation. This leads to severe semantic loss during the compression, hampering …

Rethinking Video Error Concealment: A Benchmark Dataset

B Zheng, M Wang - 2023 IEEE International Conference on …, 2023 - ieeexplore.ieee.org
Error concealment is an important technique to restore a damaged video bistream. Although
data-driven in-painting methods can be directly applied to video error concealment, existing …

When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding

P Zhang, J Li, M Wang, N Sebe, S Kwong… - arXiv preprint arXiv …, 2024 - arxiv.org
Existing codecs are designed to eliminate intrinsic redundancies to create a compact
representation for compression. However, strong external priors from Multimodal Large …

Your Camera Improves Your Point Cloud Compression

Y Lin, T Xu, Z Zhu, Y Li, Z Wang… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
LiDAR point cloud compression is important for autonomous driving as it consumes a lot of
storage and bandwidth. Although the fusion of camera and LiDAR for vision perception has …