A review on deep learning techniques for video prediction

S Oprea, P Martinez-Gonzalez… - … on Pattern Analysis …, 2020 - ieeexplore.ieee.org
The ability to predict, anticipate and reason about future outcomes is a key component of
intelligent decision-making systems. In light of the success of deep learning in computer …

Polarized self-attention: Towards high-quality pixel-wise regression

H Liu, F Liu, X Fan, D Huang - arXiv preprint arXiv:2107.00782, 2021 - arxiv.org
Pixel-wise regression is probably the most common problem in fine-grained computer vision
tasks, such as estimating keypoint heatmaps and segmentation masks. These regression …

Deep high-resolution representation learning for visual recognition

J Wang, K Sun, T Cheng, B Jiang… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
High-resolution representations are essential for position-sensitive vision problems, such as
human pose estimation, semantic segmentation, and object detection. Existing state-of-the …

A comprehensive review of modern object segmentation approaches

Y Wang, U Ahsan, H Li, M Hagen - Foundations and Trends® …, 2022 - nowpublishers.com
Image segmentation is the task of associating pixels in an image with their respective object
class labels. It has a wide range of applications in many industries including healthcare …

High-resolution representations for labeling pixels and regions

K Sun, Y Zhao, B Jiang, T Cheng, B Xiao, D Liu… - arXiv preprint arXiv …, 2019 - arxiv.org
High-resolution representation learning plays an essential role in many vision problems, eg,
pose estimation and semantic segmentation. The high-resolution network (HRNet)~\cite …

Psanet: Point-wise spatial attention network for scene parsing

H Zhao, Y Zhang, S Liu, J Shi… - Proceedings of the …, 2018 - openaccess.thecvf.com
We notice information flow in convolutional neural networks is restricted inside local
neighborhood regions due to the physical design of convolutional filters, which limits the …

Denseaspp for semantic segmentation in street scenes

M Yang, K Yu, C Zhang, Z Li… - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Semantic image segmentation is a basic street scene understanding task in autonomous
driving, where each pixel in a high resolution image is categorized into a set of semantic …

Large-scale video panoptic segmentation in the wild: A benchmark

J Miao, X Wang, Y Wu, W Li, X Zhang… - Proceedings of the …, 2022 - openaccess.thecvf.com
In this paper, we present a new large-scale dataset for the video panoptic segmentation
task, which aims to assign semantic classes and track identities to all pixels in a video. As …

A survey on deep learning technique for video segmentation

T Zhou, F Porikli, DJ Crandall… - IEEE transactions on …, 2022 - ieeexplore.ieee.org
Video segmentation—partitioning video frames into multiple segments or objects—plays a
critical role in a broad range of practical applications, from enhancing visual effects in movie …

Polarized self-attention: Towards high-quality pixel-wise mapping

H Liu, F Liu, X Fan, D Huang - Neurocomputing, 2022 - Elsevier
We address the pixel-wise mapping problem that commonly exists in the fine-grained
computer vision tasks, such as estimating keypoint heatmaps and segmentation masks …