Weakly supervised visual saliency prediction

J Xie, Z Liu, G Li, X Lu, T Chen - Knowledge-Based Systems, 2024 - Elsevier

The human visual system effectively analyzes scenes based on local, global and semantic
properties. Deep learning-based saliency prediction models adopted two-stream networks …

Cited by 3 · Related articles · All 2 versions

Feature aggregation with transformer for RGB-T salient object detection

P Zhang, M Xu, Z Zhang, P Gao, J Zhang - Neurocomputing, 2023 - Elsevier

The main purpose of RGB-T salient object detection (SOD) is to fully integrate and exploit
the information from the complementary fusion of modalities to address the …

Cited by 6 · Related articles · All 2 versions

Multi-granular semantic mining for weakly supervised semantic segmentation

M Zhang, J Li, T Zhou - Proceedings of the 30th ACM International …, 2022 - dl.acm.org

This paper solves the problem of learning image semantic segmentation using image-level
supervision. The task is promising in terms of reducing annotation efforts, yet extremely …

Cited by 8 · Related articles · All 2 versions

[PDF] ulster.ac.uk

Elwnet: An extremely lightweight approach for real-time salient object detection

Z Wang, Y Zhang, Y Liu, D Zhu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Existing lightweight salient object detection (SOD) methods aim to solve the problem of high
computational costs that is prevalent with heavyweight methods. However, compared with …

Cited by 6 · Related articles · All 3 versions

Sc2net: scale-aware crowd counting network with pyramid dilated convolution

L Liang, H Zhao, F Zhou, Q Zhang, Z Song, Q Shi - Applied Intelligence, 2023 - Springer

Accurate crowd counting is still challenging due to the variations of crowd heads. Most
crowd counting methods adopt multi-branch networks to extract multi-scale information …

Cited by 8 · Related articles · All 3 versions

CLFormer: a unified transformer-based framework for weakly supervised crowd counting and localization

M Deng, H Zhao, M Gao - The Visual Computer, 2024 - Springer

Recent progress in crowd counting and localization methods mainly relies on expensive
point-level annotations and convolutional neural networks with limited receptive field, which …

Cited by 4 · Related articles · All 2 versions

PDDNet: lightweight congested crowd counting via pyramid depth-wise dilated convolution

L Liang, H Zhao, F Zhou, M Ma, F Yao, X Ji - Applied Intelligence, 2023 - Springer

The accuracy of crowd counting is susceptible to scale variations of crowd head in the
congested scene. Some counting networks, such as crowd density pre-classification …

Cited by 5 · Related articles · All 3 versions

MPLA-Net: Multiple Pseudo Label Aggregation Network for Weakly Supervised Video Salient Object Detection

C Ma, L Du, L Zhuo, J Li - … on Circuits and Systems for Video …, 2023 - ieeexplore.ieee.org

Weakly Supervised Video Salient Object Detection (WSVSOD) only requires coarse-grained
manual annotations, which can achieve a good trade-off between labeling efficiency and …

Cited by 1 · Related articles

[PDF] arxiv.org

Audio–visual collaborative representation learning for dynamic saliency prediction

H Ning, B Zhao, Z Hu, L He, E Pei - Knowledge-Based Systems, 2022 - Elsevier

The Dynamic Saliency Prediction (DSP) task simulates the human selective
attention mechanism to perceive a dynamic scene, which is significant and imperative in …

Cited by 7 · Related articles · All 4 versions

[HTML] mdpi.com

[HTML] Exploring Focus and Depth-Induced Saliency Detection for Light Field

Y Zhang, F Chen, Z Peng, W Zou, C Zhang - Entropy, 2023 - mdpi.com

An abundance of features in the light field has been demonstrated to be useful for saliency
detection in complex scenes. However, bottom-up saliency detection models are limited in …

Cited by 1 · Related articles · All 8 versions
