MENet: Lightweight multimodality enhancement network for detecting salient objects in RGB-thermal images

J Wu, W Zhou, X Qian, J Lei, L Yu, T Luo - Neurocomputing, 2023 - Elsevier
Most red–green–blue and thermal (RGB-T) salient object detection methods require high
memory consumption and incur large computational costs, which limit their applicability. To …

TIVE: A toolbox for identifying video instance segmentation errors

W Jia, L Yang, Z Jia, W Zhao, Y Zhou, Q Song - Neurocomputing, 2023 - Elsevier
In this paper, we introduce TIVE, a Toolbox for Identifying Video instance segmentation
Errors. By directly operating output prediction files, TIVE defines isolated error types and …

SSDMM-VAE: variational multi-modal disentangled representation learning

AK Mondal, A Sailopal, P Singla, P Ap - Applied Intelligence, 2023 - Springer
Multi-modal learning aims at simultaneously modelling data from several modalities such as
image, text and speech. The goal is to simultaneously learn representations and make them …

Structural Transformer with Region Strip Attention for video object segmentation

Q Guan, H Fang, C Han, Z Wang, R Zhang, Y Zhang… - Neurocomputing, 2024 - Elsevier
Memory-based methods in semi-supervised video object segmentation (VOS) achieve
competitive performance by performing feature similarity between the current frame and …

Video Object Segmentation Using Multi-Scale Attention-Based Siamese Network

Z Zhu, L Qiu, J Wang, J Xiong, H Peng - Electronics, 2023 - mdpi.com
Video target segmentation is a fundamental problem in computer vision that aims to
segment targets from a background by learning their appearance information and movement …

Black-box Attack against Self-supervised Video Object Segmentation Models with Contrastive Loss

Y Chen, R Yao, Y Zhou, J Zhao, B Liu… - ACM Transactions on …, 2023 - dl.acm.org
Deep learning models have been proven to be susceptible to malicious adversarial attacks,
which manipulate input images to deceive the model into making erroneous decisions …

Self-supervised Video Object Segmentation Using Motion Feature Compensation

T Zhang, B Li - International Conference on Artificial Neural Networks, 2023 - Springer
Video object segmentation is a popular area of research in computer vision. Traditional
models are trained using annotated data, which is both time-consuming and expensive …

Location-scale Family of Laplacian Distributions and Its Applications to VAE and Some Extended VAES

A Zhu, P Cao - 2023 - researchsquare.com
Abstract Like Gaussian distribution, Laplacian distribution is also a location-scale family of
distributions. So we can choose the standard Laplace distribution (with location= 0, scale …

Hybrid Siamese-attention Robust Tracker for SOT

TE Ghoniemy, MM Fouad - … of the 6th International Conference on …, 2022 - dl.acm.org
Single object tracking (SOT) finds the object correspondence in a spatio-temporal domain,
and incorporates various applications in Computer Vision. The Siamese trackers have …