A survey on heterogeneous federated learning

D Gao, X Yao, Q Yang - arXiv preprint arXiv:2210.04505, 2022 - arxiv.org
Federated learning (FL) has been proposed to protect data privacy and virtually assemble
the isolated data silos by cooperatively training models among organizations without …

Multi-target knowledge distillation via student self-reflection

J Gou, X Xiong, B Yu, L Du, Y Zhan, D Tao - International Journal of …, 2023 - Springer
Abstract Knowledge distillation is a simple yet effective technique for deep model
compression, which aims to transfer the knowledge learned by a large teacher model to a …

Segvit v2: Exploring efficient and continual semantic segmentation with plain vision transformers

B Zhang, L Liu, MH Phan, Z Tian, C Shen… - International Journal of …, 2024 - Springer
This paper investigates the capability of plain Vision Transformers (ViTs) for semantic
segmentation using the encoder–decoder framework and introduce SegViTv2. In this study …

Multi-scale fine-grained alignments for image and sentence matching

W Li, Y Wang, Y Su, X Li, AA Liu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Image and sentence matching is a critical task to bridge the visual and textual discrepancy
due to the heterogeneous modalities. Great progress has been made by exploring the …

Deep semantic-aware proxy hashing for multi-label cross-modal retrieval

Y Huo, Q Qin, J Dai, L Wang, W Zhang… - … on Circuits and …, 2023 - ieeexplore.ieee.org
Deep hashing has attracted broad interest in cross-modal retrieval because of its low cost
and efficient retrieval benefits. To capture the semantic information of raw samples and …

Robust semantic segmentation with multi-teacher knowledge distillation

A Amirkhani, A Khosravian, M Masih-Tehrani… - IEEE …, 2021 - ieeexplore.ieee.org
Recent studies have recently exploited knowledge distillation (KD) technique to address
time-consuming annotation task in semantic segmentation, through which one teacher …

Continual learning for cross-modal image-text retrieval based on domain-selective attention

R Yang, S Wang, Y Gu, J Wang, Y Sun, H Zhang… - Pattern Recognition, 2024 - Elsevier
Cross-modal image-text retrieval (CMITR) has been a high-value research topic for more
than a decade. In most of the previous studies, the data for all tasks are trained as a single …

Efficient semi-supervised multimodal hashing with importance differentiation regression

C Zheng, L Zhu, Z Zhang, J Li… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Multi-modal hashing learns compact binary hash codes by collaborating heterogeneous
multi-modal features at both the model training and online retrieval stages to support large …

Alignment efficient image-sentence retrieval considering transferable cross-modal representation learning

Y Yang, J Guo, G Li, L Li, W Li, J Yang - Frontiers of Computer Science, 2024 - Springer
Traditional image-sentence cross-modal retrieval methods usually aim to learn consistent
representations of heterogeneous modalities, thereby to search similar instances in one …

Knowledge Decomposition and Replay: A Novel Cross-modal Image-Text Retrieval Continual Learning Method

R Yang, S Wang, H Zhang, S Xu, YH Guo… - Proceedings of the 31st …, 2023 - dl.acm.org
To enable machines to mimic human cognitive abilities and alleviate the catastrophic
forgetting problem in cross-modal image-text retrieval (CMITR), this paper proposes a novel …