Deep multi-view learning methods: A review

X Yan, S Hu, Y Mao, Y Ye, H Yu - Neurocomputing, 2021 - Elsevier
Multi-view learning (MVL) has attracted increasing attention and achieved great practical
success by exploiting complementary information of multiple features or modalities …

Comparative analysis on cross-modal information retrieval: A review

P Kaur, HS Pannu, AK Malhi - Computer Science Review, 2021 - Elsevier
Human beings experience life through a spectrum of modes such as vision, taste, hearing,
smell, and touch. These multiple modes are integrated for information processing in our …

Deep multimodal representation learning: A survey

W Guo, J Wang, S Wang - Ieee Access, 2019 - ieeexplore.ieee.org
Multimodal representation learning, which aims to narrow the heterogeneity gap among
different modalities, plays an indispensable role in the utilization of ubiquitous multimodal …

Progressive learning with multi-scale attention network for cross-domain vehicle re-identification

Y Wang, J Peng, H Wang, M Wang - Science China Information Sciences, 2022 - Springer
Vehicle re-identification (reID) aims to identify vehicles across different cameras that have
non-overlapping views. Most existing vehicle reID approaches train the reID model with well …

Survey on deep multi-modal data analytics: Collaboration, rivalry, and fusion

Y Wang - ACM Transactions on Multimedia Computing …, 2021 - dl.acm.org
With the development of web technology, multi-modal or multi-view data has surged as a
major stream for big data, where each modal/view encodes individual property of data …

Unsupervised MR-to-CT synthesis using structure-constrained CycleGAN

H Yang, J Sun, A Carass, C Zhao, J Lee… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
Synthesizing a CT image from an available MR image has recently emerged as a key goal
in radiotherapy treatment planning for cancer patients. CycleGANs have achieved promising …

Language-driven artistic style transfer

TJ Fu, XE Wang, WY Wang - European Conference on Computer Vision, 2022 - Springer
Despite having promising results, style transfer, which requires preparing style images in
advance, may result in lack of creativity and accessibility. Following human instruction, on …

Neural networks for entity matching: A survey

N Barlaug, JA Gulla - ACM Transactions on Knowledge Discovery from …, 2021 - dl.acm.org
Entity matching is the problem of identifying which records refer to the same real-world
entity. It has been actively researched for decades, and a variety of different approaches …

Where-and-when to look: Deep siamese attention networks for video-based person re-identification

L Wu, Y Wang, J Gao, X Li - IEEE Transactions on Multimedia, 2018 - ieeexplore.ieee.org
Video-based person re-identification (re-id) is a central application in surveillance systems
with a significant concern in security. Matching persons across disjoint camera views in their …

Creating something from nothing: Unsupervised knowledge distillation for cross-modal hashing

H Hu, L Xie, R Hong, Q Tian - Proceedings of the IEEE/CVF …, 2020 - openaccess.thecvf.com
In recent years, cross-modal hashing (CMH) has attracted increasing attentions, mainly
because its potential ability of mapping contents from different modalities, especially in …