Multi-modal representation learning with text-driven soft masks

J Park, B Han - Proceedings of the IEEE/CVF Conference …, 2023 - openaccess.thecvf.com
We propose a visual-linguistic representation learning approach within a self-supervised
learning framework by introducing a new operation, loss, and data augmentation strategy …

Multi-Modal Representation Learning with Text-Driven Soft Masks

J Park, B Han - 2023 IEEE/CVF Conference on Computer Vision and …, 2023 - computer.org
We propose a visual-linguistic representation learning approach within a self-supervised
learning framework by introducing a new operation, loss, and data augmentation strategy …

Multi-Modal Representation Learning with Text-Driven Soft Masks

J Park, B Han - arXiv e-prints, 2023 - ui.adsabs.harvard.edu
We propose a visual-linguistic representation learning approach within a self-supervised
learning framework by introducing a new operation, loss, and data augmentation strategy …

Multi-Modal Representation Learning with Text-Driven Soft Masks

J Park, B Han - arXiv preprint arXiv:2304.00719, 2023 - arxiv.org
We propose a visual-linguistic representation learning approach within a self-supervised
learning framework by introducing a new operation, loss, and data augmentation strategy …

Multi-Modal Representation Learning with Text-Driven Soft Masks

J Park, B Han - 2023 IEEE/CVF Conference on Computer …, 2023 - ieeexplore.ieee.org
We propose a visual-linguistic representation learning approach within a self-supervised
learning framework by introducing a new operation, loss, and data augmentation strategy …

Multi-Modal Representation Learning with Text-Driven Soft Masks

B Han, J Park - cvpr2023.thecvf.com
Multi-Modal Representation Learning with Text-Driven Soft Masks Page 1 Multi-Modal
Representation Learning with Text-Driven Soft Masks Bohyung Han Seoul National University …

Multi-Modal Representation Learning with Text-Driven Soft Masks

B Han, J Park - cvpr.thecvf.com
Multi-Modal Representation Learning with Text-Driven Soft Masks Page 1 Multi-Modal
Representation Learning with Text-Driven Soft Masks Bohyung Han Seoul National University …