Binding touch to everything: Learning unified multimodal tactile representations

F Yang, C Feng, Z Chen, H Park… - Proceedings of the …, 2024 - openaccess.thecvf.com
The ability to associate touch with other modalities has huge implications for humans and
computational systems. However multimodal learning with touch remains challenging due to …

Generating visual scenes from touch

F Yang, J Zhang, A Owens - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
An emerging line of work has sought to generate plausible imagery from touch. Existing
approaches, however, tackle only narrow aspects of the visuo-tactile synthesis problem, and …

Intelligent model update strategy for sequential recommendation

Z Lv, W Zhang, Z Chen, S Zhang, K Kuang - Proceedings of the ACM on …, 2024 - dl.acm.org
Modern online platforms are increasingly employing recommendation systems to address
information overload and improve user engagement. There is an evolving paradigm in this …

Foundation models for recommender systems: A survey and new perspectives

C Huang, T Yu, K Xie, S Zhang, L Yao… - arXiv preprint arXiv …, 2024 - arxiv.org
Recently, Foundation Models (FMs), with their extensive knowledge bases and complex
architectures, have offered unique opportunities within the realm of recommender systems …

IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT

J Fu, X Ge, X Xin, A Karatzoglou, I Arapakis… - Proceedings of the 47th …, 2024 - dl.acm.org
Multimodal foundation models are transformative in sequential recommender systems,
leveraging powerful representation learning capabilities. While Parameter-efficient Fine …

Fewer Steps, Better Performance: Efficient Cross-Modal Clip Trimming for Video Moment Retrieval Using Language

X Fang, D Liu, W Fang, P Zhou, Z Xu, W Xu… - Proceedings of the …, 2024 - ojs.aaai.org
Given an untrimmed video and a sentence query, video moment retrieval using language
(VMR) aims to locate a target query-relevant moment. Since the untrimmed video is …

ART: rule bAsed futuRe-inference deducTion

M Li, T Zhao, B Jionghao, B He, J Miao… - Proceedings of the …, 2023 - aclanthology.org
Deductive reasoning is a crucial cognitive ability of humanity, allowing us to derive valid
conclusions from premises and observations. However, existing works mainly focus on …

Dynamic network for language-based fashion retrieval

H Li, Y Wu, F Wang - Proceedings of the 1st International Workshop on …, 2023 - dl.acm.org
Language-based fashion image retrieval, as a kind of composed image retrieval, presents a
substantial challenge in the domain of multi-modal retrieval. This task aims to retrieve the …