Online distillation-enhanced multi-modal transformer for sequential recommendation

F Yang, C Feng, Z Chen, H Park… - Proceedings of the …, 2024 - openaccess.thecvf.com

The ability to associate touch with other modalities has huge implications for humans and
computational systems. However multimodal learning with touch remains challenging due to …

Cited by: 13 · Related articles · All 4 versions

[PDF] thecvf.com

Generating visual scenes from touch

F Yang, J Zhang, A Owens - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

An emerging line of work has sought to generate plausible imagery from touch. Existing
approaches, however, tackle only narrow aspects of the visuo-tactile synthesis problem, and …

Cited by: 14 · Related articles · All 6 versions

[PDF] researchgate.net

Intelligent model update strategy for sequential recommendation

Z Lv, W Zhang, Z Chen, S Zhang, K Kuang - Proceedings of the ACM on …, 2024 - dl.acm.org

Modern online platforms are increasingly employing recommendation systems to address
information overload and improve user engagement. There is an evolving paradigm in this …

Cited by: 10 · Related articles · All 3 versions

[PDF] arxiv.org

Foundation models for recommender systems: A survey and new perspectives

C Huang, T Yu, K Xie, S Zhang, L Yao… - arXiv preprint arXiv …, 2024 - arxiv.org

Recently, Foundation Models (FMs), with their extensive knowledge bases and complex
architectures, have offered unique opportunities within the realm of recommender systems …

Cited by: 8 · Related articles · All 2 versions

[PDF] acm.org

IISAN: Efficiently Adapting Multimodal Representation for Sequential Recommendation with Decoupled PEFT

J Fu, X Ge, X Xin, A Karatzoglou, I Arapakis… - Proceedings of the 47th …, 2024 - dl.acm.org

Multimodal foundation models are transformative in sequential recommender systems,
leveraging powerful representation learning capabilities. While Parameter-efficient Fine …

Cited by: 1 · Related articles · All 3 versions

[PDF] aaai.org

Fewer Steps, Better Performance: Efficient Cross-Modal Clip Trimming for Video Moment Retrieval Using Language

X Fang, D Liu, W Fang, P Zhou, Z Xu, W Xu… - Proceedings of the …, 2024 - ojs.aaai.org

Given an untrimmed video and a sentence query, video moment retrieval using language
(VMR) aims to locate a target query-relevant moment. Since the untrimmed video is …

[PDF] aclanthology.org

ART: rule bAsed futuRe-inference deducTion

M Li, T Zhao, B Jionghao, B He, J Miao… - Proceedings of the …, 2023 - aclanthology.org

Deductive reasoning is a crucial cognitive ability of humanity, allowing us to derive valid
conclusions from premises and observations. However, existing works mainly focus on …

Dynamic network for language-based fashion retrieval

H Li, Y Wu, F Wang - Proceedings of the 1st International Workshop on …, 2023 - dl.acm.org

Language-based fashion image retrieval, as a kind of composed image retrieval, presents a
substantial challenge in the domain of multi-modal retrieval. This task aims to retrieve the …

Cited by: 1 · Related articles
