On Task-personalized Multimodal Few-shot Learning for Visually-rich Document Entity Retrieval

文章

学术资源搜索

获得 2 条结果（用时0.02秒）

我的图书馆

On Task-personalized Multimodal Few-shot Learning for Visually-rich Document Entity Retrieval

在引用文章中搜索

[PDF] arxiv.org

Deep Learning based Visually Rich Document Content Understanding: A Survey

Y Ding, J Lee, SC Han - arXiv preprint arXiv:2408.01287, 2024 - arxiv.org

Visually Rich Documents (VRDs) are essential in academia, finance, medical fields, and
marketing due to their multimodal information content. Traditional methods for extracting …

被引用次数：2 相关文章所有 3 个版本

[PDF] aaai.org

On Disentanglement of Asymmetrical Knowledge Transfer for Modality-Task Agnostic Federated Learning

J Chen, A Zhang - Proceedings of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org

There has been growing concern regarding data privacy during the development and
deployment of Multimodal Foundation Models for Artificial General Intelligence (AGI), while …

被引用次数：8 相关文章

高级搜索

QQ 群

On Task-personalized Multimodal Few-shot Learning for Visually-rich Document Entity Retrieval

Deep Learning based Visually Rich Document Content Understanding: A Survey

On Disentanglement of Asymmetrical Knowledge Transfer for Modality-Task Agnostic Federated Learning

引用