Rethinking token-mixing mlp for mlp-based vision backbone

T Yu, X Li, Y Cai, M Sun, P Li - arXiv preprint arXiv:2106.14882, 2021 - arxiv.org
In the past decade, we have witnessed rapid progress in the machine vision backbone. By
introducing the inductive bias from the image processing, convolution neural network (CNN)
has achieved excellent performance in numerous computer vision tasks and has been
established as\emph {de facto} backbone. In recent years, inspired by the great success
achieved by Transformer in NLP tasks, vision Transformer models emerge. Using much less
inductive bias, they have achieved promising performance in computer vision tasks …

[引用][C] Rethinking token-mixing mlp for mlp-based vision backbone. arXiv 2021

T Yu, X Li, Y Cai, M Sun, P Li - arXiv preprint arXiv:2106.14882
以上显示的是最相近的搜索结果。 查看全部搜索结果