Multi-path transformer is better: A case study on neural machine translation

[PDF] arxiv.org

Introduction to Transformers: an NLP Perspective

T Xiao, J Zhu - arXiv preprint arXiv:2311.17633, 2023 - arxiv.org

Transformers have dominated empirical machine learning models of natural language
processing. In this paper, we introduce basic concepts of Transformers and present key …

Cited by 20 · All 4 versions

[PDF] arxiv.org

PartialFormer: Modeling Part Instead of Whole

T Zheng, B Li, H Bao, W Shan, T Xiao, J Zhu - arXiv preprint arXiv …, 2023 - arxiv.org

The design choices in Transformer feed-forward neural networks have resulted in significant
computational and parameter overhead. In this work, we emphasize the importance of …

Cited by 1 · All 2 versions

[PDF] aclanthology.org

Partialformer: Modeling part instead of whole for machine translation

T Zheng, B Li, H Bao, J Wang, W Shan… - Findings of the …, 2024 - aclanthology.org

The design choices in Transformer feed-forward neural networks have resulted in significant
computational and parameter overhead. In this work, we emphasize the importance of …
