A comprehensive survey on pretrained foundation models: A history from BERT to ChatGPT

C Zhou, Q Li, C Li, J Yu, Y Liu, G Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Pretrained Foundation Models (PFMs) are regarded as the foundation for various
downstream tasks with different data modalities. A PFM (e.g., BERT, ChatGPT, and GPT-4) is …

Neural machine translation: A review of methods, resources, and tools

Z Tan, S Wang, Z Yang, G Chen, X Huang, M Sun… - AI Open, 2020 - Elsevier
Machine translation (MT) is an important sub-field of natural language processing
that aims to translate natural languages using computers. In recent years, end-to-end neural …

Token-level self-evolution training for sequence-to-sequence learning

K Peng, L Ding, Q Zhong, Y Ouyang… - Proceedings of the …, 2023 - aclanthology.org
Adaptive training approaches, widely used in sequence-to-sequence models, commonly
reweigh the losses of different target tokens based on priors, e.g., word frequency. However …

Wait-info policy: Balancing source and target at information level for simultaneous machine translation

S Zhang, S Guo, Y Feng - arXiv preprint arXiv:2210.11220, 2022 - arxiv.org
Simultaneous machine translation (SiMT) outputs the translation while receiving the source
inputs, and hence needs to balance the received source information and translated target …

CLIO: Role-interactive multi-event head attention network for document-level event extraction

Y Ren, Y Cao, F Fang, P Guo, Z Lin… - Proceedings of the 29th …, 2022 - aclanthology.org
Transforming the large amounts of unstructured text on the Internet into structured event
knowledge is a critical, yet unsolved goal of NLP, especially when addressing document …

Improving neural machine translation with latent features feedback

Y Li, J Li, M Zhang - Neurocomputing, 2021 - Elsevier
Most state-of-the-art neural machine translation (NMT) models progressively encode feature
representation in a bottom-up feed-forward fashion. This traditional encoding mechanism …

The great misalignment problem in human evaluation of NLP methods

M Hämäläinen, K Alnajjar - arXiv preprint arXiv:2104.05361, 2021 - arxiv.org
We outline the Great Misalignment Problem in natural language processing research:
simply put, the problem definition is not in line with the method proposed and the …

Grammatically derived factual relation augmented neural machine translation

F Li, J Zhu, H Yan, Z Zhang - Applied Sciences, 2022 - mdpi.com
Featured Application: This paper introduces factual relation information into Transformer-
based neural machine translation to improve translation quality. Abstract: Transformer-based …

Machine Translation of Electrical Terminology Constraints

Z Wang, Y Chen, J Zhang - Information, 2023 - mdpi.com
In practical applications, the accuracy of domain terminology translation is an important
criterion for the performance evaluation of domain machine translation models. Aiming at the …

Revisiting knowledge distillation for autoregressive language models

Q Zhong, L Ding, L Shen, J Liu, B Du, D Tao - arXiv preprint arXiv …, 2024 - arxiv.org
Knowledge distillation (KD) is a common approach to compress a teacher model to reduce
its inference cost and memory footprint, by training a smaller student model. However, in the …