LexMatcher: Dictionary-centric Data Curation for LLM-based Machine Translation

Y Yin, J Zeng, Y Li, F Meng… - Findings of the Association …, 2024 - aclanthology.org
The fine-tuning of open-source large language models (LLMs) for machine translation has
recently received considerable attention, marking a shift towards data-centric research from …

Towards the Machine Translation of Scientific Neologisms

P Lerner, F Yvon - 2025 - hal.science
Scientific research continually discovers and invents new concepts, which are then referred
to by new terms, neologisms, or neonyms in this context. As the vast majority of publications …

LexMatcher: Dictionary-centric Data Collection for LLM-based Machine Translation

Y Yin, J Zeng, Y Li, F Meng, Y Zhang - arXiv preprint arXiv:2406.01441, 2024 - arxiv.org
The fine-tuning of open-source large language models (LLMs) for machine translation has
recently received considerable attention, marking a shift towards data-centric research from …

HarmonicEval: Multi-modal, Multi-task, Multi-criteria Automatic Evaluation Using a Vision Language Model

M Ohi, M Kaneko, N Okazaki, N Inoue - arXiv preprint arXiv:2412.14613, 2024 - arxiv.org
Vision-language models (VLMs) have shown impressive abilities in text and image
understanding. However, existing metrics for evaluating the text generated by VLMs focus …

ReFF: Reinforcing Format Faithfulness in Language Models across Varied Tasks

J Yao, H Huang, Z Liu, H Wen, W Su, B Qian… - arXiv preprint arXiv …, 2024 - arxiv.org
Following formatting instructions to generate well-structured content is a fundamental yet
often unmet capability for large language models (LLMs). To study this capability, which we …

ZeUS: An Unified Training Framework for Constrained Neural Machine Translation

M Yang - IEEE Access, 2024 - ieeexplore.ieee.org
Unlike general translation, constrained translation necessitates the proper use of predefined
restrictions, such as specific terminologies and entities, during the translation process …

Vers la traduction automatique des néologismes scientifiques

P Lerner, F Yvon - Actes de JEP-TALN-RECITAL 2024. 31ème …, 2024 - inria.hal.science
Scientific research continually discovers and invents new concepts, which are
then referred to by new terms, neologisms, or neonyms in this …

Bridging the Gap Between Wikipedians and Scientists with Terminology-Aware Translation: A Case Study in Turkish

GG Şahin - openreview.net
This project addresses the gap between the escalating volume of English-to-Turkish
Wikipedia translations and the insufficient number of contributors, particularly in technical …

大規模言語モデル時代の機械翻訳の展望

東山翔平 - jxiv.jst.go.jp
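Prospects for Machine Translation in the Era of Large Language Models. 東山翔平 (corresponding author),
Universal Communication Research Institute, National Institute of Information and Communications Technology …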