Contrastive preference optimization: Pushing the boundaries of llm performance in machine translation

H Xu, A Sharaf, Y Chen, W Tan, L Shen… - arXiv preprint arXiv …, 2024 - arxiv.org
Moderate-sized large language models (LLMs)--those with 7B or 13B parameters--exhibit
promising machine translation (MT) performance. However, even the top-performing 13B …

Automated summarization of multiple document abstracts and contents using large language models

O Langston, B Ashford - Authorea Preprints, 2024 - techrxiv.org
The exponential growth of textual data across various domains necessitates the
development of efficient and accurate summarization techniques to facilitate quick …

Mathematical foundations of hallucination in transformer-based large language models for improvisation

A Gundogmusler, F Bayindiroglu… - Authorea Preprints, 2024 - techrxiv.org
Transformer architectures have revolutionized natural language processing through their
ability to handle longrange dependencies and generate contextually coherent text. Despite …

Monolingual or multilingual instruction tuning: Which makes a better alpaca

P Chen, S Ji, N Bogoychev, A Kutuzov… - arXiv preprint arXiv …, 2023 - arxiv.org
Foundational large language models (LLMs) can be instruction-tuned to perform open-
domain question answering, facilitating applications like chat assistants. While such efforts …

Automated methodologies for evaluating lying, hallucinations, and bias in large language models

G Ecurali, Z Thackeray - 2024 - researchsquare.com
As large language models become integral to various applications, ensuring the reliability
and impartiality of their outputs is of paramount importance. The proposed methodologies for …

Multilingual large language model: A survey of resources, taxonomy and frontiers

L Qin, Q Chen, Y Zhou, Z Chen, Y Li, L Liao… - arXiv preprint arXiv …, 2024 - arxiv.org
Multilingual Large Language Models are capable of using powerful Large Language
Models to handle and respond to queries in multiple languages, which achieves remarkable …

Assessing hallucination risks in large language models through internal state analysis

P Zablocki, Z Gajewska - ESS Open Archive eprints, 2024 - authorea.com
Natural language processing models, particularly those based on deep learning
architectures, have demonstrated remarkable capabilities in generating coherent and …

An empirical automated evaluation and analysis of symmetrical reasoning in large language models

S Yamamoto, K Kobayashi, R Tanaka - Authorea Preprints, 2024 - techrxiv.org
Artificial intelligence systems increasingly require robust evaluation techniques to ensure
their logical reasoning capabilities align with practical applications. This research introduces …

Mitigating structural hallucination in large language models with local diffusion

K Kiritani, T Kayano - 2024 - researchsquare.com
Large language models (LLMs) often produce text with inaccuracies, logical inconsistencies,
or fabricated information, known as structural hallucinations, which undermine their …

[PDF][PDF] Assessing the response strategies of large language models under uncertainty: A comparative study using prompt engineering

N Lainwright, M Pemberton - 2024 - files.osf.io
The ability of artificial intelligence to understand and generate human language has
transformed various applications, enhancing interactions and decision-making processes …