CultureLLM: Incorporating cultural differences into large language models

C Li, M Chen, J Wang, S Sitaram, X Xie - arXiv preprint arXiv:2402.10946, 2024 - arxiv.org
Large language models (LLMs) are reported to be partial to certain cultures owing to the
training data dominance from the English corpora. Since multilingual cultural data are often …

Sabiá-2: A New Generation of Portuguese Large Language Models

TS Almeida, H Abonizio, R Nogueira… - arXiv preprint arXiv …, 2024 - arxiv.org
We introduce Sabiá-2, a family of large language models trained on Portuguese texts. The
models are evaluated on a diverse range of exams, including entry-level tests for Brazilian …

CRAFT: Extracting and Tuning Cultural Instructions from the Wild

B Wang, G Lin, Z Liu, C Wei, NF Chen - arXiv preprint arXiv:2405.03138, 2024 - arxiv.org
Large language models (LLMs) have rapidly evolved as the foundation of various natural
language processing (NLP) applications. Despite their wide use cases, their understanding …

CulturePark: Boosting Cross-cultural Understanding in Large Language Models

C Li, D Teney, L Yang, Q Wen, X Xie… - arXiv preprint arXiv …, 2024 - arxiv.org
Cultural bias is pervasive in many large language models (LLMs), largely due to the
deficiency of data representative of different cultures. Typically, cultural datasets and …

Survey of Cultural Awareness in Language Models: Text and Beyond

S Pawar, J Park, J Jin, A Arora, J Myung… - arXiv preprint arXiv …, 2024 - arxiv.org
Large-scale deployment of large language models (LLMs) in various applications, such as
chatbots and virtual assistants, requires LLMs to be culturally sensitive to the user to ensure …

PersianRAG: A Retrieval-Augmented Generation System for Persian Language

H Hosseini, MS Zare, AH Mohammadi… - arXiv preprint arXiv …, 2024 - arxiv.org
Retrieval augmented generation (RAG) models, which integrate large-scale pre-trained
generative models with external retrieval mechanisms, have shown significant success in …

PerkwE_COQA: enhance Persian Conversational Question Answering by combining contextual keyword extraction with Large Language Models

P Moradbeiki, N Ghadiri - arXiv preprint arXiv:2404.05406, 2024 - arxiv.org
Smart cities need the involvement of their residents to enhance quality of life. Conversational
query-answering is an emerging approach for user engagement. There is an increasing …

PsychoLex: Unveiling the Psychological Mind of Large Language Models

MA Abbasi, FS Mirnezami, H Naderi - arXiv preprint arXiv:2408.08848, 2024 - arxiv.org
This paper explores the intersection of psychology and artificial intelligence through the
development and evaluation of specialized Large Language Models (LLMs). We introduce …

Fine Tuning LLMs for Low Resource Languages

S Joshi, MS Khan, A Dafe, K Singh… - … on Image Processing …, 2024 - ieeexplore.ieee.org
Large Language Models (LLMs) hold immense potential, but their data hunger can limit their
performance in processing languages with limited resources. This research study explores …

SynTran-fa: Generating Comprehensive Answers for Farsi QA Pairs via Syntactic Transformation

F Farsi, S Sabouri, K Kashfipour, S Gooran, H Sameti… - 2024 - preprints.org
Generating coherent and comprehensive responses remains a significant challenge for
Question-Answering (QA) systems when working with short answers, especially for low …