Leveraging large language models for nlg evaluation: Advances and challenges

Z Li, X Xu, T Shen, C Xu, JC Gu, Y Lai… - Proceedings of the …, 2024 - aclanthology.org
In the rapidly evolving domain of Natural Language Generation (NLG) evaluation,
introducing Large Language Models (LLMs) has opened new avenues for assessing …

Data augmentation using llms: Data perspectives, learning paradigms and challenges

B Ding, C Qin, R Zhao, T Luo, X Li… - Findings of the …, 2024 - aclanthology.org
In the rapidly evolving field of large language models (LLMs), data augmentation (DA) has
emerged as a pivotal technique for enhancing model performance by diversifying training …

Leveraging large language models for nlg evaluation: A survey

Z Li, X Xu, T Shen, C Xu, JC Gu, C Tao - arXiv e-prints, 2024 - ui.adsabs.harvard.edu
In the rapidly evolving domain of Natural Language Generation (NLG) evaluation,
introducing Large Language Models (LLMs) has opened new avenues for assessing …

Unifying Bias and Unfairness in Information Retrieval: A Survey of Challenges and Opportunities with Large Language Models

S Dai, C Xu, S Xu, L Pang, Z Dong, J Xu - arXiv preprint arXiv:2404.11457, 2024 - arxiv.org
With the rapid advancement of large language models (LLMs), information retrieval (IR)
systems, such as search engines and recommender systems, have undergone a significant …

Symbolic task inference in deep reinforcement learning

H Hasanbeig, NY Jeppu, A Abate, T Melham… - Journal of Artificial …, 2024 - jair.org
This paper proposes DeepSynth, a method for effective training of deep reinforcement
learning agents when the reward is sparse or non-Markovian, but at the same time progress …

Clave: An adaptive framework for evaluating values of llm generated responses

J Yao, X Yi, X Xie - arXiv preprint arXiv:2407.10725, 2024 - arxiv.org
The rapid progress in Large Language Models (LLMs) poses potential risks such as
generating unethical content. Assessing LLMs' values can help expose their misalignment …

A prefrontal cortex-inspired architecture for planning in Large Language Models

TW Webb, SS Mondal, C Wang, B Krabach… - CoRR, 2023 - openreview.net
Large language models (LLMs) demonstrate impressive performance on a wide variety of
tasks, but they often struggle with tasks that require multi-step reasoning or goal-directed …

Prompt and Prejudice

L Berlincioni, L Cultrera, F Becattini, M Bertini… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper investigates the impact of using first names in Large Language Models (LLMs)
and Vision Language Models (VLMs), particularly when prompted with ethical decision …

Grade Like a Human: Rethinking Automated Assessment with Large Language Models

W Xie, J Niu, CJ Xue, N Guan - arXiv preprint arXiv:2405.19694, 2024 - arxiv.org
While large language models (LLMs) have been used for automated grading, they have not
yet achieved the same level of performance as humans, especially when it comes to grading …

Improving Planning with Large Language Models: A Modular Agentic Architecture

SS Mondal, TW Webb, I Momennejad - openreview.net
Large language models (LLMs) demonstrate impressive performance on a wide variety of
tasks, but they often struggle with tasks that require multi-step reasoning or goal-directed …