Learning to play text-based adventure games with maximum entropy reinforcement learning

W Li, R Devidze, S Fellenz - Joint European Conference on Machine …, 2023 - Springer
Text-based adventure games are a popular testbed for language based reinforcement
learning (RL). In previous work, deep Q-learning is most often used as the learning agent. Q …

When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives

Y Hu, K Song, S Cho, X Wang, W Yao… - arXiv preprint arXiv …, 2024 - arxiv.org
Reasoning is most powerful when an LLM accurately aggregates relevant information. We
examine the critical role of information aggregation in reasoning by requiring the LLM to …