Pangu-agent: A fine-tunable generalist agent with structured reasoning. F Christianos, G Papoudakis, M Zimmer, T Coste, Z Wu, J Chen, ... arXiv preprint arXiv:2312.14878, 2023 | 8 | 2023 |
Bayesian Reward Models for LLM Alignment. AX Yang, M Robeyns, T Coste, J Wang, H Bou-Ammar, L Aitchison. ICLR 2024 Workshop on Secure and Trustworthy Large Language Models, 2024 | 7 | 2024 |