Agentbench: Evaluating llms as agents X Liu, H Yu, H Zhang, Y Xu, X Lei, H Lai, Y Gu, H Ding, K Men, K Yang, ... The Twelfth International Conference on Learning Representations., 2023 | 165 | 2023 |
AutoWebGLM: Bootstrap And Reinforce A Large Language Model-based Web Navigating Agent H Lai, X Liu, IL Iong, S Yao, Y Chen, P Shen, H Yu, H Zhang, X Zhang, ... arXiv preprint arXiv:2404.03648, 2024 | 8* | 2024 |
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents X Liu, T Zhang, Y Gu, IL Iong, Y Xu, X Song, S Zhang, H Lai, X Liu, H Zhao, ... arXiv preprint arXiv:2408.06327, 2024 | 1 | 2024 |