Rubystar: A non-task-oriented mixture model dialog system. H Liu, T Lin, H Sun, W Lin, CW Chang, T Zhong, A Rudnicky. arXiv preprint arXiv:1711.02781, 2017 | 32 | 2017 |
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use. J Ye, Z Du, X Yao, W Lin, Y Xu, Z Chen, Z Wang, S Zhu, Z Xi, S Yuan, ... arXiv preprint arXiv:2501.02506, 2025 | | 2025 |