PAL: Program-aided Language Models L Gao*, A Madaan*, S Zhou*, U Alon, P Liu, Y Yang, J Callan, G Neubig ICML 2023, 2022 | 493 | 2022 |
WebArena: A realistic web environment for building autonomous agents S Zhou*, FF Xu*, H Zhu, X Zhou, R Lo, A Sridhar, X Cheng, T Ou, Y Bisk, ... ICLR 2024, 2023 | 161* | 2023 |
Language Models of Code are Few-shot Commonsense Learners A Madaan, S Zhou, U Alon, Y Yang, G Neubig EMNLP 2022, 2022 | 131 | 2022 |
DocPrompting: Generating Code by Retrieving the Docs S Zhou, U Alon, FF Xu, Z Wang, Z Jiang, G Neubig ICLR 2023, 2022 | 90* | 2022 |
Bridging the gap: A Survey on Integrating (Human) Feedback for Natural Language Generation P Fernandes, A Madaan, E Liu, A Farinhas, PH Martins, A Bertsch, ... TACL 2023, 2023 | 59 | 2023 |
Codebertscore: Evaluating Code Generation with Pretrained Models of Code S Zhou, U Alon, S Agarwal, G Neubig EMNLP 2023, 2023 | 46 | 2023 |
Soft Gazetteers for Low-resource Named Entity Recognition S Rijhwani, S Zhou, G Neubig, J Carbonell ACL 2020, 2020 | 45 | 2020 |
Execution-based evaluation for open-domain code generation Z Wang, S Zhou, D Fried, G Neubig Findings of EMNLP 2023, 2022 | 43 | 2022 |
Visualwebarena: Evaluating multimodal agents on realistic visual web tasks JY Koh, R Lo, L Jang, V Duvvur, MC Lim, PY Huang, G Neubig, S Zhou, ... arXiv preprint arXiv:2401.13649, 2024 | 40 | 2024 |
Mconala: a Benchmark for Code Generation from Multiple Natural Languages Z Wang, G Cuenca, S Zhou, FF Xu, G Neubig Findings of EACL 2023, 2022 | 34 | 2022 |
Improving Robustness of Neural Machine Translation with Multi-task Learning S Zhou, X Zeng, Y Zhou, A Anastasopoulos, G Neubig Proceedings of the Fourth Conference on Machine Translation (WMT), 2019 | 32 | 2019 |
Improving Candidate Generation for Low-resource Cross-lingual Entity Linking S Zhou, S Rijhwani, J Wieting, J Carbonell, G Neubig Transactions of the Association for Computational Linguistics (TACL) 8, 109-124, 2020 | 26 | 2020 |
Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web Data S Zhou, L Zhang, Y Yang, Q Lyu, P Yin, C Callison-Burch, G Neubig ACL 2022, 2022 | 23 | 2022 |
Towards Zero-resource Cross-lingual Entity Linking S Zhou, S Rijhwani, G Neubig DeepLo Workshop at EMNLP 2019, 2019 | 19 | 2019 |
Causal Reasoning of Entities and Events in Procedural Texts L Zhang, H Xu, Y Yang, S Zhou, W You, M Arora, C Callison-Burch Findings of EACL 2023, 2023 | 17 | 2023 |
Osworld: Benchmarking multimodal agents for open-ended tasks in real computer environments T Xie, D Zhang, J Chen, X Li, S Zhao, R Cao, TJ Hua, Z Cheng, D Shin, ... arXiv preprint arXiv:2404.07972, 2024 | 14 | 2024 |
Hierarchical Prompting Assists Large Language Model on Web Navigation A Sridhar, R Lo, FF Xu, H Zhu, S Zhou Findings of EMNLP 2023, 2023 | 12 | 2023 |
Aggregated Semantic Matching for Short Text Entity Linking F Nie, S Zhou, J Liu, J Wang, CY Lin, R Pan CoNLL 2018, 2018 | 11 | 2018 |
Procedures as Programs: Hierarchical Control of Situated Agents through Natural Language S Zhou, P Yin, G Neubig Structured and Unstructured Knowledge Integration Workshop at NAACL 2022, 2021 | 10 | 2021 |
Webarena: A realistic web environment for building autonomous agents. CoRR, abs/2307.13854, 2023. doi: 10.48550 S Zhou, FF Xu, H Zhu, X Zhou, R Lo, A Sridhar, X Cheng, Y Bisk, D Fried, ... arXiv preprint arXiv.2307.13854, 0 | 5 | |