Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine W Jiao, W Wang, J Huang, X Wang, S Shi, Z Tu arXiv:2301.08745, 2023 | 593* | 2023 |
Improving Adversarial Transferability via Neuron Attribution-Based Attacks J Zhang, W Wu, J Huang, Y Huang, W Wang, Y Su, MR Lyu CVPR, 2022 | 132 | 2022 |
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher Y Yuan, W Jiao, W Wang, J Huang, P He, S Shi, Z Tu ICLR, 2024 | 72 | 2024 |
ParroT: Translating during Chat using Large Language Models tuned with Human Translation and Feedback W Jiao, J Huang, W Wang, Z He, T Liang, X Wang, S Shi, Z Tu EMNLP Findings, 2023 | 43* | 2023 |
Improving the Transferability of Adversarial Samples by Path-Augmented Method J Zhang, J Huang, W Wang, Y Li, W Wu, X Wang, Y Su, MR Lyu CVPR, 2023 | 32 | 2023 |
InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews X Wang, Y Xiao, J Huang, S Yuan, R Xu, H Guo, Q Tu, Y Fei, Z Leng, ... ACL, 2024 | 25* | 2024 |
Revisiting the Reliability of Psychological Scales on Large Language Models J Huang, W Wang, MH Lam, EJ Li, W Jiao, MR Lyu arXiv:2305.19926, 2023 | 23* | 2023 |
Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench J Huang, MH Lam, EJ Li, S Ren, W Wang, W Jiao, Z Tu, MR Lyu arXiv:2308.03656, 2023 | 22 | 2023 |
MTTM: Metamorphic Testing for Textual Content Moderation Software W Wang, J Huang, W Wu, J Zhang, Y Huang, S Li, P He, MR Lyu ICSE, 2023 | 22* | 2023 |
All Languages Matter: On the Multilingual Safety of Large Language Models W Wang, Z Tu, C Chen, Y Yuan, J Huang, W Jiao, MR Lyu ACL Findings, 2024 | 21 | 2024 |
On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs J Huang, W Wang, EJ Li, MH Lam, S Ren, Y Yuan, W Jiao, Z Tu, MR Lyu ICLR, 2024 | 21* | 2024 |
AEON: A Method for Automatic Evaluation of NLP Test Cases J Huang, J Zhang, W Wang, P He, Y Su, MR Lyu ISSTA, 2022 | 19 | 2022 |
Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models W Wang, W Jiao, J Huang, R Dai, J Huang, Z Tu, MR Lyu ACL, 2024 | 14 | 2024 |
Tencent's Multilingual Machine Translation System for WMT22 Large-Scale African Languages W Jiao, Z Tu, J Li, W Wang, J Huang, S Shi WMT, 2022 | 13 | 2022 |
A & B == B & A: Triggering Logical Reasoning Failures in Large Language Models Y Wan, W Wang, Y Yang, Y Yuan, J Huang, P He, W Jiao, MR Lyu arXiv:2401.00757, 2024 | 7 | 2024 |
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments J Huang, EJ Li, MH Lam, T Liang, W Wang, Y Yuan, W Jiao, X Wang, Z Tu, ... arXiv:2403.11807, 2024 | 6 | 2024 |
An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software W Wang, J Huang, J Huang, C Chen, J Gu, P He, MR Lyu ASE, 2023 | 6 | 2023 |
Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models T Liang, Z He, J Huang, W Wang, W Jiao, R Wang, Y Yang, Z Tu, S Shi, ... arXiv:2310.20499, 2023 | 4 | 2023 |
A Unified Debugging Approach via LLM-Based Multi-Agent Synergy C Lee, CS Xia, J Huang, Z Zhu, L Zhang, MR Lyu arXiv:2404.17153, 2024 | 1 | 2024 |
The Earth is Flat? Unveiling Factual Errors in Large Language Models W Wang, J Shi, Z Tu, Y Yuan, J Huang, W Jiao, MR Lyu arXiv:2401.00761, 2024 | 1 | 2024 |