Bag of Tricks to Boost Adversarial Transferability Z Zhang, R Zhu, W Yao, X Wang, C Xu arXiv preprint arXiv:2401.08734, 2024 | 6 | 2024 |
Random Smooth-based Certified Defense against Text Adversarial Attack Z Zhang*, W Yao*, S Liang, C Xu EACL 2024 (Findings), 2024 | 5 | 2024 |
Towards tracing trustworthiness dynamics: Revisiting pre-training period of large language models C Qian*, J Zhang*, W Yao*, D Liu, Z Yin, Y Qiao, Y Liu, J Shao ACL 2024 (Findings), 2024 | 5 | 2024 |
Fair Scratch Tickets: Finding Fair Sparse Networks Without Weight Training P Tang*, W Yao*, Z Li, Y Liu CVPR 2023, 2023 | 5 | 2023 |
Super (ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization W Yang, S Shen, G Shen, W Yao, Y Liu, Z Gong, Y Lin arXiv preprint arXiv:2406.11431, 2024 | 2 | 2024 |
Understanding Fairness Surrogate Functions in Algorithmic Fairness W Yao*, Z Zhou*, Z Li, B Han, Y Liu TMLR 2024, 2023 | 1 | 2023 |
Understanding Model Ensemble in Transferable Adversarial Attack W Yao, Z Zhang, H Tang, Y Liu arXiv preprint arXiv:2410.06851, 2024 | | 2024 |
Robust Graph Recommendation via Noise-Aware Adversarial Perturbation J Tang, Z Sun, W Yao, X Chen DASFAA 2024, 2024 | | 2024 |