Moderating new waves of online hate with chain-of-thought reasoning in large language models N Vishwamitra, K Guo, FT Romit, I Ondracek, L Cheng, Z Zhao, H Hu 2024 IEEE Symposium on Security and Privacy (SP), 788-806, 2024 | 9 | 2024 |
Understanding the generalizability of hateful memes detection models against covid-19-related hateful memes K Cuo, W Zhao, M Jaden, V Vishwamitra, Z Zhao, H Hu International Conference on Machine Learning and Applications, 2022 | 7 | 2022 |
An investigation of large language models for real-world hate speech detection K Guo, A Hu, J Mu, Z Shi, Z Zhao, N Vishwamitra, H Hu 2023 International Conference on Machine Learning and Applications (ICMLA …, 2023 | 5 | 2023 |
Moderating Illicit Online Image Promotion for Unsafe User-Generated Content Games Using Large Vision-Language Models K Guo, A Utkarsh, W Ding, I Ondracek, Z Zhao, G Freeman, N Vishwamitra, ... 33rd USENIX Security Symposium (USENIX Security 24), 5787--5804, 2024 | 2 | 2024 |
Exploring Vulnerabilities in Voice Command Skills for Connected Vehicles W Ding, S Liao, K Guo, F Zhang, L Cheng, Z Zhao, H Hu International Conference on Security and Privacy in Cyber-Physical Systems …, 2023 | 2 | 2023 |
AI-Cybersecurity Education Through Designing AI-based Cyberharassment Detection Lab E Okpala, N Vishwamitra, K Guo, S Liao, L Cheng, H Hu, Y Wu, X Yuan, ... arXiv preprint arXiv:2405.08125, 2024 | 1 | 2024 |
Moderating Embodied Cyber Threats Using Generative AI K Guo, F Guo, H Hu arXiv preprint arXiv:2405.05928, 2024 | | 2024 |
Understanding and Analyzing COVID-19-related Online Hate Propagation Through Hateful Memes Shared on Twitter N Vishwamitra, K Guo, S Liao, J Mu, Z Ma, L Cheng, Z Zhao, H Hu Proceedings of the International Conference on Advances in Social Networks …, 2023 | | 2023 |
Understanding and Measuring Robustness of Vision and Language Multimodal Models N Vishwamitra, K Guo, H Hu, Z Zhao, L Cheng, F Luo Proceedings of the International Conference on Secure Knowledge Management …, 2023 | | 2023 |