Training language models to follow instructions with human feedback L Ouyang, J Wu, X Jiang, D Almeida, C Wainwright, P Mishkin, C Zhang, ... Advances in neural information processing systems 35, 27730-27744, 2022 | 7718 | 2022 |
GPT-4 Technical Report OpenAI https://arxiv.org/abs/2303.08774, 2023 | 3082* | 2023 |
A holistic approach to undesired content detection in the real world T Markov, C Zhang, S Agarwal, FE Nekoul, T Lee, S Adler, A Jiang, ... Proceedings of the AAAI Conference on Artificial Intelligence 37 (12), 15009 …, 2023 | 110 | 2023 |
An Efficient Adversarial Attack for Tree Ensembles C Zhang, H Zhang, CJ Hsieh Advances in Neural Information Processing Systems (NeurIPS) 2020, 2020 | 31 | 2020 |
GPT-4V(ision) System Card OpenAI https://cdn.openai.com/papers/GPTV_System_Card.pdf, 2023 | 17 | 2023 |
New and improved content moderation tooling T Markov, C Zhang, S Agarwal, T Eloundou, T Lee, S Adler, A Jiang, ... OpenAI. https://openai.com/blog/new-and-improved-content-moderation-tooling …, 2022 | 14 | 2022 |
Double Perturbation: On the Robustness of Robustness and Counterfactual Bias Evaluation C Zhang, J Zhao, H Zhang, KW Chang, CJ Hsieh NAACL 2021, 2021 | 11 | 2021 |
On the Robustness of Robustness and Counterfactual Bias Evaluation C Zhang University of California, Los Angeles, 2021 | | 2021 |