Holistic evaluation of language models P Liang, R Bommasani, T Lee, D Tsipras, D Soylu, M Yasunaga, Y Zhang, ... TMLR, 2022 | 751 | 2022 |
When and Why Vision-Language Models Behave like Bags-Of-Words, and What to Do About It? M Yuksekgonul, F Bianchi, P Kalluri, D Jurafsky, J Zou Oral (Notable-Top-5%) @ ICLR, 2023 | 212* | 2023 |
GPT detectors are biased against non-native English writers W Liang*, M Yuksekgonul*, Y Mao*, E Wu*, J Zou Patterns, 2023 | 170 | 2023 |
A visual–language foundation model for pathology image analysis using medical twitter Z Huang, F Bianchi, M Yuksekgonul, TJ Montine, J Zou Nature medicine 29 (9), 2307-2316, 2023 | 144 | 2023 |
Post-hoc concept bottleneck models M Yuksekgonul, M Wang, J Zou Spotlight (Notable-Top-25%) @ ICLR, 2023 | 135 | 2023 |
Pretraining boosts out-of-domain robustness for pose estimation A Mathis, T Biasi, S Schneider, M Yuksekgonul, B Rogers, M Bethge, ... WACV, 1859-1868, 2021 | 134 | 2021 |
Meaningfully debugging model mistakes using conceptual counterfactual explanations A Abid, M Yuksekgonul, J Zou ICML, 66-88, 2022 | 79* | 2022 |
Discover and Cure: Concept-aware Mitigation of Spurious Correlation S Wu, M Yuksekgonul, L Zhang, J Zou ICML, 2023 | 33 | 2023 |
SkinCon: A skin disease dataset densely annotated by domain experts for fine-grained debugging and analysis R Daneshjou*, M Yuksekgonul*, ZR Cai, RA Novoa, J Zou NeurIPS, 2022 | 28 | 2022 |
Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models M Yuksekgonul, V Chandrasekaran, E Jones, S Gunasekar, R Naik, ... ICLR, 2024 | 12 | 2024 |
Beyond Confidence: Reliable Models Should Also Consider Atypicality M Yuksekgonul, L Zhang, J Zou, C Guestrin NeurIPS, 2023 | 11 | 2023 |
Diversity of thought improves reasoning abilities of large language models R Naik, V Chandrasekaran, M Yuksekgonul, H Palangi, B Nushi arXiv preprint arXiv:2310.07088, 2023 | 5 | 2023 |
ImageNet performance correlates with pose estimation robustness and generalization on out-of-domain data A Mathis, T Biasi, Y Mert, B Rogers, M Bethge, MW Mathis International Conference on Machine Learning 2020 - Workshop on Uncertainty …, 2020 | 5 | 2020 |
KITAB: Evaluating LLMs on Constraint Satisfaction for Information Retrieval MI Abdin, S Gunasekar, V Chandrasekaran, J Li, M Yuksekgonul, ... ICLR, 2024 | 4 | 2024 |
Learning prototypes for multiple instance learning ÖE Sivrikaya, M Yüksekgönül, MG BAYDOĞAN Turkish Journal of Electrical Engineering and Computer Sciences 29 (7), 2901 …, 2021 | 4 | 2021 |
How Well Can LLMs Negotiate? NegotiationArena Platform and Analysis F Bianchi, PJ Chia, M Yuksekgonul, J Tagliabue, D Jurafsky, J Zou ICML, 2024 | 3 | 2024 |
ChatGPT exhibits gender and racial biases in acute coronary syndrome management A Zhang, M Yuksekgonul, J Guild, J Zou, J Wu medRxiv, 2023.11. 14.23298525, 2023 | 3 | 2023 |
TextGrad: Automatic" Differentiation" via Text M Yuksekgonul, F Bianchi, J Boen, S Liu, Z Huang, C Guestrin, J Zou arXiv preprint arXiv:2406.07496, 2024 | | 2024 |