Super-naturalinstructions: Generalization via declarative instructions on 1600+ nlp tasks Y Wang, S Mishra, P Alipoormolabashi, Y Kordi, A Mirzaei, A Arunkumar, ... arXiv preprint arXiv:2204.07705, 2022 | 423 | 2022 |
Promptaid: Prompt exploration, perturbation, testing and iteration using visual analytics for large language models A Mishra, U Soni, A Arunkumar, J Huang, BC Kwon, C Bryan arXiv preprint arXiv:2304.01964, 2023 | 183 | 2023 |
Sujan Reddy A, Sumanta Patro, Tanay Dixit, and Xudong Shen. 2022. Super-NaturalInstructions: Generalization via declarative instructions on 1600+ NLP tasks Y Wang, S Mishra, P Alipoormolabashi, Y Kordi, A Mirzaei, A Naik, ... Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 171 | 2022 |
Benchmarking generalization via in-context instructions on 1,600+ language tasks Y Wang, S Mishra, P Alipoormolabashi, Y Kordi, A Mirzaei, A Arunkumar, ... arXiv preprint arXiv:2204.07705 2, 2022 | 125 | 2022 |
Dqi: Measuring data quality in nlp S Mishra, A Arunkumar, B Sachdeva, C Bryan, C Baral arXiv preprint arXiv:2005.00816, 2020 | 30 | 2020 |
How robust are model rankings: A leaderboard customization approach for equitable evaluation S Mishra, A Arunkumar Proceedings of the AAAI conference on Artificial Intelligence 35 (15), 13561 …, 2021 | 24 | 2021 |
Our evaluation metric needs an update to encourage generalization S Mishra, A Arunkumar, C Bryan, C Baral arXiv preprint arXiv:2007.06898, 2020 | 21 | 2020 |
Real-time visual feedback for educative benchmark creation: A human-and-metric-in-the-loop workflow A Arunkumar, S Mishra, B Sachdeva, C Baral, C Bryan NeurIPS 2020 Workshop HAMLETS, 2020 | 6 | 2020 |
Bayesian modelling of alluvial diagram complexity A Arunkumar, S Ginjpalli, C Bryan 2021 IEEE Visualization Conference (VIS), 51-55, 2021 | 5 | 2021 |
Dqi: A guide to benchmark evaluation S Mishra, A Arunkumar, B Sachdeva, C Bryan, C Baral arXiv preprint arXiv:2008.03964, 2020 | 5 | 2020 |
Image or Information? Examining the Nature and Impact of Visualization Perceptual Classification A Arunkumar, L Padilla, GY Bae, C Bryan IEEE Transactions on Visualization and Computer Graphics, 2023 | 4 | 2023 |
LINGO: visually debiasing natural language instructions to support task diversity A Arunkumar, S Sharma, R Agrawal, S Chandrasekaran, C Bryan Computer Graphics Forum 42 (3), 409-421, 2023 | 4 | 2023 |
Pmu tracker: A visualization platform for epicentric event propagation analysis in the power grid A Arunkumar, A Pinceti, L Sankar, C Bryan IEEE Transactions on Visualization and Computer Graphics 29 (1), 1081-1090, 2022 | 4 | 2022 |
A proposal to study" is high quality data all we need?" S Mishra, A Arunkumar arXiv preprint arXiv:2203.06404, 2022 | 4 | 2022 |
Real-time visual feedback to guide benchmark creation: A human-and-metric-in-the-loop workflow A Arunkumar, S Mishra, B Sachdeva, C Baral, C Bryan arXiv preprint arXiv:2302.04434, 2023 | 2 | 2023 |
A Survey of Parameters Associated with the Quality of Benchmarks in NLP S Mishra, A Arunkumar, C Bryan, C Baral arXiv preprint arXiv:2210.07566, 2022 | 2 | 2022 |
rushang karia, Savan Doshi, Shailaja Keyur Sampat, Siddhartha Mishra, Sujan Reddy A, Sumanta Patro, Tanay Dixit, Xudong Shen, Chitta Baral, Yejin Choi, Noah A. Smith, Hannaneh … Y Wang Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 2 | 2022 |
Hardness of samples need to be quantified for a reliable evaluation system: Exploring potential opportunities with a new task S Mishra, A Arunkumar, C Bryan, C Baral arXiv preprint arXiv:2210.07631, 2022 | 1 | 2022 |
Investigating the failure modes of the auc metric and exploring alternatives for evaluating systems in safety critical applications S Mishra, A Arunkumar, C Baral arXiv preprint arXiv:2210.04466, 2022 | 1 | 2022 |
PMUVis: A Large-Scale Platform to Assist Power System Operators in a Smart Grid A Arunkumar, N Gupta, A Pinceti, L Sankar, C Bryan IEEE Computer Graphics and Applications 42 (6), 84-95, 2022 | 1 | 2022 |