Understanding (mis) behavior on the eosio blockchain Y Huang, H Wang, L Wu, G Tyson, X Luo, R Zhang, X Liu, G Huang, ... Proceedings of the ACM on Measurement and Analysis of Computing Systems 4 (2 …, 2020 | 94* | 2020 |
Look before you leap: An exploratory study of uncertainty measurement for large language models Y Huang, J Song, Z Wang, H Chen, L Ma arXiv preprint arXiv:2307.10236, 2023 | 58 | 2023 |
A semantic-aware representation framework for online log analysis W Meng, Y Liu, Y Huang, S Zhang, F Zaiter, B Chen, D Pei 2020 29th International Conference on Computer Communications and Networks …, 2020 | 58 | 2020 |
LogClass: Anomalous Log Identification and Classification With Partial Labels W Meng, Y Liu, S Zhang, F Zaiter, Y Zhang, Y Huang, Z Yu, Y Zhang, ... IEEE Transactions on Network and Service Management 18 (2), 1870-1884, 2021 | 44 | 2021 |
Patchcensor: Patch robustness certification for transformers via exhaustive testing Y Huang, L Ma, Y Li ACM Transactions on Software Engineering and Methodology 32 (6), 1-34, 2023 | 18* | 2023 |
Summarizing unstructured logs in online services W Meng, F Zaiter, Y Huang, Y Liu, S Zhang, Y Zhang, Y Zhu, T Zhang, ... arXiv preprint arXiv:2012.08938, 2020 | 12 | 2020 |
Generation-based Differential Fuzzing for Deep Learning Libraries J Liu, Y Huang, Z Wang, L Ma, C Fang, M Gu, X Zhang, Z Chen ACM Transactions on Software Engineering and Methodology 33 (2), 1-28, 2023 | 8 | 2023 |
PromptCharm: Text-to-Image Generation through Multi-modal Prompting and Refinement Z Wang, Y Huang, D Song, L Ma, T Zhang Proceedings of the CHI Conference on Human Factors in Computing Systems, 1-21, 2024 | 7 | 2024 |
DeepLens: interactive out-of-distribution data detection in NLP models D Song, Z Wang, Y Huang, L Ma, T Zhang Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems …, 2023 | 6 | 2023 |
An Exploratory Study of AI System Risk Assessment from the Lens of Data Distribution and Uncertainty Z Wang, Y Huang, L Ma, H Yokoyama, S Tokumoto, K Munakata arXiv preprint arXiv:2212.06828, 2022 | 5 | 2022 |
Online Safety Analysis for LLMs: a Benchmark, an Assessment, and a Path Forward X Xie, J Song, Z Zhou, Y Huang, D Song, L Ma arXiv preprint arXiv:2404.08517, 2024 | 4 | 2024 |
Deepseer: Interactive rnn explanation and debugging via state abstraction Z Wang, Y Huang, D Song, L Ma, T Zhang Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems …, 2023 | 4 | 2023 |
LUNA: A Model-Based Universal Analysis Framework for Large Language Models D Song, X Xie, J Song, D Zhu, Y Huang, F Juefei-Xu, L Ma IEEE Transactions on Software Engineering, 2024 | 3 | 2024 |
TESTEVAL: Benchmarking Large Language Models for Test Case Generation W Wang, C Yang, Z Wang, Y Huang, Z Chu, D Song, L Zhang, AR Chen, ... arXiv preprint arXiv:2406.04531, 2024 | 3 | 2024 |
Multilingual Blending: LLM Safety Alignment Evaluation with Language Mixture J Song, Y Huang, Z Zhou, L Ma arXiv preprint arXiv:2407.07342, 2024 | 1 | 2024 |
Enhancing Fault Detection for Large Language Models via Mutation-Based Confidence Smoothing Q Hu, J Wen, M Cordy, Y Huang, X Xie, L Ma arXiv preprint arXiv:2404.14419, 2024 | 1 | 2024 |
An Empirical Study of Code Generation Errors made by Large Language Models D Song, Z Zhou, Z Wang, Y Huang, S Chen, B Kou, L Ma, T Zhang | 1 | 2023 |
Vortex under Ripplet: An Empirical Study of RAG-enabled Applications Y Shao, Y Huang, J Shen, L Ma, T Su, C Wan arXiv preprint arXiv:2407.05138, 2024 | | 2024 |
Where Do Large Language Models Fail When Generating Code? Z Wang, Z Zhou, D Song, Y Huang, S Chen, L Ma, T Zhang arXiv preprint arXiv:2406.08731, 2024 | | 2024 |
When Simulator Meets Natural Deviation: A Study on Deviations in Simulation-based ADS Testing R Wang, Z Wang, Y Huang, L Ma 2023 IEEE 34th International Symposium on Software Reliability Engineering …, 2023 | | 2023 |