Hive: Harnessing human feedback for instructional visual editing S Zhang, X Yang, Y Feng, C Qin, CC Chen, N Yu, Z Chen, H Wang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 53 | 2024 |
Bolaa: Benchmarking and orchestrating llm-augmented autonomous agents Z Liu, W Yao, J Zhang, L Xue, S Heinecke, R Murthy, Y Feng, Z Chen, ... arXiv preprint arXiv:2308.05960, 2023 | 47 | 2023 |
High resolution face completion with multiple controllable attributes via fully end-to-end progressive generative adversarial networks Z Chen, S Nie, T Wu, CG Healey arXiv preprint arXiv:1801.07632 1 (4), 6, 2018 | 46 | 2018 |
Retroformer: Retrospective large language agents with policy gradient optimization W Yao, S Heinecke, JC Niebles, Z Liu, Y Feng, L Xue, R Murthy, Z Chen, ... arXiv preprint arXiv:2308.02151, 2023 | 35 | 2023 |
Tackling data heterogeneity in federated learning with class prototypes Y Dai, Z Chen, J Li, S Heinecke, L Sun, R Xu Proceedings of the AAAI Conference on Artificial Intelligence 37 (6), 7314-7322, 2023 | 34 | 2023 |
Gluegen: Plug and play multi-modal encoders for x-to-image generation C Qin, N Yu, C Xing, S Zhang, Z Chen, S Ermon, Y Fu, C Xiong, R Xu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 16 | 2023 |
LayoutDETR: detection transformer is a good multimodal layout designer N Yu, CC Chen, Z Chen, R Meng, G Wu, P Josel, JC Niebles, C Xiong, ... arXiv preprint arXiv:2212.09877, 2022 | 6 | 2022 |
Burn after reading: Online adaptation for cross-domain streaming data L Yang, M Gao, Z Chen, R Xu, A Shrivastava, C Ramaiah European Conference on Computer Vision, 404-422, 2022 | 6 | 2022 |
Robustness evaluation of transformer-based form field extractors via form attacks L Xue, M Gao, Z Chen, C Xiong, R Xu International Conference on Document Analysis and Recognition, 167-184, 2023 | 5 | 2023 |
Rex: Rapid exploration and exploitation for ai agents R Murthy, S Heinecke, JC Niebles, Z Liu, L Xue, W Yao, Y Feng, Z Chen, ... arXiv preprint arXiv:2307.08962, 2023 | 5 | 2023 |
Performance Characteristics of a Camera-Based Tangible Input Device for Manipulation of 3D Information. Z Chen, CG Healey, RS Amant Graphics Interface, 74-81, 2017 | 5 | 2017 |
Field extraction from forms with unlabeled data M Gao, Z Chen, N Naik, K Hashimoto, C Xiong, R Xu arXiv preprint arXiv:2110.04282, 2021 | 3 | 2021 |
Large image collection visualization using perception-based similarity with color features Z Chen, CG Healey Advances in Visual Computing: 12th International Symposium, ISVC 2016, Las …, 2016 | 2 | 2016 |
SQ-LLaVA: Self-Questioning for Large Vision-Language Assistant G Sun, C Qin, J Wang, Z Chen, R Xu, Z Tao arXiv preprint arXiv:2403.11299, 2024 | 1 | 2024 |
BOLAA: BENCHMARKING AND ORCHESTRATING LLM AUTONOMOUS AGENTS Z Liu, W Yao, J Zhang, L Xue, S Heinecke, RN Rithesh, Y Feng, Z Chen, ... ICLR 2024 Workshop on Large Language Model (LLM) Agents, 0 | | |
REX: Rapid Exploration and eXploitation for AI agents RN Rithesh, S Heinecke, JC Niebles, Z Liu, L Xue, W Yao, Y Feng, ... | | |
High Resolution and Fast Face Completion via Progressively Attentive GANs Z Chen, S Nie, T Wu, CG Healey | | |
Towards Controllable and Interpretable Face Completion via Structure-Aware and Frequency-Oriented Attentive GANs Z Chen, S Nie, T Wu, CG Healey | | |