VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation B Zou, M Cai, J Zhang, YJ Lee arXiv preprint arXiv:2407.10972, 2024 | | 2024 |
Testing learning-enabled cyber-physical systems with large-language models: A formal approach X Zheng, AK Mok, R Piskac, YJ Lee, B Krishnamachari, D Zhu, ... Companion Proceedings of the 32nd ACM International Conference on the …, 2024 | 1 | 2024 |
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy X Li, C Mata, J Park, K Kahatapitiya, YS Jang, J Shang, K Ranasinghe, ... arXiv preprint arXiv:2406.20095, 2024 | | 2024 |
MATE: Meet At The Embedding--Connecting Images with Long Texts YK Jang, J Kang, YJ Lee, D Kim arXiv preprint arXiv:2407.09541, 2024 | | 2024 |
Yo'LLaVA: Your Personalized Language and Vision Assistant T Nguyen, H Liu, Y Li, M Cai, U Ojha, YJ Lee arXiv preprint arXiv:2406.09400, 2024 | | 2024 |
Matryoshka Multimodal Models M Cai, J Yang, J Gao, YJ Lee arXiv preprint arXiv:2405.17430, 2024 | 2 | 2024 |
LLaVA-PruMerge: Adaptive token reduction for efficient large multimodal models Y Shang, M Cai, B Xu, YJ Lee, Y Yan arXiv preprint arXiv:2403.15388, 2024 | 7 | 2024 |
LLM inference unveiled: Survey and roofline model insights Z Yuan, Y Shang, Y Zhou, Z Dong, C Xue, B Wu, Z Li, Q Gu, YJ Lee, ... arXiv preprint arXiv:2402.16363, 2024 | 12 | 2024 |
Cohere3D: Exploiting Temporal Coherence for Unsupervised Representation Learning of Vision-based Autonomous Driving Y Xie, H Chen, GP Meyer, YJ Lee, EM Wolff, M Tomizuka, W Zhan, Y Chai, ... arXiv preprint arXiv:2402.15583, 2024 | | 2024 |
Method and system of using a global transformer for efficient modeling of global context in point clouds YJ Lee, H Liu US Patent 11,908,202, 2024 | | 2024 |
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples J Zhang, M Cai, T Xie, YJ Lee arXiv preprint arXiv:2402.13254, 2024 | | 2024 |
Visual instruction inversion: Image editing via image prompting T Nguyen, Y Li, U Ojha, YJ Lee Advances in Neural Information Processing Systems 36, 2024 | 19 | 2024 |
Visual instruction tuning H Liu, C Li, Q Wu, YJ Lee Advances in Neural Information Processing Systems 36, 2024 | 2374 | 2024 |
Segment everything everywhere all at once X Zou, J Yang, H Zhang, F Li, L Li, J Wang, L Wang, J Gao, YJ Lee Advances in Neural Information Processing Systems 36, 2024 | 316 | 2024 |
Investigating the catastrophic forgetting in multimodal large language model fine-tuning Y Zhai, S Tong, X Li, M Cai, Q Qu, YJ Lee, Y Ma Conference on Parsimony and Learning, 202-227, 2024 | 63* | 2024 |
LLaVA-NeXT: Improved reasoning, OCR, and world knowledge H Liu, C Li, Y Li, B Li, Y Zhang, S Shen, YJ Lee 2024 | 9 | 2024 |
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts M Cai, H Liu, SK Mustikovela, GP Meyer, Y Chai, D Park, YJ Lee Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 2 | 2024 |
LLaVA-NeXT: Improved reasoning, OCR, and world knowledge H Liu, C Li, Y Li, B Li, Y Zhang, S Shen, YJ Lee | 146 | 2024 |
Edit One for All: Interactive Batch Image Editing T Nguyen, U Ojha, Y Li, H Liu, YJ Lee Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | | 2024 |
Computer vision on the edge: Individual cattle identification in real-time with ReadMyCow system M Smink, H Liu, D Döpfer, YJ Lee Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | 4 | 2024 |